Saturday, June 14, 2025
Show HN: AnyCrawl v0.0.1-alpha.5 – custom user-agent and richer scraping API https://ift.tt/vosCgP0
Show HN: AnyCrawl v0.0.1-alpha.5 – custom user-agent and richer scraping API ## [0.0.1-alpha.5] - 2025-06-14 ### Added - Integrated AWS S3 storage support with new `S3` class and environment variables for seamless file uploads and retrievals. - Introduced `FileController` for serving files from S3 or local storage with robust path validation and error handling. - Added multiple content transformers (Screenshot, `HTMLTransformer`) improving HTML/Markdown extraction and screenshot generation. - Extended scraping capabilities with new options: output `formats`, `timeout`, tag filtering, `wait_for`, retry strategy, viewport configuration, and custom user-agent support. - Added Safe Search parameter to `SearchSchema` for filtered search results. - Refactored engine architecture with a factory pattern and new core modules for configuration validation, data extraction, and job management. - Implemented graceful shutdown handling for the API server and improved logging for uncaught exceptions / unhandled rejections. - Added Jest configuration for API and library packages with ESM support and updated test scripts. - Updated CI workflows to publish Docker images on version tags. - Expanded README with detailed environment variable descriptions and API usage examples. ### Changed - Refined error handling in `ScrapeController` and `JobManager`; failure responses now include structured error objects and HTTP status codes. - Enhanced `BaseEngine` with explicit HTTP error checks and resilience improvements. - Updated OpenAPI documentation to reflect new scraping parameters and error formats. - Migrated key-value store name to environment configuration for greater flexibility. - Enhanced per-request credit tracking in `ScrapeController` and enhanced logging middleware to include credit usage. ### Fixed - Improved job failure messages to include detailed error data, ensuring clearer debugging information. - Minor documentation corrections and clarifications. https://ift.tt/ix8GSuJ June 14, 2025 at 11:18PM
Subscribe to:
Post Comments (Atom)
Show HN: I Built a Pocket OS with JavaScript, Electron, and Gemini https://ift.tt/aCzVxIL
Show HN: I Built a Pocket OS with JavaScript, Electron, and Gemini Welcome, traveler, to a new era of digital sovereignty. Oopis-OS has alwa...
-
Show HN: High school robotics code/CAD/design binder release Hello HN! My name is Patrick, and I am a junior at my High School’s FRC robotic...
-
Show HN: D&D meets Siri – Interactive voice adventure Hey HN! I've been building tooling for voice-driven apps over the past few mon...
-
Show HN: I Made an AI Social Media Manager to Automate Content Creation Hey HN, I am a Solopreneur, and I love building apps to automate bor...
No comments:
Post a Comment