The fastest, most efficient web crawler and scraper written in Rust.

DimitriTimoz/spider

Spider

Website | Guides | API Docs | Chat

The fastest web crawler and indexer. Foundational building blocks for data curation workloads.

  • Concurrent
  • Streaming
  • Decentralization
  • Headless Chrome Rendering
  • HTTP Proxies
  • Cron Jobs
  • Subscriptions
  • Smart Mode
  • Blacklisting, Whitelisting, and Budgeting Depth
  • Dynamic AI Prompt Scripting Headless with Step Caching
  • CSS Scraping with spider_utils
  • Changelog
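
To make the blacklisting and depth-budgeting ideas above concrete, here is a minimal, std-only sketch. It is not the spider API: the `crawl` function, its parameters, and the in-memory link graph are all hypothetical stand-ins for illustration, and real crawling would fetch pages over HTTP instead of reading a map.

```rust
use std::collections::{HashMap, HashSet, VecDeque};

/// Breadth-first crawl over an in-memory link graph (a stand-in for HTTP fetches).
/// `blacklist` holds substrings; any URL containing one is skipped (hypothetical rule).
/// `max_depth` is the depth budget, with the start page at depth 0.
fn crawl(
    graph: &HashMap<&str, Vec<&str>>,
    start: &str,
    blacklist: &[&str],
    max_depth: usize,
) -> Vec<String> {
    let mut visited: HashSet<String> = HashSet::new();
    let mut order: Vec<String> = Vec::new();
    let mut queue: VecDeque<(String, usize)> = VecDeque::new();
    queue.push_back((start.to_string(), 0));

    while let Some((url, depth)) = queue.pop_front() {
        // Blacklisting: drop any URL that matches a blocked pattern.
        if blacklist.iter().any(|pat| url.contains(*pat)) {
            continue;
        }
        // Visit each URL at most once.
        if !visited.insert(url.clone()) {
            continue;
        }
        order.push(url.clone());
        // Depth budget: do not follow links from pages at the limit.
        if depth == max_depth {
            continue;
        }
        if let Some(links) = graph.get(url.as_str()) {
            for link in links {
                queue.push_back((link.to_string(), depth + 1));
            }
        }
    }
    order
}

fn main() {
    // Assumed example site; the URLs are placeholders.
    let mut graph: HashMap<&str, Vec<&str>> = HashMap::new();
    graph.insert(
        "https://example.com/",
        vec!["https://example.com/docs", "https://example.com/admin"],
    );
    graph.insert("https://example.com/docs", vec!["https://example.com/docs/api"]);
    graph.insert("https://example.com/docs/api", vec!["https://example.com/docs/api/v2"]);

    // Skip anything under /admin and stop following links past depth 2.
    for page in crawl(&graph, "https://example.com/", &["/admin"], 2) {
        println!("{page}");
    }
}
```

With this graph, the `/admin` page is filtered out and `/docs/api/v2` is never reached because it sits at depth 3. See the spider directory and the API docs for the crate's actual configuration options.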

Getting Started

The simplest way to get started is to use the Spider Cloud hosted service. For local installation, see the spider or spider_cli directory. Spider is also available for Node.js through spider-nodejs and for Python through spider-py.

Benchmarks

See BENCHMARKS.

Examples

See EXAMPLES.

License

This project is licensed under the MIT license.

Contributing

See CONTRIBUTING.
