Skip to content
View shaneaevans's full-sized avatar

Organizations

@scrapinghub @scrapy

Block or report shaneaevans

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Scrapy Training companion code

Python 173 46 Updated Jan 30, 2019

💫 Industrial-strength Natural Language Processing (NLP) in Python

Python 30,609 4,425 Updated Dec 16, 2024

WarcMiddleware lets users seamlessly download a mirror copy of a website when running a web crawl with the Python web crawler Scrapy.

Python 45 10 Updated Mar 19, 2018

Convert Javascript code to an XML document

Python 186 23 Updated Mar 14, 2022

Visual scraping for Scrapy

Python 9,338 1,401 Updated Jun 26, 2024

Blog crawler for the blogforever project.

Python 22 5 Updated Jan 31, 2014

MessagePack serializer implementation for Java / msgpack.org[Java]

Java 1,422 321 Updated Jan 5, 2025

A client interface for Scrapinghub's API

Python 203 63 Updated Dec 16, 2024

Deprecated HubStorage client library - please use python-scrapinghub>=1.9.0 instead

Python 16 12 Updated Dec 5, 2016

Erlang LSM BTree Storage

Erlang 307 58 Updated Aug 7, 2016

Erlang/OTP application for accessing Amazon S3

Erlang 1 Updated Sep 8, 2011

Python wrapper for extended filesystem attributes

Python 199 40 Updated Jan 6, 2025

sworkflow born inside mydeco to help us solve data processing flows quickly. It's a library to help you create data workflows using Tasks.

Python 3 1 Updated Apr 2, 2013

A pure-python HTML screen-scraping library

HTML 1,868 272 Updated Apr 4, 2022

Python library of web-related functions

Python 394 105 Updated Oct 16, 2024

Python bindings for the snappy google library

Python 479 106 Updated Oct 16, 2024

Prospective search for python

Python 26 3 Updated Dec 4, 2012

Scrapy crawler for the average weather conditions of cities on the BBC Weather Centre

Python 3 2 Updated Dec 14, 2010

MochiWeb is an Erlang library for building lightweight HTTP servers.

Erlang 1,874 474 Updated Mar 21, 2024

Scrapy, a fast high-level web crawling & scraping framework for Python.

Python 53,718 10,610 Updated Jan 7, 2025

Erlang/OTP application for accessing Amazon S3

Erlang 31 18 Updated Sep 30, 2010

a Map/Reduce framework for distributed computing

Erlang 1,631 241 Updated Jan 30, 2018