wRAR

Organizations

@scrapinghub @scrapy @zytedata

A stub implementation of a subset of Zyte API

Python 2 Updated Apr 22, 2025

Websites for testing spiders

Python 3 Updated Apr 18, 2025

Default Twisted does not ship with a CONNECT-enabled HTTP(s) proxy. This code provides one.

Python 51 21 Updated Feb 21, 2017

An interactive TLS-capable intercepting HTTP proxy for penetration testers and software developers.

Python 38,820 4,162 Updated Apr 22, 2025

Python client for Zyte API

Python 24 5 Updated Apr 5, 2025

🎭 Playwright integration for Scrapy

Python 1,154 129 Updated Feb 22, 2025

Zyte Smart Proxy Manager (formerly Crawlera) middleware for Scrapy

Python 364 89 Updated Mar 24, 2025

Zyte API integration for Scrapy

Python 38 21 Updated Apr 9, 2025

https://mimesniff.spec.whatwg.org/ implementation for Python

Python 13 2 Updated Jan 16, 2024
Python 14 15 Updated Apr 22, 2025

Scrapy Extension for monitoring spiders execution.

Python 540 100 Updated Apr 11, 2025

Spider templates for automatic crawlers.

Python 28 4 Updated Mar 31, 2025

Remove DIVs, style stuff and normalize HTML preserving structure information

Python 10 2 Updated Feb 10, 2025

A pure-Python robots.txt parser with support for modern conventions.

DIGITAL Command Language 65 28 Updated Mar 24, 2025

Contains the common item definitions used in Zyte.

Python 9 9 Updated Apr 3, 2025
HTML 12 3 Updated Feb 5, 2025

Page Object pattern for Scrapy

Python 121 28 Updated Feb 12, 2025

Web scraping Page Objects core library

Python 99 15 Updated Feb 10, 2025

Common interface for data container classes

Python 67 12 Updated Mar 24, 2025

Library to populate items using XPath and CSS with a convenient API

Python 48 16 Updated Mar 24, 2025

Collection of persistent (disk-based) and non-persistent (memory-based) queues for Python

Python 277 55 Updated Mar 31, 2025

CSS Selectors for Python

Python 296 61 Updated Mar 24, 2025

Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors

Python 1,218 152 Updated Mar 31, 2025

Python library of web-related functions

Python 403 107 Updated Mar 24, 2025

Scrapy, a fast high-level web crawling & scraping framework for Python.

Python 54,971 10,758 Updated Apr 22, 2025

Complete lxml external type annotation

Python 58 7 Updated Apr 19, 2025

kchmviewer is a CHM (Winhelp) file viewer written with Qt/KDE. It can be built as a standalone Qt-based application or as a KDE application. The main point of kchmviewer is compatibility with non-Engl…

C++ 78 19 Updated May 18, 2024