Zyte
A stub implementation of a subset of Zyte API
By default, Twisted does not ship with a CONNECT-enabled HTTP(S) proxy. This code provides one.
An interactive TLS-capable intercepting HTTP proxy for penetration testers and software developers.
🎭 Playwright integration for Scrapy
Zyte Smart Proxy Manager (formerly Crawlera) middleware for Scrapy
https://mimesniff.spec.whatwg.org/ implementation for Python
Scrapy extension for monitoring spider execution.
Spider templates for automatic crawlers.
Removes DIVs and styling, and normalizes HTML while preserving structural information
A pure-Python robots.txt parser with support for modern conventions.
Contains the common item definitions used in Zyte.
Library to populate items using XPath and CSS with a convenient API
Collection of persistent (disk-based) and non-persistent (memory-based) queues for Python
Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors
Scrapy, a fast high-level web crawling & scraping framework for Python.
kchmviewer is a CHM (Winhelp) file viewer written with Qt/KDE. It can be built as a standalone Qt-based application or as a KDE application. The main point of kchmviewer is compatibility with non-Engl…