Html Agility Pack (HAP) is a free and open-source HTML parser written in C# to read/write DOM and supports plain XPATH or XSLT. It is a .NET code library that allows you to parse "out of the web" HTML files.
-
Updated
Sep 30, 2024 - C#
Html Agility Pack (HAP) is a free and open-source HTML parser written in C# to read/write DOM and supports plain XPATH or XSLT. It is a .NET code library that allows you to parse "out of the web" HTML files.
DotnetCrawler is a straightforward, lightweight web crawling/scrapying library for Entity Framework Core output based on dotnet core. This library designed like other strong crawler libraries like WebMagic and Scrapy but for enabling extandable your custom requirements. Medium link : https://medium.com/@mehmetozkaya/creating-custom-web-crawler-w…
A rule based HTML sanitizer built on top of the HTML Agility pack
Stateful programmatic web browsing, based on Python-Mechanize, which is based on Andy Lester’s Perl module WWW::Mechanize.
An agile HTML parser that builds a read/write DOM and supports plain XPATH or XSLT. Deprecated as there's new maintainer for original HAP project.
Because Chegg only gave few days to access their solutions for free when I rent their books so I decided to create a web crawler to save all the needed solutions offline.
Emulated Bloomberg API with real time stock changes
Aggregator of news from 55+ sources. Clean Architecture + Microservices + .NET Core 8.0 + ASP.NET Core 8.0 + ASP.NET Core 8.0 MVC + MediatR + Selenium + AngleSharp + HtmlAgilityPack + PostgreSQL + EntityFramework + SignalR + Serilog + Seq + Redis + RabbitMQ + Docker
This POC demonstrate simple usage of Azure functions and Screen scrapping with Html Agility Pack.
This sample demonstrate simple usage of Azure functions V3, DI with Autofac and Screen scrapping with Html Agility Pack.
This is an example of how to crawl a website using the (NuGet) HtmlAgilityPack and saving the results to a txt file.
(Crediz) Credit Simulation 4nd Register Support Application for Duy Tan University
Projeto ASP.NET Core .NET 5 para Extração e Parseamento de Dados do governo de São Paulo com integração com Buckets S3, Filas SQS AWS e Persistência realizada via EF Core no Mysql.
Experimental project to scrape a web page of comics and convert a comic into pdf
It is a program that prints the titles and prices of the advertisements on the main page of the "Sahibinden.com" site on the console screen and saves them in the text document.
Asp.net 6 core mvc realizes crawling Taobao product information and CoinGecko,Investing Related Information
It enables you to parse web sites or any other XML-based content with a predefined template.
This library allows you to retrieve several things from GitHub, things like trending repositories, profiles of users, the repositories of users and related information.
Add a description, image, and links to the htmlagilitypack topic page so that developers can more easily learn about it.
To associate your repository with the htmlagilitypack topic, visit your repo's landing page and select "manage topics."