Stars
The deslanting algorithm sets text upright in images. Python, C++ and OpenCL implementations provided.
Collection of different NLP recipes :)
An awesome list of (large-scale) public datasets on the Internet. (Ongoing collection)
This is a temporary repository for working on improvements for my book 'Text Analytics with Python'
Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Te…
Contains relevant notebooks for the hands-on NLP workshop at the GIDS AIML Conference - 2020 Edition
A TensorFlow project to detect sarcasm in tweets using the power of GloVe embeddings and Convolution layers.
Google Colab notebooks - tutorials, guides on ML topics
🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based
📜 Dehyphenation of broken text (mainly German), e.g., text extracted from a PDF
GROBID extension for identifying and normalizing physical quantities.
A Deep Learning Framework for Text: https://delft.readthedocs.io/
A high performance bibliographic information service: https://biblio-glutton.readthedocs.io
Machine learning software for extracting information from scholarly documents
An opinionated guide to common Jekyll design patterns and anti-patterns.
Scrapy, a fast high-level web crawling & scraping framework for Python.
A Jekyll plugin that provides users with a traditional CMS-style graphical interface to author content and administer Jekyll sites.
A site to provide non-judgmental guidance on choosing a license for your open source project