Starred repositories
Phishing domains, URLs, websites and threats database. We use the PyFunceble testing tool to validate the status of all known phishing domains and provide stats to reveal how many unique domains use…
A fast, scalable, high-performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports comp…
A little word cloud generator in Python
Streaming WARC/ARC library for fast web archive IO
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Automatically generate and overlay subtitles for any video.
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Python tool for grabbing text via screenshot
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
A browser extension to separate lords from peasants
A simple and efficient tool to parallelize Pandas operations on all available CPUs
Segment an HTML document into structural data
Robust Speech Recognition via Large-Scale Weak Supervision
Hindi POS Tags and keywords using TNT model. Created Date: 28 Sept 2018
🤗 Evaluate: A library for easily evaluating machine learning models and datasets.
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Isometric Spoken Language Translation - Isometric SLT.
Google Drive Public File Downloader when Curl/Wget Fails
Code and data for the IWSLT 2022 shared task on Formality Control for SLT
This is where I put things I find useful that speed up my work with Machine Learning. Ever looked in your old projects to reuse those cool functions you created before? Well, this repo is designed …
Efficient, check-pointed data loading for deep learning with massive data sets.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Facebook Low Resource (FLoRes) MT Benchmark
A collaborative catalog of NLP resources for Indic languages
A tool that locates, downloads, and extracts machine translation corpora
Interactive Neural Machine Translation tool