Starred repositories
Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!
🚀🀄️ A fast and strong AI for riichi mahjong, powered by Rust and deep reinforcement learning.
🔍🀄️ Review mahjong game log with mjai-compatible mahjong AI.
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
TuShare is a utility for crawling historical data of China stocks
搜集、整理、发布 中文 自然语言处理 语料/数据集,与 有志之士 共同 促进 中文 自然语言处理 的 发展。
Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复
An efficient PyTorch library for deep generative modeling.
Syntactic parsing automaton visualization, including LR(0), SLR, LR(1), LALR | 在线可视化地设计、运行文法分析器与自动机,支持 LR(0), SLR, LR(1), LALR 分析
pytorch handbook是一本开源的书籍,目标是帮助那些希望和使用PyTorch进行深度学习开发和研究的朋友快速入门,其中包含的Pytorch教程全部通过测试保证可以成功运行
PyTorch Tutorial for Deep Learning Researchers
Tensors and Dynamic neural networks in Python with strong GPU acceleration
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
An open source library for face detection in images. The face detection speed can reach 1000FPS.
Easier Automatic Sentence Simplification Evaluation
Fast + Non-Autoregressive Grammatical Error Correction using BERT. Code and Pre-trained models for paper "Parallel Iterative Edit Models for Local Sequence Transduction": www.aclweb.org/anthology/D…
MaxMatch (M^2) Scorer - Evaluation program for grammatical error correction systems.
ERRor ANnotation Toolkit: Automatically extract and classify grammatical errors in parallel original and corrected sentences.
Official implementation of the papers "GECToR – Grammatical Error Correction: Tag, Not Rewrite" (BEA-20) and "Text Simplification by Tagging" (BEA-21)
System Combination for Grammatical Error Correction Based on Integer Programming
pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,Qwen2.5等模型应用在纠错场景,开箱即用。
(yet another not really) awesome topic/text segmentation list
Official code for ICLR 2022 paper: "PoNet: Pooling Network for Efficient Token Mixing in Long Sequences".
The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to provide participants with baseline systems for speech recogniti…
Implementation of the paper: Text Segmentation as a Supervised Learning Task