Stars
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
Robust Speech Recognition via Large-Scale Weak Supervision
Qwen2.5 is the large language model series developed by the Qwen team, Alibaba Cloud.
🦙 Integrating LLMs into structured NLP pipelines
Official implementation of the papers "GECToR – Grammatical Error Correction: Tag, Not Rewrite" (BEA-20) and "Text Simplification by Tagging" (BEA-21)
Code & Data for our Paper "NaSGEC: Multi-Domain Chinese Grammatical Error Correction for Native Speaker Texts" (ACL 2023 Findings)
MuCGEC Chinese error-correction dataset and open-source SOTA text-correction models; Code & Data for our NAACL 2022 Paper "MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error Correction"
ModelScope: bring the notion of Model-as-a-Service to life.
ChatGLM3 series: Open Bilingual Chat LLMs
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
A state-of-the-art open visual language model | multimodal pretrained model
Chinese LLaMA-2 & Alpaca-2 LLMs, phase two of the project, with 64K long-context models
Let ChatGPT teach your own chatbot in hours with a single GPU!
This chatbot app is built using the Llama 2 open source LLM from Meta.
HyDE: Precise Zero-Shot Dense Retrieval without Relevance Labels
Embeddable property graph database management system built for query speed and scalability. Implements Cypher.
LlamaIndex is the leading framework for building LLM-powered agents over your data.
A wrapper for pip download for offline scenarios.
Trained models with fast variant of the "best" LSTM models + legacy models
ChatGLM2-6B: An Open Bilingual Chat LLM
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
Decoupling Reasoning from Observations for Efficient Augmented Language Models
Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"