Stars
Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
Faster Whisper transcription with CTranslate2
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
LLM API 管理 & 分发系统,支持 OpenAI、Azure、Anthropic Claude、Google Gemini、DeepSeek、字节豆包、ChatGLM、文心一言、讯飞星火、通义千问、360 智脑、腾讯混元等主流模型,统一 API 适配,可用于 key 管理与二次分发。单可执行文件,提供 Docker 镜像,一键部署,开箱即用。LLM API management & k…
Collections of resources from Joint Laboratory of HIT and iFLYTEK Research (HFL)
中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
【LLMs九层妖塔】分享 LLMs在自然语言处理(ChatGLM、Chinese-LLaMA-Alpaca、小羊驼 Vicuna、LLaMA、GPT4ALL等)、信息检索(langchain)、语言合成、语言识别、多模态等领域(Stable Diffusion、MiniGPT-4、VisualGLM-6B、Ziya-Visual等)等 实战与经验。
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, le…
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
FinGLM: 致力于构建一个开放的、公益的、持久的金融大模型项目,利用开源开放来促进「AI+金融」。
Robust Speech Recognition via Large-Scale Weak Supervision
手把手带你实战 Huggingface Transformers 课程视频同步更新在B站与YouTube
The code for the paper C3: Zero-shot Text-to-SQL with ChatGPT
A library for efficient similarity search and clustering of dense vectors.
Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型