Starred repositories
veRL: Volcano Engine Reinforcement Learning for LLM
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Efficient LLM Inference over Long Sequences
Quantized Attention that achieves speedups of 2.1-3.1x and 2.7-5.1x compared to FlashAttention2 and xformers, respectively, without lossing end-to-end metrics across various models.
Access your Minikube for Mac's internal networks from your macOS host machine
Best Practices on Recommendation Systems
中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)
Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.
LLaMA: Open and Efficient Foundation Language Models
4 bits quantization of LLaMA using GPTQ
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
整理自然语言处理、推荐系统、搜索引擎等AI领域的入门笔记,论文学习笔记和面试资料(关于NLP那些你不知道的事、关于推荐系统那些你不知道的事、NLP百面百搭、推荐系统百面百搭、搜索引擎百面百搭)
Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
Determined is an open-source machine learning platform that simplifies distributed training, hyperparameter tuning, experiment tracking, and resource management. Works with PyTorch and TensorFlow.
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
An enterprise-class low-code technology stack with scale-out design / 一套面向扩展设计的企业级低代码技术体系
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
A collection of pre-trained, state-of-the-art models in the ONNX format
Application Management Platform on Multi-Cloud Environment
Pyodide is a Python distribution for the browser and Node.js based on WebAssembly
中文医疗信息处理基准CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark
Benchmarks of approximate nearest neighbor libraries in Python
Dapr is a portable, event-driven, runtime for building distributed applications across cloud and edge.
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
A resource tracking a number of Operators out in the wild.