Starred repositories
Production-ready C++ Asynchronous Framework with rich functionality
The fantastic ORM library for Golang, which aims to be developer friendly
Fluss is a streaming storage built for real-time analytics.
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
Paper List for In-context Learning 🌷
"rsync for cloud storage" - Google Drive, S3, Dropbox, Backblaze B2, One Drive, Swift, Hubic, Wasabi, Google Cloud Storage, Azure Blob, Azure Files, Yandex Files
Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image …
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
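A collection of industry practice articles on search, recommendation, advertising, user growth, and more (sources: Zhihu, Datafuntalk, technical WeChat official accounts)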
Streamlit — A faster way to build and share data apps.
A framework for few-shot evaluation of language models.
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
New file format for storage of large columnar datasets.
Making large AI models cheaper, faster and more accessible
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
A PyTorch implementation of the Transformer model in "Attention is All You Need".
Human ChatGPT Comparison Corpus (HC3), Detectors, and more! 🔥
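Chinese large language model fine-tuning (LLM-SFT), with the math instruction dataset MWP-Instruct; supported models (ChatGLM-6B, LLaMA, Bloom-7B, baichuan-7B); supported tooling (LoRA, QLoRA, DeepSpeed, UI, TensorboardX); supported tasks (fine-tuning, inference, evaluation, API), and more.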
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting a wide range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search…
Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, I…
🐙 Guides, papers, lectures, notebooks and resources for prompt engineering
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.