Lists (6)
Sort Name ascending (A-Z)
Stars
💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023
[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | 基于CPM基础模型的中英双语多模态大模型系列
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali
The evaluation code for MultiIF multi-turn and multi-lingual instruction following
A course on aligning smol models.
Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
Faster Whisper transcription with CTranslate2
Accessible large language models via k-bit quantization for PyTorch.
Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation preconditioner and more)
Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793
The official implementation of MARS: Unleashing the Power of Variance Reduction for Training Large Models
When it comes to optimizers, it's always better to be safe than sorry
A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.
Flax is a neural network library for JAX that is designed for flexibility.
Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"
A scikit-learn-compatible library for estimating prediction intervals and controlling risks, based on conformal predictions.
(ICLR 2025) TabM: Advancing Tabular Deep Learning With Parameter-Efficient Ensembling
Official implementation of "GPT or BERT: why not both?"
A comprehensive repository of reasoning tasks for LLMs (and beyond)
For optimization algorithm research and development.
The Open Cookbook for Top-Tier Code Large Language Model
🏃 Python3 Solutions of All 27 Problems in MHC 2024