Starred repositories
Superfast AI decision making and intelligent processing of multi-modal data.
Official Repo of "MMBench: Is Your Multi-modal Model an All-around Player?"
Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, I…
Implementation of Nougat: Neural Optical Understanding for Academic Documents
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
A framework for few-shot evaluation of language models.
An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…
RESTful JSON API for django-oscar
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
Tools for merging pretrained large language models.
verl: Volcano Engine Reinforcement Learning for LLMs
Janus-Series: Unified Multimodal Understanding and Generation Models
Fully open reproduction of DeepSeek-R1
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
Everything about the SmolLM2 and SmolVLM family of models
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Awesome LLM Books: Curated list of books on Large Language Models
OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking
A Python package that makes it easy for developers to create AI apps powered by various AI providers.
Adding guardrails to large language models.
This is the official repository for The Hundred-Page Language Models Book by Andriy Burkov
🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents.