Stars
Awesome RL Reasoning Recipes ("Triple R")
Scalable toolkit for efficient model alignment
😎 A curated list of awesome LMM hallucination papers, methods & resources.
Paper collections of multi-modal LLM for Math/STEM/Code.
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
A high-throughput and memory-efficient inference and serving engine for LLMs
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
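LLM API management & distribution system supporting mainstream models including OpenAI, Azure, Anthropic Claude, Google Gemini, DeepSeek, ByteDance Doubao, ChatGLM, ERNIE Bot (Wenxin Yiyan), iFlytek Spark, Tongyi Qianwen, 360 Zhinao, and Tencent Hunyuan. Unified API adaptation, usable for key management and redistribution. Single executable with a Docker image provided; one-click deployment, works out of the box. …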
GLM-4 series: Open Multilingual Multimodal Chat LMs
GPT4V-level open-source multi-modal model based on Llama3-8B
awesome grounding: A curated list of research papers in visual grounding
Open-Sora: Democratizing Efficient Video Production for All
Large Language Model Text Generation Inference
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…
Google Research
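A summary of intelligent question answering research and applications by the natural language processing research team at the Advanced Innovation Center for Big Data, Beihang University. Covers knowledge-graph-based QA (KBQA), text-based QA (TextQA), table-based QA (TableQA), vision-based QA (VisualQA), and machine reading comprehension (MRC), with a separate summary of academic and industry work for each task type.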
Robust recipes to align language models with human and AI preferences
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities (ICML 2024)
An Autonomous LLM Agent for Complex Task Solving
A quick guide (especially) for trending instruction finetuning datasets
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
A state-of-the-art-level open visual language model | Multimodal pretrained model