Stars
Self-hosted game stream host for Moonlight.
A collection of papers on multi-modal LLMs for Math/STEM/Code.
veRL: Volcano Engine Reinforcement Learning for LLM
Let your Claude be able to think
Align Anything: Training All-modality Model with Feedback
deepspeedai / Megatron-DeepSpeed
Forked from NVIDIA/Megatron-LM
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Speech, Language, Audio, Music Processing with Large Language Model
✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
Even 1 minute of voice data can be used to train a good TTS model! (few-shot voice cloning)
GraphRAG using Local LLMs - Features robust API and multiple apps for Indexing/Prompt Tuning/Query/Chat/Visualizing/Etc. This is meant to be the ultimate GraphRAG/KG local LLM app.
The Open-Source Data Annotation Platform
A data annotation toolbox that supports image, audio, and video data.
A one-stop, open-source, high-quality data extraction tool that converts PDF to Markdown and JSON.
A modular graph-based Retrieval-Augmented Generation (RAG) system
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making
A generative speech model for daily dialogue.
[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward
sharkwyf / RepoAgent
Forked from OpenBMB/RepoAgent
An LLM-powered repository agent designed to assist developers and teams in generating documentation and understanding repositories quickly.
Aria is Your AI Research Assistant Powered by GPT Large Language Models
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
[ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization
[ACL'24, Outstanding Paper] Emulated Disalignment: Safety Alignment for Large Language Models May Backfire!
A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)