Starred repositories
Real-time, fine-grained reading list on LLM-synthetic-data.🔥
A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.
[AAAI 2025] Friends-MMC: A Dataset for Multi-modal Multi-party Conversation Understanding
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
🌋👵🏻 Yo'LLaVA: Your Personalized Language and Vision Assistant
Finetune Llama 3.3, Mistral, Phi-4, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
xhc19930714 / vit-pytorch
Forked from lucidrains/vit-pytorchImplementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
📖 Paper reading list in conversational AI (constantly updating 🤗).
text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。
The code and resource of "Towards Comprehensive Detection of Chinese Harmful Memes" (NeurIPS2024 D&B).
An Open Large Reasoning Model for Real-World Solutions
User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)
Virtual whiteboard for sketching hand-drawn like diagrams
Official repository for EMNLP'24 paper "ToxiCloakCN: Evaluating Robustness of Offensive Language Detection in Chinese with Cloaking Perturbations"
Official code for paper: Chain of Ideas: Revolutionizing Research via Novel Idea Development with LLM Agents
Abstraction your words——never mind the scandal and liber
Can MLLMs Understand the Deep Implication Behind Chinese Images?
Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek3, ...) and 150+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, Inter…
(撰写ing..)本仓库偏教程性质,以「模型中文化」为一个典型的模型训练问题切入场景,指导读者上手学习LLM二次微调训练。
Llama3、Llama3.1 中文仓库(随书籍撰写中... 各种网友及厂商微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档)
llama3 implementation one matrix multiplication at a time
This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面壁小钢炮” focuses on achieving exceptional performance on the edge.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Writing AI Conference Papers: A Handbook for Beginners