Stars
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
A course on aligning smol models.
Align Anything: Training All-modality Models with Feedback
⭐️ NLP algorithms built on the transformers library, supporting text classification, text generation, information extraction, text matching, RLHF, SFT, etc.
Eagle Family: Exploring Model Designs, Data Recipes and Training Strategies for Frontier-Class Multimodal LLMs
Qwen2-VL is the multimodal large language model series developed by the Qwen team at Alibaba Cloud.
An RLHF Infrastructure for Vision-Language Models
Anole: An Open, Autoregressive, and Native Multimodal Model for Interleaved Image-Text Generation
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Scalable toolkit for efficient model alignment
📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).
List of papers on hallucination detection in LLMs.
An LLM-free Multi-dimensional Benchmark for Multi-modal Hallucination Evaluation
[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models
Train transformer language models with reinforcement learning.
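As a quick illustration of the TRL entry above, here is a minimal supervised fine-tuning sketch. The exact `SFTTrainer` signature varies across TRL releases, and the model and dataset names are placeholders for illustration, not part of the original list.

```python
# Minimal TRL sketch: supervised fine-tuning with SFTTrainer.
# Assumes a recent TRL release; the model id and dataset are illustrative.
from datasets import load_dataset
from trl import SFTTrainer

# Any text or conversational dataset from the Hub works here.
dataset = load_dataset("trl-lib/Capybara", split="train")

trainer = SFTTrainer(
    model="Qwen/Qwen2-0.5B",  # a model id string or a preloaded model object
    train_dataset=dataset,
)
trainer.train()
```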
[NeurIPS 2024] The official code of the paper "Automated Multi-level Preference for MLLMs"
Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization
[arXiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning
Enhancing Large Vision Language Models with Self-Training on Image Comprehension.
🤗 Evaluate: A library for easily evaluating machine learning models and datasets.
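To show how the 🤗 Evaluate library above is typically used, a minimal sketch: load a metric from the Hugging Face Hub and compute it over predictions and references.

```python
# Minimal 🤗 Evaluate sketch: load a metric and score predictions.
import evaluate

# Fetches the "accuracy" metric implementation from the Hub.
accuracy = evaluate.load("accuracy")
results = accuracy.compute(
    predictions=[0, 1, 1, 0],
    references=[0, 1, 0, 0],
)
print(results)  # e.g. {'accuracy': 0.75}
```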
Accelerating the development of large multimodal models (LMMs) with the one-click evaluation module lmms-eval.
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone