Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

Python 4,200 396 Updated Jan 20, 2025

A course on aligning smol models.

Jupyter Notebook 5,022 1,622 Updated Jan 20, 2025

Align Anything: Training All-modality Model with Feedback

Python 604 105 Updated Jan 20, 2025

Large Reasoning Models

Python 799 44 Updated Dec 3, 2024

⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.

Jupyter Notebook 2,233 394 Updated Sep 29, 2023

Eagle Family: Exploring Model Designs, Data Recipes and Training Strategies for Frontier-Class Multimodal LLMs

Python 533 36 Updated Jan 20, 2025

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 4,253 261 Updated Jan 11, 2025

An RLHF Infrastructure for Vision-Language Models

Python 146 7 Updated Nov 15, 2024
Python 94 2 Updated Dec 22, 2023
Python 3,291 294 Updated Oct 16, 2024

Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation

Python 711 38 Updated Aug 5, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 3,819 367 Updated Jan 20, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 12,920 2,635 Updated Jan 21, 2025

Scalable toolkit for efficient model alignment

Python 674 85 Updated Jan 18, 2025

📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).

558 20 Updated Dec 23, 2024

List of papers on hallucination detection in LLMs.

743 60 Updated Dec 19, 2024

An LLM-free Multi-dimensional Benchmark for Multi-modal Hallucination Evaluation

Python 106 2 Updated Jan 15, 2024

[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

Python 267 7 Updated Nov 13, 2024

Train transformer language models with reinforcement learning.

Python 10,646 1,378 Updated Jan 20, 2025

【NeurIPS 2024】The official code of paper "Automated Multi-level Preference for MLLMs"

Python 17 1 Updated Sep 26, 2024
Python 135 21 Updated Oct 31, 2024

M-HalDetect Dataset Release

20 2 Updated Nov 4, 2023

Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization

Python 77 6 Updated Jan 30, 2024

[Arxiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning

Python 78 3 Updated Apr 30, 2024

Enhancing Large Vision Language Models with Self-Training on Image Comprehension.

Python 62 4 Updated May 31, 2024
Python 141 26 Updated May 24, 2024

Multimodal Models in Real World

Jupyter Notebook 429 19 Updated Oct 28, 2024

🤗 Evaluate: A library for easily evaluating machine learning models and datasets.

Python 2,086 264 Updated Jan 10, 2025

Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.

Python 2,012 190 Updated Jan 17, 2025

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 17,058 1,219 Updated Jan 20, 2025