X-Lai

R1-like Computer-use Agent

Python 65 4 Updated Mar 21, 2025

Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"

Python 301 7 Updated Apr 11, 2025

Reproduce R1 Zero on Logic Puzzle

Python 2,282 151 Updated Mar 20, 2025
Python 921 104 Updated Jan 23, 2025

Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers

113 3 Updated Jan 13, 2025

Official Implementation for "Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition"

Python 283 29 Updated Jan 9, 2025

Official repository for VisionZip (CVPR 2025)

Python 266 12 Updated Feb 27, 2025

Unified Language-driven Zero-shot Domain Adaptation (CVPR 2024)

Python 17 1 Updated Nov 28, 2024

Official Codebase of "DiffComplete: Diffusion-based Generative 3D Shape Completion"

Python 98 9 Updated Aug 14, 2024

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 19,201 1,386 Updated Mar 3, 2025

[CVPR'25 highlight] RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness

Python 346 14 Updated Mar 4, 2025

Reference implementation for DPO (Direct Preference Optimization)

Python 2,517 204 Updated Aug 11, 2024

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Python 2,807 172 Updated Jan 22, 2025

MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

Python 428 39 Updated Feb 1, 2024

Robust recipes to align language models with human and AI preferences

Python 5,122 440 Updated Nov 21, 2024

Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA

Python 1,556 79 Updated Sep 25, 2024

Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"

Python 358 14 Updated Jan 19, 2025

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 17,843 1,468 Updated Mar 28, 2025

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

5,601 847 Updated Sep 24, 2024

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 16,619 1,170 Updated Mar 14, 2025

GLM-4 series: Open Multilingual Multimodal Chat LMs

Python 6,086 528 Updated Mar 7, 2025

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

4,862 511 Updated Sep 25, 2024

The official source code for "X-Ray: A Sequential 3D Representation for Generation".

Python 108 4 Updated Mar 7, 2025

A PyTorch native library for large model training

Python 3,584 332 Updated Apr 13, 2025

The official Meta Llama 3 GitHub site

Python 28,607 3,355 Updated Jan 26, 2025

Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.

405 16 Updated Apr 18, 2024

Code examples and resources for DBRX, a large language model developed by Databricks

Python 2,548 242 Updated May 1, 2024

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,266 283 Updated May 4, 2024

[CVPR 2024] GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding

44 1 Updated Mar 15, 2024
Jupyter Notebook 490 40 Updated Nov 2, 2024