Skip to content
View Yysrc's full-sized avatar
  • Fudan University
  • 中国
  • 11:35 (UTC -12:00)

Block or report Yysrc

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A free, fast, and reliable Open Source CDN for npm, GitHub, Javascript, and ESM

JavaScript 5,802 2,073 Updated Mar 15, 2025
1 Updated Mar 20, 2025

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 17,521 1,448 Updated Feb 25, 2025

finetune qwen vl, both local and distributed model.

Python 1 Updated Feb 15, 2025

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discr…

Python 7,575 1,330 Updated Mar 20, 2025

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 141,580 28,349 Updated Mar 20, 2025
Python 223 2 Updated Mar 17, 2025

[Findings of EMNLP 2024] AdaMoE: Token-Adaptive Routing with Null Experts for Mixture-of-Experts Language Models

Python 6 Updated Oct 2, 2024

In-context KV-Cache Eviction for LLMs via Attention-Gate

Python 5 Updated Oct 22, 2024

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 8,910 625 Updated Mar 7, 2025

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Python 1,596 272 Updated Jan 16, 2024

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

4,837 503 Updated Sep 25, 2024

✨✨Latest Advances on Multimodal Large Language Models

14,363 924 Updated Mar 19, 2025

Codes for paper "Three Steps to Multimodal Trajectory Prediction: Modality Clustering, Classification and Synthesis", "Human Trajectory Prediction with Momentary Observation" and "Stimulus Verifica…

Python 32 3 Updated Jun 4, 2023

[RSS 2024] Code for "Multimodal Diffusion Transformer: Learning Versatile Behavior from Multimodal Goals" for CALVIN experiments with pre-trained weights

C++ 113 10 Updated Oct 16, 2024

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 16,796 2,202 Updated Feb 1, 2025
Jupyter Notebook 1,692 162 Updated Sep 27, 2024

[IEEE T-PAMI 2024] All you need for End-to-end Autonomous Driving

2,881 270 Updated Dec 17, 2024

State-of-the-art bilingual open-sourced Math reasoning LLMs.

Python 497 29 Updated Oct 22, 2024

Official implementation of GR-MG

Python 76 6 Updated Jan 12, 2025

Reimplementation of GR-1, a generalized policy for robotics manipulation.

Python 124 5 Updated Sep 4, 2024

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Python 2,548 489 Updated Apr 15, 2024

Official PyTorch Implementation for the "Recovering the Pre-Fine-Tuning Weights of Generative Models" paper (ICML 2024).

Python 78 2 Updated Dec 17, 2024

SIFT: Grounding LLM Reasoning in Contexts via Stickers

Python 51 3 Updated Mar 6, 2025

Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling"

Python 832 66 Updated Aug 27, 2024

Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents.

Python 87 4 Updated Mar 20, 2025
Python 56 5 Updated Aug 8, 2024
Python 33 1 Updated Feb 14, 2025
Next