PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"

Python 589 39 Updated Jan 7, 2024

A repository curated by the MLNLP community to help authors avoid small mistakes in paper submissions. Paper Writing Tips

3,921 485 Updated May 29, 2022

🌐 WebThinker: Empowering Large Reasoning Models with Deep Research Capability

Python 139 10 Updated Apr 9, 2025

Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities

731 34 Updated Apr 15, 2025

[ICLR 2025] The First Multimodal Search Engine Pipeline and Benchmark for LMMs

Python 427 30 Updated Jan 23, 2025

[CVPR 2025] EgoLife: Towards Egocentric Life Assistant

Python 262 17 Updated Mar 19, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 47,054 5,747 Updated Apr 16, 2025

ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning

Python 667 41 Updated Apr 15, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 6,769 731 Updated Apr 17, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 1,909 138 Updated Apr 11, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 2,049 145 Updated Apr 15, 2025

✨First Open-Source R1-like Video-LLM [2025/02/18]

Python 319 11 Updated Feb 23, 2025
Python 40 1 Updated Apr 8, 2025

Simple RL training for reasoning

Python 3,463 257 Updated Apr 10, 2025

Fully open reproduction of DeepSeek-R1

Python 23,997 2,193 Updated Apr 17, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,588 1,465 Updated Apr 2, 2025

Reasoning in LLMs: Papers and Resources, including Chain-of-Thought, OpenAI o1, and DeepSeek-R1 🍓

2,980 169 Updated Mar 19, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,677 369 Updated Apr 16, 2025

A suite of image and video neural tokenizers

Jupyter Notebook 1,613 75 Updated Feb 11, 2025

Official repository of Uni-AdaFocus (TPAMI 2024).

Python 41 1 Updated Dec 17, 2024

[ICLR 2025] Autoregressive Video Generation without Vector Quantization

Python 472 11 Updated Apr 17, 2025

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

Python 39 2 Updated Dec 9, 2024

PyTorch implementation of "ChatTime: A Unified Multimodal Time Series Foundation Model Bridging Numerical and Textual Data" (AAAI 2025 [oral])

Jupyter Notebook 69 11 Updated Dec 17, 2024

Official repository for VisionZip (CVPR 2025)

Python 268 12 Updated Feb 27, 2025

A curated list of papers, code, data, and other resources focused on multimodal time series analysis.

54 3 Updated Apr 16, 2025

[ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation

Python 281 8 Updated Jan 22, 2025
Python 368 27 Updated Feb 28, 2025

ElasticTok: Adaptive Tokenization for Image and Video

Python 66 Updated Nov 4, 2024