- Beijing
-
06:51
(UTC -12:00) - https://scholar.google.com/citations?user=eh-XJIoAAAAJ&hl=zh-CN
- @_wujinming
Lists (10)
Sort Name ascending (A-Z)
Stars
PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"
MLNLP社区用来帮助大家避免论文投稿小错误的整理仓库。 Paper Writing Tips
🌐 WebThinker: Empowering Large Reasoning Models with Deep Research Capability
Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities
[ICLR 2025] The First Multimodal Seach Engine Pipeline and Benchmark for LMMs
[CVPR 2025] EgoLife: Towards Egocentric Life Assistant
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning
verl: Volcano Engine Reinforcement Learning for LLMs
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
✨First Open-Source R1-like Video-LLM [2025/02/18]
Fully open reproduction of DeepSeek-R1
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
Reasoning in LLMs: Papers and Resources, including Chain-of-Thought, OpenAI o1, and DeepSeek-R1 🍓
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
A suite of image and video neural tokenizers
Official repository of Uni-AdaFocus (TPAMI 2024).
[ICLR 2025] Autoregressive Video Generation without Vector Quantization
MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale
PyTorch implementation of "ChatTime: A Unified Multimodal Time Series Foundation Model Bridging Numerical and Textual Data" (AAAI 2025 [oral])
Official repository for VisionZip (CVPR 2025)
A curated list of paper, code, data, and other resources focus on multimodal time series analysis.
[ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
ElasticTok: Adaptive Tokenization for Image and Video