Skip to content
View Mikan5916's full-sized avatar

Block or report Mikan5916

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,407 1,101 Updated Feb 14, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 5,977 346 Updated Feb 14, 2025

Ultralytics YOLO11 🚀

Python 36,498 7,038 Updated Feb 14, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 4,673 461 Updated Feb 14, 2025

A jounery to real multimodel R1 ! We are doing on large-scale experiment

Python 141 1 Updated Feb 12, 2025

Witness the aha moment of VLM with less than $3.

Python 2,416 178 Updated Feb 14, 2025

Frontier Multimodal Foundation Models for Image and Video Understanding

Jupyter Notebook 460 29 Updated Feb 10, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 15,795 2,081 Updated Feb 1, 2025

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 3,589 1,485 Updated Feb 9, 2025

s1: Simple test-time scaling

Python 5,261 593 Updated Feb 13, 2025

Token level visualization tools for large language models

Python 72 7 Updated Jan 8, 2025

veRL: Volcano Engine Reinforcement Learning for LLM

Python 3,129 264 Updated Feb 14, 2025
Python 2,127 146 Updated Feb 10, 2025

UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model

20 1 Updated Aug 5, 2024

DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including CUDA, x86 and ARMv9.

C 229 24 Updated Feb 13, 2025

Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment

Python 40 2 Updated Jan 2, 2025

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 1,819 264 Updated Feb 13, 2025

📚 技术面试必备基础知识、Leetcode、计算机操作系统、计算机网络、系统设计

178,851 51,184 Updated Aug 21, 2024

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 6,560 431 Updated Jan 12, 2025

Code for ChatRex: Taming Multimodal LLM for Joint Perception and Understanding

Python 148 7 Updated Jan 24, 2025

The implementation of the paper 'Advancing Fine-Grained Visual Understanding with Multi-Granularity Alignment in Multi-Modal Models'

7 Updated Nov 14, 2024

[CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"

Python 266 3 Updated Dec 23, 2024

A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.

413 15 Updated Feb 13, 2025

Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models

Python 128 4 Updated Dec 17, 2024

Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models

247 7 Updated Jan 7, 2025
Jupyter Notebook 147 9 Updated Dec 2, 2024

Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.

Python 2,095 204 Updated Feb 14, 2025

MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities (ICML 2024)

Python 284 11 Updated Jan 20, 2025

Reasoning in LLMs: Papers and Resources, including Chain-of-Thought, OpenAI o1, and DeepSeek-R1 🍓

2,514 141 Updated Jan 31, 2025
Next