Stars
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
A journey to a real multimodal R1! We are running a large-scale experiment.
Witness the aha moment of VLM with less than $3.
Frontier Multimodal Foundation Models for Image and Video Understanding
Janus-Series: Unified Multimodal Understanding and Generation Models
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
Token-level visualization tools for large language models
veRL: Volcano Engine Reinforcement Learning for LLM
UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model
DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including CUDA, x86 and ARMv9.
Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment
Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…
Code for ChatRex: Taming Multimodal LLM for Joint Perception and Understanding
The implementation of the paper 'Advancing Fine-Grained Visual Understanding with Multi-Granularity Alignment in Multi-Modal Models'
[CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"
A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.
Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models
Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models
Accelerating the development of large multimodal models (LMMs) with the one-click evaluation module lmms-eval.
MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities (ICML 2024)
Reasoning in LLMs: Papers and Resources, including Chain-of-Thought, OpenAI o1, and DeepSeek-R1 🍓