-
IIT Jodhpur
- Jodhpur, India
-
08:22
(UTC +05:30) - in/souvik-maji-a12543251
Highlights
- Pro
Stars
β¨β¨Latest Advances on Multimodal Large Language Models
Install and run your own AI agent service
DeepSeek Coder: Let the Code Write Itself
Fully open reproduction of DeepSeek-R1
2025 AI/ML internship & new graduate job list updated daily
[CVPR 2023] Vote2Cap-DETR and [T-PAMI 2024] Vote2Cap-DETR++; A set-to-set perspective towards 3D Dense Captioning; State-of-the-Art 3D Dense Captioning methods
π Collection of Kaggle Solutions and Ideas π
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
Official repository for our work on micro-budget training of large-scale diffusion models.
A practical RAG where you can download and chat with github repo
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. Cβ¦
Collection of papers and repos for multimodal chain-of-thought
[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
Awesome Few-Shot Class-Incremental Learning
Learning to Prompt (L2P) for Continual Learning @ CVPR22 and DualPrompt: Complementary Prompting for Rehearsal-free Continual Learning @ ECCV22
[CVPR-2024] Official implementations of CLIP-KD: An Empirical Study of CLIP Model Distillation
Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
veRL: Volcano Engine Reinforcement Learning for LLM
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
AdalFlow: The library to build & auto-optimize LLM applications.
Run your own AI cluster at home with everyday devices π±π» π₯οΈβ
π§βπ« 60+ Implementations/tutorials of deep learning papers with side-by-side notes π; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gaβ¦
γICLR 2025γ A Sanity Check for AI-generated Image Detection