- IIT Jodhpur
- Jodhpur, India
- UTC +05:30
- in/souvik-maji-a12543251
Stars
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
A high-throughput and memory-efficient inference and serving engine for LLMs
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
A modular graph-based Retrieval-Augmented Generation (RAG) system
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Official inference repo for FLUX.1 models
DeepSeek Coder: Let the Code Write Itself
Fully open reproduction of DeepSeek-R1
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
Janus-Series: Unified Multimodal Understanding and Generation Models
Letta (formerly MemGPT) is a framework for creating LLM services with memory.
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents.
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance
The open source platform for AI-native application development.
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
Seamlessly integrate LLMs into scikit-learn.
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editin…
veRL: Volcano Engine Reinforcement Learning for LLM
AdalFlow: The library to build & auto-optimize LLM applications.
Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.