Skip to content
Change the repository type filter

All

    Repositories list

    • Collection of awesome LLM apps with RAG using OpenAI, Anthropic, Gemini and opensource models.
      Python
      Apache License 2.0
      1k000Updated Dec 23, 2024Dec 23, 2024
    • SPaR

      Public
      Python
      Apache License 2.0
      2000Updated Dec 17, 2024Dec 17, 2024
    • kvpress

      Public
      LLM KV cache compression made easy
      Python
      Apache License 2.0
      14000Updated Dec 9, 2024Dec 9, 2024
    • mlc-llm

      Public
      Universal LLM Deployment Engine with ML Compilation
      Python
      Apache License 2.0
      1.6k000Updated Nov 25, 2024Nov 25, 2024
    • The simplest implementation of recent Sparse Attention patterns for efficient LLM inference.
      Jupyter Notebook
      4000Updated Nov 19, 2024Nov 19, 2024
    • Code for studying the super weight in LLM
      Jupyter Notebook
      6000Updated Nov 11, 2024Nov 11, 2024
    • Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
      Jupyter Notebook
      Other
      4.6k000Updated Nov 9, 2024Nov 9, 2024
    • Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
      Python
      563000Updated Oct 28, 2024Oct 28, 2024
    • spiritlm

      Public
      Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".
      Python
      Other
      56000Updated Oct 24, 2024Oct 24, 2024
    • MMLU-Pro

      Public
      The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark" [NeurIPS 2024]
      Python
      Apache License 2.0
      25000Updated Oct 22, 2024Oct 22, 2024
    • Aria

      Public
      Codebase for Aria - an Open Multimodal Native MoE
      Jupyter Notebook
      Apache License 2.0
      79000Updated Oct 16, 2024Oct 16, 2024
    • The Triton TensorRT-LLM Backend
      Python
      Apache License 2.0
      111000Updated Oct 15, 2024Oct 15, 2024
    • TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
      C++
      Apache License 2.0
      1k100Updated Oct 15, 2024Oct 15, 2024
    • Awesome-LLM: a curated list of Large Language Model
      Creative Commons Zero v1.0 Universal
      1.6k000Updated Oct 14, 2024Oct 14, 2024
    • llm.c

      Public
      LLM training in simple, raw C/CUDA
      Cuda
      MIT License
      2.8k000Updated Oct 11, 2024Oct 11, 2024
    • Open-O1

      Public
      Apache License 2.0
      36000Updated Oct 6, 2024Oct 6, 2024
    • An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO
      Python
      MIT License
      6000Updated Sep 30, 2024Sep 30, 2024
    • Awesome LLM compression research papers and tools.
      MIT License
      83000Updated Sep 30, 2024Sep 30, 2024
    • MRAG

      Public
      Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"
      Python
      Other
      19000Updated Sep 30, 2024Sep 30, 2024
    • Official Implementation of "CheckEmbed: Effective Verification of LLM Solutions to Open-Ended Tasks"
      Python
      Other
      2000Updated Sep 27, 2024Sep 27, 2024
    • LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
      Python
      Apache License 2.0
      149000Updated Sep 27, 2024Sep 27, 2024
    • Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"
      Python
      Other
      165000Updated Sep 25, 2024Sep 25, 2024
    • A curated list for Efficient Large Language Models
      Python
      99000Updated Sep 21, 2024Sep 21, 2024
    • quivr

      Public
      Open-source RAG Framework for building GenAI Second Brains 🧠 Build productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, Groq that you can share with users ! Efficient retrieval augmented generation framework
      Python
      Other
      3.6k000Updated Sep 13, 2024Sep 13, 2024
    • OpenLLM

      Public
      Run any open-source LLMs, such as Llama 3.1, Gemma, as OpenAI compatible API endpoint in the cloud.
      Python
      Apache License 2.0
      653000Updated Sep 9, 2024Sep 9, 2024
    • [TMLR] A curated list of language modeling researches for code and related datasets.
      118000Updated Sep 9, 2024Sep 9, 2024
    • Unofficial implementation of https://arxiv.org/pdf/2407.14679
      Python
      8100Updated Sep 7, 2024Sep 7, 2024
    • llmperf

      Public
      LLMPerf is a library for validating and benchmarking LLMs
      Python
      Apache License 2.0
      118100Updated Aug 21, 2024Aug 21, 2024
    • nanoGPT style version of Llama 3.1
      Python
      66000Updated Aug 8, 2024Aug 8, 2024
    • RouteLLM

      Public
      A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!
      Python
      Apache License 2.0
      253000Updated Aug 4, 2024Aug 4, 2024