Skip to content
@LLMNexus

LLMNexus

Popular repositories Loading

  1. Compact-Language-Models-via-Pruning-and-Knowledge-Distillation Compact-Language-Models-via-Pruning-and-Knowledge-Distillation Public

    Forked from alperiox/Compact-Language-Models-via-Pruning-and-Knowledge-Distillation

    Unofficial implementation of https://arxiv.org/pdf/2407.14679

    Python 1

  2. llmperf llmperf Public

    Forked from ray-project/llmperf

    LLMPerf is a library for validating and benchmarking LLMs

    Python 1

  3. TensorRT-LLM TensorRT-LLM Public

    Forked from NVIDIA/TensorRT-LLM

    TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

    C++ 1

  4. LLM101n LLM101n Public

    Forked from karpathy/LLM101n

    LLM101n: Let's build a Storyteller

  5. llm.c llm.c Public

    Forked from karpathy/llm.c

    LLM training in simple, raw C/CUDA

    Cuda

  6. build-nanogpt build-nanogpt Public

    Forked from karpathy/build-nanogpt

    Video+code lecture on building nanoGPT from scratch

    Python

Repositories

Showing 10 of 39 repositories
  • awesome-llm-apps Public Forked from Shubhamsaboo/awesome-llm-apps

    Collection of awesome LLM apps with RAG using OpenAI, Anthropic, Gemini and opensource models.

    LLMNexus/awesome-llm-apps’s past year of commit activity
    Python 0 Apache-2.0 989 0 0 Updated Dec 23, 2024
  • SPaR Public Forked from thu-coai/SPaR
    LLMNexus/SPaR’s past year of commit activity
    Python 0 Apache-2.0 2 0 0 Updated Dec 17, 2024
  • kvpress Public Forked from NVIDIA/kvpress

    LLM KV cache compression made easy

    LLMNexus/kvpress’s past year of commit activity
    Python 0 Apache-2.0 14 0 0 Updated Dec 9, 2024
  • mlc-llm Public Forked from mlc-ai/mlc-llm

    Universal LLM Deployment Engine with ML Compilation

    LLMNexus/mlc-llm’s past year of commit activity
    Python 0 Apache-2.0 1,646 0 0 Updated Nov 25, 2024
  • nano-sparse-attention Public Forked from PiotrNawrot/nano-sparse-attention

    The simplest implementation of recent Sparse Attention patterns for efficient LLM inference.

    LLMNexus/nano-sparse-attention’s past year of commit activity
    Jupyter Notebook 0 4 0 0 Updated Nov 19, 2024
  • LLMSuperWeight Public Forked from mengxiayu/LLMSuperWeight

    Code for studying the super weight in LLM

    LLMNexus/LLMSuperWeight’s past year of commit activity
    Jupyter Notebook 0 6 0 0 Updated Nov 11, 2024
  • LLMs-from-scratch Public Forked from rasbt/LLMs-from-scratch

    Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

    LLMNexus/LLMs-from-scratch’s past year of commit activity
    Jupyter Notebook 0 4,622 0 0 Updated Nov 9, 2024
  • GOT-OCR2.0 Public Forked from Ucas-HaoranWei/GOT-OCR2.0

    Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

    LLMNexus/GOT-OCR2.0’s past year of commit activity
    Python 0 566 0 0 Updated Oct 28, 2024
  • spiritlm Public Forked from facebookresearch/spiritlm

    Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".

    LLMNexus/spiritlm’s past year of commit activity
    Python 0 56 0 0 Updated Oct 24, 2024
  • MMLU-Pro Public Forked from TIGER-AI-Lab/MMLU-Pro

    The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark" [NeurIPS 2024]

    LLMNexus/MMLU-Pro’s past year of commit activity
    Python 0 Apache-2.0 25 0 0 Updated Oct 22, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…