Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Stars
Composable building blocks to build Llama Apps
ALIEN is a CUDA-powered artificial life simulation program.
Fast bare-bones BPE for modern tokenizer training
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
An self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.
Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""
Building a quick conversation-based search demo with Lepton AI.
SGLang is a fast serving framework for large language models and vision language models.
Landmark Attention: Random-Access Infinite Context Length for Transformers
Doing simple retrieval from LLM models at various context lengths to measure accuracy
A high-throughput and memory-efficient inference and serving engine for LLMs
Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"
Repository for StripedHyena, a state-of-the-art beyond Transformer architecture
Doing simple retrieval from LLM models at various context lengths to measure accuracy
A series of large language models trained from scratch by developers @01-ai
A guidance language for controlling large language models.
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
Official inference library for Mistral models
A natural language interface for computers
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"
Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive arch…
Large Language Model Text Generation Inference