Implementation of the sparse attention pattern proposed by the DeepSeek team in their "Native Sparse Attention" paper

Python 587 29 Updated Mar 26, 2025

Fully open reproduction of DeepSeek-R1

Python 23,891 2,178 Updated Apr 12, 2025

πŸ™ Guides, papers, lecture, notebooks and resources for prompt engineering

MDX 54,937 5,404 Updated Apr 5, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,504 245 Updated Apr 7, 2025

Entropy Based Sampling and Parallel CoT Decoding

Python 3,348 319 Updated Nov 13, 2024

Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044

Python 32 5 Updated Oct 3, 2024

Deep learning for dummies. All the practical details and useful utilities that go into working with real models.

Python 784 40 Updated Mar 3, 2025

A comprehensive repository of reasoning tasks for LLMs (and beyond)

JavaScript 428 48 Updated Sep 27, 2024

LLM101n: Let's build a Storyteller

33,154 1,812 Updated Aug 1, 2024
Jupyter Notebook 1,607 345 Updated Apr 11, 2025

LLM Analytics

TypeScript 654 28 Updated Oct 19, 2024

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 17,105 1,116 Updated Apr 11, 2025

Convert PDF to markdown + JSON quickly with high accuracy

Python 24,064 1,507 Updated Apr 9, 2025

🔎 Monitor deep learning model training and hardware usage from your mobile phone 📱

Python 2,140 140 Updated Apr 10, 2025

Curate better data for LLMs

Python 1,020 98 Updated Mar 19, 2024

Code for Quiet-STaR

Python 729 89 Updated Aug 21, 2024

Grok open release

Python 50,243 8,353 Updated Aug 30, 2024

A multi-programming language benchmark for LLMs

Python 240 43 Updated Jan 23, 2025

MLX: An array framework for Apple silicon

C++ 20,166 1,170 Updated Apr 13, 2025

DeepSeek LLM: Let there be answers

Makefile 6,289 973 Updated Feb 4, 2024

A quick guide (especially) for trending instruction finetuning datasets

3,001 194 Updated Nov 28, 2023

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 174,376 45,552 Updated Apr 12, 2025

πŸ“ CodeEdit App for macOS – Elevate your code editing experience. Open source, free forever.

Swift 21,593 1,059 Updated Apr 12, 2025

A terminal for a more modern age

TypeScript 63,003 3,559 Updated Mar 28, 2025

Benchmarking large language models' complex reasoning ability with chain-of-thought prompting

Jupyter Notebook 2,710 140 Updated Aug 4, 2024

Parsers for scientific papers (PDF2JSON, TEX2JSON, JATS2JSON)

Python 362 67 Updated Apr 11, 2024

The GeoV model is a large language model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER). We have shared a pre-trained 9B parameter model.

Jupyter Notebook 122 4 Updated Apr 29, 2023

High-Resolution Image Synthesis with Latent Diffusion Models

Python 40,742 5,206 Updated Oct 10, 2024

πŸ§‘β€πŸ« 60+ Implementations/tutorials of deep learning papers with side-by-side notes πŸ“; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 59,922 6,061 Updated Aug 24, 2024

Fast and memory-efficient exact attention

Python 16,847 1,600 Updated Apr 12, 2025