juand-r

A bibliography and survey of the papers surrounding o1

TeX 985 39 Updated Nov 16, 2024

Topological Data Analysis (TDA) for Natural Language Processing (NLP) Applications

2 Updated Dec 28, 2024

Lisp code for the textbook "Paradigms of Artificial Intelligence Programming"

Common Lisp 7,218 704 Updated Oct 15, 2024

A list of tech-related Bluesky starter packs

352 23 Updated Dec 9, 2024

Machine Learning Engineering Open Book

Python 12,093 735 Updated Dec 28, 2024

Supercharge Your Model Training

Python 5,207 428 Updated Dec 23, 2024

Large language models (LLMs) made easy: EasyLM is a one-stop solution for pre-training, fine-tuning, evaluating, and serving LLMs in JAX/Flax.

Python 2,431 261 Updated Aug 13, 2024

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 36,381 4,600 Updated Nov 18, 2024

Lexical relations data extracted from AO-CHILDES

Python 2 Updated Apr 10, 2022

Pipeline to generate the Standardized Project Gutenberg Corpus

Python 165 40 Updated Jan 5, 2024

Lecture materials for Cornell CS5785 Applied Machine Learning (Fall 2024)

Jupyter Notebook 463 152 Updated Dec 27, 2024

Llama 3 implementation, one matrix multiplication at a time

Jupyter Notebook 13,935 1,136 Updated May 23, 2024

Count and truncate text based on tokens

Python 277 9 Updated May 2, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,287 874 Updated Jul 1, 2024

Materials for a language modeling class, broadly construed

NewLisp 22 2 Updated Dec 23, 2024

Pure-Python library for adding annotations to PDFs

Python 199 46 Updated Mar 29, 2021

This repository accompanies the book "Grokking Deep Learning"

Jupyter Notebook 7,480 1,578 Updated Jun 1, 2024

CiteME is a benchmark designed to test the abilities of language models in finding papers that are cited in scientific texts.

Python 39 4 Updated Nov 1, 2024
Python 17 8 Updated Feb 1, 2023

An open-source library for contamination detection in NLP datasets and Large Language Models (LLMs).

Python 46 5 Updated Aug 13, 2024

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 11,006 1,096 Updated Dec 26, 2024

A central, open resource for data and tools related to chain-of-thought reasoning in large language models. Developed at the Samwald research group: https://samwald.info/

Jupyter Notebook 911 72 Updated Dec 16, 2024

Enforce the output format (JSON Schema, regex, etc.) of a language model

Python 1,641 71 Updated Oct 16, 2024

Natural Language Inference is fundamental to many Natural Language Processing applications such as semantic search and question answering. The task of NLI has gained significant attention in the re…

Jupyter Notebook 42 16 Updated Dec 8, 2022

[ACL 2024] A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future

377 13 Updated Jul 4, 2024

Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.

Jupyter Notebook 22 1 Updated Dec 24, 2024

A collection of word lists in machine-readable, web-native (.yml and .json) format

21 5 Updated Jul 20, 2023