Skip to content
View TuanNguyen27's full-sized avatar
☀️
☀️

Block or report TuanNguyen27

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Class notes for the course "Long Term Memory in AI - Vector Search and Databases" COS 597A @ Princeton Fall 2023

TeX 313 33 Updated Nov 18, 2023

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Python 37,152 3,253 Updated Aug 17, 2024

A hyper-fast local vector database for use with LLM Agents. Now accepting SAFEs at $135M cap.

Python 1,381 85 Updated Aug 29, 2024

Fast SHAP value computation for interpreting tree-based models

Python 527 33 Updated Jun 26, 2023

The release of the Twitter algorithm, annotated for recsys

487 27 Updated Apr 15, 2023

Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask without any rewrites.

Jupyter Notebook 112 19 Updated Mar 29, 2024

Statistical Rethinking Course for Jan-Mar 2023

R 2,234 251 Updated Nov 28, 2023

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Python 9,852 753 Updated Dec 30, 2024

Flyte Documentation 📖

Python 77 121 Updated Dec 21, 2024

The Patterns of Scalable, Reliable, and Performant Large-Scale Systems

59,711 6,099 Updated Dec 14, 2024

Neural Networks: Zero to Hero

Jupyter Notebook 12,543 1,692 Updated Aug 18, 2024

by ex-googlers, for ex-googlers - a lookup table of similar tech & services

14,684 1,045 Updated Jul 26, 2024

The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

Python 7,251 798 Updated Dec 27, 2024

Approximate Nearest Neighbor Search for Sparse Data in Python!

Python 919 145 Updated Oct 2, 2020

It is my belief that you, the postgraduate students and job-seekers for whom the book is primarily meant will benefit from reading it; however, it is my hope that even the most experienced research…

4,541 297 Updated Jan 21, 2022

Coarse-grained lineage and tracing for machine learning pipelines.

Python 468 29 Updated Nov 11, 2022

A collection of (mostly) technical things every software developer should know about

85,703 7,901 Updated Aug 6, 2024

A light-weight, flexible, and expressive statistical data testing library

Python 3,502 316 Updated Dec 31, 2024

A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.

Python 2,028 94 Updated Sep 21, 2024

Source code accompanying O'Reilly book: Machine Learning Design Patterns

Jupyter Notebook 1,910 534 Updated Apr 28, 2021

Elegant easy-to-use neural networks + scientific computing in JAX. https://docs.kidger.site/equinox/

Python 2,171 146 Updated Dec 24, 2024

📌 Papers, guides, and mentor interviews on applying machine learning for ApplyingML.com—the ghost knowledge of machine learning.

MDX 195 32 Updated Jun 5, 2024

Tuplex is a parallel big data processing framework that runs data science pipelines written in Python at the speed of compiled code. Tuplex has similar Python APIs to Apache Spark or Dask, but rath…

C++ 809 45 Updated Mar 28, 2024

Automatically visualize your pandas dataframe via a single print! 📊 💡

Python 5,234 370 Updated Mar 20, 2024

Preparation links and resources for system design questions

8,885 2,476 Updated May 10, 2024

📝 Design doc template & examples for machine learning systems (requirements, methodology, implementation, etc.)

567 98 Updated Mar 16, 2023

State of the Art Natural Language Processing

Scala 3,891 716 Updated Dec 31, 2024

MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvements in both training algorithms and models.

Python 340 71 Updated Dec 21, 2024

A C++ standalone library for machine learning

C++ 5,309 498 Updated Dec 20, 2024

PyTorch implementation of TabNet paper : https://arxiv.org/pdf/1908.07442.pdf

Python 2,676 492 Updated Oct 23, 2024
Next