Stars
Code repository for the NeurIPS 2024 paper "Navigating the Effect of Parametrization for Dimensionality Reduction".
Simplified implementation of UMAP like dimensionality reduction algorithm
Opinionated provides simple, clean stylesheets for plotting with matplotlib and seaborn.
R package implementing edge bundling algorithms
Creating beautiful plots of data maps
A 1D analogue of the MNIST dataset for measuring spatial biases and answering Science of Deep Learning questions.
Some hidden knowledge found in the 20 Newsgroups dataset
Approximate nearest neighbor search in Python.
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
🛰️ An approximate nearest-neighbor search library for Python and Java with a focus on ease of use, simplicity, and deployability.
Official repo for the SOC-Embedding blogpost: https://www.rpisoni.dev/posts/self-organizing-class-embeddings/
utilities for decoding deep representations (like sentence embeddings) back to text
Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, …
A specification for OpenInference, a semantic mapping of ML inferences
Companion repository to our Lause et al. (2023) preprint "Compound models and Pearson residuals for normalization of single-cell RNA-seq data without UMIs" (bioRxiv))
Testing if UMAP/tSNE overfit their intrinsic dimensionality
Reimplentation of paper using gzip + knn for text classification
C++/Python implementation of Nearest Neighbor Descent for efficient approximate nearest neighbor search
IsaGrue / nb_black
Forked from dnanhkhoa/nb_blackA simple extension for Jupyter Notebook and Jupyter Lab to beautify Python code automatically using black.
A fast multi-core implementation of HDBSCAN for low dimensional Euclidean spaces
The landscape of biomedical research