- MosaicML
- San Francisco, CA
- https://dblalock.substack.com
Stars
Code examples and resources for DBRX, a large language model developed by Databricks
LLM training code for Databricks foundation models
Machine Learning Engineering Open Book
Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!)
A Data Streaming Library for Efficient Neural Network Training
Neighborhood Attention Transformer, arXiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arXiv 2022
Readyset is a MySQL and Postgres wire-compatible caching layer that sits in front of existing databases to speed up queries and horizontally scale read throughput. Under the hood, ReadySet caches t…
Hashed Lookup Table based Matrix Multiplication (halutmatmul) - Stella Nera accelerator
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
Examples of how to create colorful, annotated equations in LaTeX using TikZ.
Customized matrix multiplication kernels
Official PyTorch implementation of the paper "ImageNet-21K Pretraining for the Masses" (NeurIPS 2021)
Boost hardware utilization for ML training workloads via Inter-model Horizontal Fusion
A browser extension that links video explanations to research papers on arxiv.org
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Flexible and powerful tensor operations for readable and reliable code (for PyTorch, JAX, TF and others)
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
A smaller subset of 10 easily classified classes from ImageNet, and a little more French
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
PyTorch library to facilitate development and standardized evaluation of neural network pruning methods.
Staggeringly powerful macOS desktop automation with Lua
A benchmark for low-level CPU micro-architectural features
Google's differential privacy libraries.
A cheatsheet of modern C++ language and library features.