Skip to content
View dblalock's full-sized avatar

Highlights

  • Pro

Block or report dblalock

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Code examples and resources for DBRX, a large language model developed by Databricks

Python 2,545 244 Updated May 1, 2024

Fast CUDA matrix multiplication from scratch

Cuda 664 93 Updated Dec 28, 2023

LLM training code for Databricks foundation models

Python 4,181 553 Updated Mar 21, 2025

UniverSeg: Universal Medical Image Segmentation

Python 535 53 Updated Jul 15, 2023

Machine Learning Engineering Open Book

Python 13,196 803 Updated Mar 9, 2025

Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!)

Python 1,252 76 Updated Dec 18, 2024

Fast and flexible reference benchmarks

Shell 452 126 Updated Aug 14, 2024

A Data Streaming Library for Efficient Neural Network Training

Python 1,258 157 Updated Mar 5, 2025

Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022

Python 1,100 87 Updated May 15, 2024

Readyset is a MySQL and Postgres wire-compatible caching layer that sits in front of existing databases to speed up queries and horizontally scale read throughput. Under the hood, ReadySet caches t…

Rust 4,896 138 Updated Mar 21, 2025

Supercharge Your Model Training

Python 5,310 435 Updated Mar 21, 2025

Hashed Lookup Table based Matrix Multiplication (halutmatmul) - Stella Nera accelerator

Python 210 12 Updated Dec 10, 2023

Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals

Python 12,536 406 Updated Mar 8, 2025

Bagua Speeds up PyTorch

Python 879 81 Updated Aug 1, 2024

Examples of how to create colorful, annotated equations in Latex using Tikz.

TeX 3,800 217 Updated Jul 12, 2022

Customized matrix multiplication kernels

Jupyter Notebook 53 6 Updated Mar 5, 2022

Official Pytorch Implementation of: "ImageNet-21K Pretraining for the Masses"(NeurIPS, 2021) paper

Python 755 71 Updated Jan 11, 2023

Boost hardware utilization for ML training workloads via Inter-model Horizontal Fusion

Python 32 11 Updated May 15, 2024

A browser extension that links video explanations to research papers on arxiv.org

JavaScript 412 29 Updated Nov 5, 2023

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 37,566 4,311 Updated Mar 21, 2025

Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)

Python 8,805 362 Updated Feb 9, 2025

📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.

27,831 3,736 Updated Jul 18, 2024

A smaller subset of 10 easily classified classes from Imagenet, and a little more French

Jupyter Notebook 1,017 76 Updated Sep 26, 2022

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

C++ 5,327 629 Updated Mar 21, 2025

PyTorch library to facilitate development and standardized evaluation of neural network pruning methods.

Python 428 74 Updated Jul 7, 2023

Staggeringly powerful macOS desktop automation with Lua

Objective-C 12,678 595 Updated Feb 27, 2025

A benchmark for low-level CPU micro-architectural features

C++ 715 64 Updated Feb 8, 2022

Google's differential privacy libraries.

Go 3,128 369 Updated Mar 12, 2025

A cheatsheet of modern C++ language and library features.

20,222 2,143 Updated Oct 15, 2024
Next