Large Concept Models: Language modeling in a sentence representation space

Python 1,180 93 Updated Dec 16, 2024

noise_step: Training in 1.58b With No Gradient Memory

TeX 198 9 Updated Dec 25, 2024

⚙️🦀 Build portable, modular & lightweight Fullstack Agents

Rust 2,029 166 Updated Dec 19, 2024

A generative world for general-purpose robotics & embodied AI learning.

Python 20,957 1,622 Updated Dec 31, 2024

A Lossless Compression Library for AI pipelines

Python 205 26 Updated Jan 1, 2025

Fastest kernels written from scratch

Cuda 85 12 Updated Nov 30, 2024

Optimize GEMM with tensorcore step by step

17 5 Updated Dec 17, 2023

🦀 Rust runtime for ▲ Vercel Serverless Functions

Rust 880 51 Updated Dec 18, 2024

PyTorch per step fault tolerance (actively under development)

Python 41 8 Updated Dec 24, 2024
2 Updated Dec 10, 2024

🙌 OpenHands: Code Less, Make More

Python 39,629 4,459 Updated Jan 1, 2025

A collection of tools for maximizing the power of your exobrain

Rust 10 Updated Dec 11, 2024

An Open-Source Machine Learning Framework in Rust Δ

Rust 255 12 Updated Dec 31, 2024

DeMo: Decoupled Momentum Optimization

Python 159 6 Updated Dec 2, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 36,068 4,179 Updated Dec 30, 2024

A utility to inspect, validate, sign and verify machine learning model files.

Rust 52 1 Updated Nov 8, 2024

some mixture of experts architecture implementations

Python 12 2 Updated Mar 22, 2024

nanoGPT-like codebase for LLM training

Python 81 24 Updated Dec 26, 2024

The Grammar Checker for Developers

Rust 2,672 57 Updated Dec 31, 2024

PyTorch bindings for CUTLASS grouped GEMM.

Cuda 79 27 Updated Jul 18, 2024

shader-like effects library for ratatui applications

Rust 812 8 Updated Dec 30, 2024

Official Implementation of "ADOPT: Modified Adam Can Converge with Any β2 with the Optimal Rate"

Jupyter Notebook 404 19 Updated Dec 12, 2024

Official Implementation of "ADOPT: Modified Adam Can Converge with Any β2 with the Optimal Rate"

Jupyter Notebook 2 Updated Nov 22, 2024

Can AdamW written in Triton be as performant as fused CUDA impl?

Python 8 1 Updated Nov 22, 2024

Convert markdown to pdf (a md to pdf transpiler)

Rust 125 5 Updated Dec 27, 2024

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Python 137,004 27,423 Updated Dec 31, 2024

Setup a specific Rust toolchain with extra features like problem matchers

Rust 209 32 Updated Oct 21, 2024
Python 2 4 Updated Dec 28, 2024

About the book "Writing for Developers: Blogs That Get Read," which is all about writing more compelling engineering blog posts. Available on Amazon as well as Manning. By Piotr Sarna & Cynthia Dun…

152 4 Updated Dec 30, 2024