Skip to content
View dotnethero's full-sized avatar

Block or report dotnethero

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 4,800 464 Updated Mar 5, 2025

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 10,092 1,378 Updated Feb 24, 2025

JAX port of FLUX.1 models using flax.nnx

Python 23 Updated Sep 28, 2024

.NET news, announcements, release notes, and more!

PowerShell 21,199 4,912 Updated Mar 6, 2025

Pinecone.NET is a fully-fledged C# library for the Pinecone vector database.

C# 58 6 Updated Sep 27, 2024

LLM training in simple, raw C/CUDA

Cuda 25,938 2,970 Updated Oct 2, 2024

We invite you to visit and follow our new repository at https://github.com/microsoft/TileFusion. TiledCUDA is a highly efficient kernel template library designed to elevate CUDA C’s level of abstra…

C++ 176 11 Updated Jan 28, 2025

Transformer related optimization, including BERT, GPT

C++ 6,068 899 Updated Mar 27, 2024

pytorch from scratch in pure C/CUDA and python

C 40 1 Updated Oct 10, 2024

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++ 15,871 3,081 Updated Mar 7, 2025

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

Python 8,569 1,433 Updated Feb 26, 2025

CUDA Templates for Linear Algebra Subroutines

C++ 6,982 1,139 Updated Feb 28, 2025

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…

Python 2,243 373 Updated Mar 6, 2025

cuDF - GPU DataFrame Library

C++ 8,728 933 Updated Mar 7, 2025

Demo of the potential of C# for systems programming with the .NET native ahead-of-time compilation technology.

C# 2,045 109 Updated Jul 6, 2024

Development repository for the Triton language and compiler

MLIR 14,747 1,842 Updated Mar 7, 2025

Efficient Triton Kernels for LLM Training

Python 4,564 276 Updated Mar 7, 2025

CUDA Library Samples

Cuda 1,795 366 Updated Feb 27, 2025

ML.NET is an open source and cross-platform machine learning framework for .NET.

C# 9,117 1,900 Updated Mar 6, 2025

Relax! Flux is the ML library that doesn't make you tensor

Julia 4,583 613 Updated Mar 4, 2025

CUDA programming in Julia.

Julia 1,254 238 Updated Mar 6, 2025

Official inference repo for FLUX.1 models

Python 20,619 1,453 Updated Feb 6, 2025

C# as you know it but with Go-inspired tooling (small, selfcontained, and native executables)

C# 3,715 107 Updated Feb 24, 2025

Open-source Deep Learning library in C# with CUDA and BLAS support

C# 12 1 Updated Feb 23, 2025

PALLAIDIUM - a generative AI movie studio integrated in the Blender Video Editor.

Python 1,093 87 Updated Mar 4, 2025

WPF UI provides the Fluent experience in your known and loved WPF framework. Intuitive design, themes, navigation and new immersive controls. All natively and effortlessly.

C# 8,094 800 Updated Feb 28, 2025

Generate C# FFI from Rust for automatically brings native code and C native library to .NET and Unity.

Rust 731 58 Updated Aug 15, 2024
Next