Skip to content
View khanhi2r's full-sized avatar

Block or report khanhi2r

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Triton implementation of FlashAttention2 that adds Custom Masks.

Python 106 11 Updated Aug 14, 2024

Conan - The open-source C and C++ package manager

Python 8,603 1,013 Updated Mar 27, 2025

JSON for Modern C++

C++ 44,927 6,918 Updated Mar 25, 2025

Development repository for the Triton language and compiler

MLIR 15,013 1,891 Updated Mar 28, 2025

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 9,239 652 Updated Mar 25, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,392 1,443 Updated Mar 10, 2025

LLM inference in C/C++

C++ 77,302 11,230 Updated Mar 28, 2025

Fully open reproduction of DeepSeek-R1

Python 23,435 2,131 Updated Mar 27, 2025

Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.

Python 11,046 703 Updated Mar 24, 2025

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 5,946 517 Updated Mar 27, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 42,941 6,519 Updated Mar 28, 2025

Evaluate your speech-to-text system with similarity measures such as word error rate (WER)

Python 704 101 Updated Feb 15, 2025

Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection

Python 646 30 Updated Dec 19, 2024

A parallel implementation of gzip for modern multi-processor, multi-core machines.

C 2,726 178 Updated Feb 19, 2025

Class to simplify communication with WeedFS

Python 27 13 Updated Jul 29, 2024

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 7,940 651 Updated Mar 27, 2025

PyTorch-MPI-DDP-example

Python 17 3 Updated Mar 21, 2018

A Vectorized Python Dict/Set

C++ 117 14 Updated May 10, 2023

Large, modern dataset for speech recognition

Shell 670 62 Updated Feb 26, 2024

This package aims at simplifying the download of the AudioSet dataset.

Python 48 13 Updated Sep 28, 2023

Inference code for PaSST, using the HEAR API.

Python 31 15 Updated Jan 2, 2024

Efficient Training of Audio Transformers with Patchout

Python 328 51 Updated Jan 12, 2024

Scenic: A Jax Library for Computer Vision Research and Beyond

Python 3,480 453 Updated Mar 27, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 13,478 2,758 Updated Mar 28, 2025

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Jupyter Notebook 1 Updated May 21, 2023

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Jupyter Notebook 1,252 229 Updated May 21, 2023

A free audio dataset of spoken digits. An audio version of MNIST.

Python 638 250 Updated May 2, 2024

Torch implementation of ViT based classifier for Audio classification

Python 9 3 Updated May 22, 2022
Next