khanhi2r

5 followers · 4 following

Stars

alexzhang13 / flashattention2-custom-mask

Triton implementation of FlashAttention2 that adds Custom Masks.

Python 106 11 Updated Aug 14, 2024

conan-io / conan

Conan - The open-source C and C++ package manager

Python 8,603 1,013 Updated Mar 27, 2025

nlohmann / json

JSON for Modern C++

C++ 44,927 6,918 Updated Mar 25, 2025

triton-lang / triton

Development repository for the Triton language and compiler

MLIR 15,013 1,891 Updated Mar 28, 2025

facebookresearch / xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 9,239 652 Updated Mar 25, 2025

Jiayi-Pan / TinyZero

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,392 1,443 Updated Mar 10, 2025

ggml-org / llama.cpp

LLM inference in C/C++

C++ 77,302 11,230 Updated Mar 28, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 23,435 2,131 Updated Mar 27, 2025

bentoml / OpenLLM

Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.

Python 11,046 703 Updated Mar 24, 2025

InternLM / lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 5,946 517 Updated Mar 27, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 42,941 6,519 Updated Mar 28, 2025

deepseek-ai / DeepSeek-V3

Python 94,459 15,271 Updated Mar 16, 2025

deepseek-ai / DeepSeek-R1

87,684 11,320 Updated Feb 24, 2025

jitsi / jiwer

Evaluate your speech-to-text system with similarity measures such as word error rate (WER)

Python 704 101 Updated Feb 15, 2025

nyrahealth / CrisperWhisper

Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection

Python 646 30 Updated Dec 19, 2024

madler / pigz

A parallel implementation of gzip for modern multi-processor, multi-core machines.

C 2,726 178 Updated Feb 19, 2025

utek / pyseaweed

Class to simplify communication with WeedFS

Python 27 13 Updated Jul 29, 2024

kyutai-labs / moshi

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 7,940 651 Updated Mar 27, 2025

xhzhao / PyTorch-MPI-DDP-example

PyTorch-MPI-DDP-example

Python 17 3 Updated Mar 21, 2018

atom-moyer / getpy

A Vectorized Python Dict/Set

C++ 117 14 Updated May 10, 2023

SpeechColab / GigaSpeech

Large, modern dataset for speech recognition

Shell 670 62 Updated Feb 26, 2024

MorenoLaQuatra / audioset-download

This package aims at simplifying the download of the AudioSet dataset.

Python 48 13 Updated Sep 28, 2023

kkoutini / passt_hear21

Inference code for PaSST, using the HEAR API.

Python 31 15 Updated Jan 2, 2024

kkoutini / PaSST

Efficient Training of Audio Transformers with Patchout

Python 328 51 Updated Jan 12, 2024

google-research / scenic

Scenic: A Jax Library for Computer Vision Research and Beyond

Python 3,480 453 Updated Mar 27, 2025

NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 13,478 2,758 Updated Mar 28, 2025

khanhi2r / ast

Forked from YuanGongND/ast

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Jupyter Notebook 1 Updated May 21, 2023

YuanGongND / ast

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Jupyter Notebook 1,252 229 Updated May 21, 2023

Jakobovski / free-spoken-digit-dataset

A free audio dataset of spoken digits. An audio version of MNIST.

Python 638 250 Updated May 2, 2024

syedecryptr / audio-spectogram-transformer

Torch implementation of ViT based classifier for Audio classification

Python 9 3 Updated May 22, 2022