pylSER

8 followers · 11 following

TikTok
HKUST

Stars

virattt / ai-hedge-fund

An AI Hedge Fund Team

Python 19,036 3,443 Updated Mar 23, 2025

deepseek-ai / open-infra-index

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

6,921 224 Updated Mar 4, 2025

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 40,265 6,622 Updated Dec 9, 2024

karpathy / minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 21,619 2,796 Updated Aug 15, 2024

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 5,530 539 Updated Mar 23, 2025

triton-lang / triton

Development repository for the Triton language and compiler

MLIR 14,956 1,881 Updated Mar 23, 2025

deepseek-ai / DeepSeek-V3

Python 92,905 15,109 Updated Mar 16, 2025

mit-han-lab / streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 6,831 378 Updated Jul 11, 2024

s3prl / s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Python 2,356 496 Updated Mar 11, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 42,442 6,428 Updated Mar 24, 2025

TowerYsable / ASR_awesome

Frontier papers in speech recognition

44 6 Updated Jan 8, 2022

linan2 / Voice-activity-detection-VAD-paper-and-code

Voice activity detection (VAD) paper and code (From 198*~) and its classification.

95 13 Updated Feb 6, 2024

liusongxiang / Large-Audio-Models

Keep track of big models in audio domain, including speech, singing, music etc.

474 28 Updated Sep 26, 2024

ddlBoJack / Awesome-Speech-Pretraining

Paper, Code and Statistics for Self-Supervised Learning and Pre-Training on Speech.

205 14 Updated Jan 18, 2024

AIGC-Audio / AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Python 10,114 863 Updated Jul 6, 2024

OptimalScale / LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Python 8,384 834 Updated Mar 22, 2025

deepspeedai / Megatron-DeepSpeed

Forked from NVIDIA/Megatron-LM

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 2,025 353 Updated Feb 25, 2025

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 78,776 9,441 Updated Jan 4, 2025

k2-fsa / icefall

Python 1,054 326 Updated Feb 27, 2025

riffusion / riffusion-hobby

Stable diffusion for real-time music generation

Python 3,601 422 Updated Jul 22, 2024

riffusion / riffusion-app-hobby

Stable diffusion for real-time music generation (web app)

TypeScript 2,638 203 Updated Jul 22, 2024

Harmonai-org / sample-generator

Tools to train a generative model on arbitrary audio samples

Jupyter Notebook 1,094 172 Updated Apr 29, 2024

ilaria-manco / multimodal-ml-music

List of academic resources on Multimodal ML for Music

TeX 292 11 Updated Mar 25, 2023

kaldi-asr / kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Shell 14,684 5,347 Updated Jan 28, 2025

MIW-91 / klDD

the clustering model and the method of disease early-warning detection based on differential distribution

Python 2 Updated Sep 12, 2021

wenet-e2e / wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Python 4,407 1,122 Updated Mar 20, 2025

hraban / opus

Go wrapper for libopus (golang)

Go 293 62 Updated Jun 21, 2024

qiuqiangkong / audioset_tagging_cnn

Python 1,432 262 Updated Jul 25, 2024

RasaHQ / rasa

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

Python 19,706 4,727 Updated Mar 17, 2025

ray-project / ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 36,164 6,139 Updated Mar 24, 2025