Mahidol University
Stars
Universal Romanizer that can convert any Unicode script to Roman (Latin) script
Official PyTorch Implementation of CleanUNet (ICASSP 2022)
WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide seamless conversations with an AI.
A repository that showcases how you can use ZenML with Git
SALMONN: Speech Audio Language Music Open Neural Network
Fast and accurate automatic speech recognition (ASR) for edge devices
AirLLM 70B inference with single 4GB GPU
A Python package to build AI-powered real-time audio applications
This repository contains the PyTorch implementation of the baseline models from the paper Utterance-level Dialogue Understanding: An Empirical Study
PyTorch implementation of LSTM/BERT-CRF for named entity recognition
Generate high-definition short videos with one click using large AI models (LLMs).
A high-throughput and memory-efficient inference and serving engine for LLMs
Speaker change detection using SincNet and an LSTM/Transformer
Retrieve, Read and LinK: Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget (ACL 2024)
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a ca…
Triton Python, C++ and Java client libraries, and GRPC-generated client examples for Go, Java and Scala.
Thonburian Whisper: Open models for fine-tuned Whisper in Thai. Try our demo on Hugging Face Spaces:
Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deplo…
The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus.
SincNet is a neural architecture for efficiently processing raw audio samples.
PyTorch implementation of the paper "Dialogue Act Classification with Context-Aware Self-Attention" for dialogue act classification with a generic dataset class and PyTorch-Lightning trainer
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.