- London, United Kingdom
- yasserqureshi.com
- @yasserqureshi0
- in/yasser-qureshi
Highlights
- Pro
Stars
Arabic speech recognition, classification and text-to-speech.
Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! ๐ฆฅ
๐ธ๐ฌ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
๐ Text-Prompted Generative Audio Model
Self-host the powerful Dia TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), support for SafeTensors/BF16, voice cloning, dialogue generation, โฆ
A powerful framework for building realtime voice AI agents ๐ค๐๏ธ๐น
End-to-end stack for WebRTC. SFU media server and SDKs.
Open Source framework for voice and multimodal conversational AI
Fast voice assistant powered by Groq, Cartesia, and Vercel.
Python package to handle and analyze GC-IMS data.
Productive, portable, and performant GPU programming in Python.
Run your own AI cluster at home with everyday devices ๐ฑ๐ป ๐ฅ๏ธโ
Trajectory Clustering: A Partition-and-Group Framework(2007)็ฎๆณๅฎ็ฐ
Unofficial PyTorch Implementation of "Augmenting Convolutional networks with attention-based aggregation"
Implement Trajectory Clustering: A Partition-and-Group Framework
LaNMT: Latent-variable Non-autoregressive Neural Machine Translation with Deterministic Inference
Official Pytorch implementation of the paper "Action-Conditioned 3D Human Motion Synthesis with Transformer VAE", ICCV 2021
Transformer-based Conditional Variational Autoencoder for Controllable Story Generation
Variational Animal Motion Embedding - A tool for time series embedding and clustering
Official implementation of DeepLabCut: Markerless pose estimation of user-defined features with deep learning for all animals incl. humans
Papers from the computer science community to read and discuss.