- University of Moratuwa
- Sri Lanka
- https://ahmdnish.netlify.app/
Stars
SIGKDD'2019: DeepGBM: A Deep Learning Framework Distilled by GBDT for Online Prediction Tasks
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Overview and tutorial of the LangChain Library
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
A demonstration of how to produce speech in a particular emotion from text, achieved by fine-tuning a TTS model on emotion-labelled speech data, formulating it as a multi-modal prob…
A multimodal approach to emotion recognition using audio and text.
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io/vallex/
This repository holds the code for working with data from counselchat.com
A Gradio web UI for Large Language Models. Supports LoRA/QLoRA fine-tuning, RAG (retrieval-augmented generation), and chat.
Fine-tune LLMs in a few lines of code (Text2Text, Text2Speech, Speech2Text)
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset
Reference implementation of real-time autoregressive WaveNet inference
MaSS - Multilingual corpus of Sentence-aligned Spoken utterances
This is the TEPROLIN Romanian text processing platform, developed in the ReTeRom project.
Text-to-Speech for languages of India
The Emotional Voices Database: Towards Controlling the Emotional Expressiveness in Voice Generation Systems
Open source audio annotation tool for humans
PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.
StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data
A complete guide to start and improve in machine learning (ML), artificial intelligence (AI) in 2024 without ANY background in the field and stay up-to-date with the latest news and state-of-the-ar…