Skip to content
View rohitkrishna094's full-sized avatar

Organizations

@socialat @techpanda123 @tech2pandas

Block or report rohitkrishna094

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 9,079 1,204 Updated Jan 15, 2025

LLM training in simple, raw C/CUDA

Cuda 25,091 2,865 Updated Oct 2, 2024

Desktop environment in the browser

JavaScript 10,018 845 Updated Jan 21, 2025

A testing repo to share code and thoughts on diarisation

Python 53 3 Updated Mar 26, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 864 101 Updated Jan 19, 2025

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python 5,307 462 Updated Aug 10, 2024

A fast, local neural text to speech system

C++ 7,481 549 Updated Oct 21, 2024

A multi-voice TTS system trained with an emphasis on quality

Jupyter Notebook 13,534 1,873 Updated Nov 19, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 37,013 4,587 Updated Aug 16, 2024

SOTA Open Source TTS

Python 18,518 1,398 Updated Jan 18, 2025

Autogenerate subtitles using OpenAI Whisper Model via Jellyfin, Plex, Emby, Tautulli, or Bazarr

Python 700 58 Updated Jan 9, 2025

Transcription, forced alignment, and audio indexing with OpenAI's Whisper

Python 1,711 184 Updated Jan 16, 2025

Synchronize Whisper's timestamps over an existing accurate transcription

Java 138 22 Updated May 28, 2024

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 21,343 2,214 Updated Jan 15, 2025

Draw a mockup and generate html for it

TypeScript 13,372 1,610 Updated Jul 18, 2024
TypeScript 410 27 Updated Aug 2, 2024

Medical Card is a centralized digital platform designed to consolidate medical data from various sources, enabling patients to easily track and understand their health metrics over time, identify t…

TypeScript 4 1 Updated Aug 24, 2024

Shared data types for building collaborative software

JavaScript 17,869 636 Updated Jan 17, 2025

Sync Schemas between PocketHost & Local PocketBase

HTML 8 3 Updated Oct 22, 2023

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 57,474 7,119 Updated Jan 19, 2025

NocoBase is an extensibility-first, open-source no-code/low-code platform for building business applications and enterprise solutions.

TypeScript 13,236 1,477 Updated Jan 21, 2025

OpenUI let's you describe UI using your imagination, then see it rendered live.

TypeScript 19,719 1,831 Updated Oct 21, 2024

Data structures & algorithms cheat sheet

Python 700 107 Updated Aug 12, 2024

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

Jupyter Notebook 4,509 389 Updated Apr 3, 2024

WhisperPlus: Faster, Smarter, and More Capable 🚀

Python 1,762 137 Updated Jan 6, 2025
Jupyter Notebook 7,957 562 Updated Jun 16, 2024

🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI

1,349 64 Updated May 19, 2024

Port of OpenAI's Whisper model in C/C++

C++ 37,015 3,813 Updated Jan 18, 2025

This repo just to collect user feedback.

129 19 Updated Dec 14, 2023

Popwola: The ultimate no-code popup builder for powerful, customizable popups and stress-free engagement. ✨💪🚀

TypeScript 77 15 Updated Nov 12, 2024
Next