Stars
A python package to analyze and compare voices with deep learning
🔊 Text-Prompted Generative Audio Model
🔉 👦 👧Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)
A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to ones of native speech.
Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Faster Whisper transcription with CTranslate2
A collection of everyday web-components and libraries.
notes for software engineers getting up to speed on new AI developments. Serves as datastore for https://latent.space writing, and product brainstorming, but has cleaned up canonical references und…
Open-source vector similarity search for Postgres
utils to use word embedding models like word2vec vectors in a PostgreSQL database
🦜🔗 Build context-aware reasoning applications
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Simple lib for letting JS talk to Flipper.rb
HTML5 canvas based smooth signature drawing
SDK for interacting with stability.ai APIs (e.g. stable diffusion inference)
A latent text-to-image diffusion model
Tabler is free and open-source HTML Dashboard UI Kit built on Bootstrap
Support for doing time math in business hours and days
Wrapper for calling OpenAI and GPT-3's HTTP APIs
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
Easy receipts and invoices for your Ruby on Rails applications
Ruby wrapper around pHash, the perceptual hash library for detecting duplicate multimedia files
Ideas for creating and sustaining high performance organizations