Skip to content
View mattetti's full-sized avatar

Organizations

@postrank-labs @golangchallenge @go-audio

Block or report mattetti

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Converts multisamples from a source format (WAV, multisample, KMP, wavestate, NKI, SFZ, SoundFont 2) to a different destination format.

Java 212 16 Updated Dec 9, 2024

An open-source orchestral library

565 75 Updated Apr 6, 2021

Official inference repo for FLUX.1 models

Python 18,662 1,318 Updated Nov 21, 2024

A library for audio and music analysis, feature extraction.

C 2,950 122 Updated May 24, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 13,329 1,275 Updated Dec 25, 2024

an editor for spoken-word audio with automatic transcription

TypeScript 1,706 42 Updated Oct 11, 2023

Portable Executable reversing tool with a friendly GUI

C++ 2,822 174 Updated Dec 5, 2024
C 2 Updated Jul 22, 2024

[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation

Python 5,469 456 Updated Sep 9, 2024

Open source browser extension for finding, editing, exporting, optimizing, and managing SVG content.

TypeScript 872 31 Updated Sep 11, 2024

A GUI based powerful automatic datamoshing application for free! Easily apply trippy glitch effects in your videos. Contains 30+ cool glitch effects!

Python 318 14 Updated Oct 28, 2024

A basic setup for using Epic's Gameplay Ability System.

C++ 262 41 Updated Dec 19, 2024

A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM

TypeScript 2,805 355 Updated Aug 21, 2024

Code for Toon3D https://toon3d.studio/

Python 206 9 Updated Jun 13, 2024

This is the official implementation of HomoGAN, CVPR2022

Python 46 3 Updated Sep 16, 2022

Cross-platform client for PostgreSQL databases

Go 8,695 745 Updated Dec 21, 2024

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 26,378 2,540 Updated Dec 27, 2024

LLM training in simple, raw C/CUDA

Cuda 24,834 2,816 Updated Oct 2, 2024

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 22,690 5,548 Updated Aug 14, 2024

AI driven development in your terminal. Designed for large, real-world tasks.

Go 10,917 755 Updated Dec 15, 2024

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Python 2,498 226 Updated Dec 9, 2024

tiny vision language model

Jupyter Notebook 6,150 507 Updated Dec 10, 2024

The code for some apps built with Sieve.

Python 72 15 Updated Nov 22, 2024

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Python 2,136 164 Updated Dec 6, 2024

Transcription, forced alignment, and audio indexing with OpenAI's Whisper

Python 1,654 182 Updated Dec 17, 2024

ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'

Python 333 78 Updated Oct 23, 2023

A powerful tool that translates ComfyUI workflows into executable Python code.

Python 1,344 133 Updated Sep 16, 2024

[ICCV 2023] Q-Diffusion: Quantizing Diffusion Models.

Python 337 22 Updated Mar 21, 2024

A powerful set of mask-related nodes for ComfyUI

Python 378 38 Updated Jun 19, 2024
Next