Lists (1)
Sort Name ascending (A-Z)
Stars
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
Multi functional app to find duplicates, empty folders, similar images etc.
An open-source RAG-based tool for chatting with your documents.
Prompt, run, edit, and deploy full-stack web applications
fri12013 / fish-speech1.5
Forked from fishaudio/fish-speechSOTA Open Source TTS
Robust Speech Recognition via Large-Scale Weak Supervision
Super simple and easy to use tool to visualize class & method structure of a Python project
petermost / Sourcetrail
Forked from CoatiSoftware/SourcetrailSourcetrail - free and open-source interactive source explorer
Let's all work together on a 3d scanner benchmark for desktop 3d scanners
Jump to definition for 50+ languages without configuration
an Emacs "jump to definition" package for 50+ languages
Fullstack app framework for web, desktop, mobile, and more.
Secret Legacy of the Ancient Caves - back from the dead!
Can you design a controller to steer a simulated car?
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…
AI Crash Course to help busy builders catch up to the public frontier of AI research in 2 weeks
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".
Drivers and libraries for the Xbox Kinect device on Windows, Linux, and OS X
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
Distributed LLM and StableDiffusion inference for mobile, desktop and server.