Starred repositories
GPU Accelerated MediaPipe Plugin for TouchDesigner
Code execution utilities for Open WebUI & Ollama
Model2Vec: Distill a Small Fast Model from any Sentence Transformer
A mod for TI-84 calculators to turn them into cheating devices.
Official inference library for Mistral models
Clapper.app, a video synthesizer and sequencer designed for the age of AI cinema
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
StockBot powered by Groq: Lightning Fast AI Chatbot that Responds With Live Interactive Stock Charts, Financials, News, Screeners, and More. Powered by Llama3-70b on Groq, Vercel AI SDK, and Tradin…
Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.
MindSQL: A Python Text-to-SQL RAG Library simplifying database interactions. Seamlessly integrates with PostgreSQL, MySQL, SQLite, Snowflake, and BigQuery. Powered by GPT-4 and Llama 2, it enables …
Notebooks for fine tuning pali gemma
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
Enjoy the magic of Diffusion models!
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
ML-powered speech recognition directly in your browser
A generative speech model for daily dialogue.
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
Letta (fka MemGPT) is a framework for creating stateful LLM services.
Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and increase inference speed.