Stars
DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding
An open-source computer vision framework to build and deploy apps in minutes
Large Concept Models: Language modeling in a sentence representation space
IPED Digital Forensic Tool. It is an open source software that can be used to process and analyze digital evidence, often seized at crime scenes by law enforcement or in a corporate investigation bβ¦
LOTUS: A semantic query engine for fast and easy LLM-powered data processing
Open and efficient video watermarking
Official Implementations for Paper - AniDoc: Animation Creation Made Easier
Learn how to use AI models with prompt engineering
Latitude is the open-source prompt engineering platform to build, evaluate, and refine your prompts with AI
This repository provides programs to build Retrieval Augmented Generation (RAG) code for Generative AI with LlamaIndex, Deep Lake, and Pinecone leveraging the power of OpenAI and Hugging Face modelβ¦
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
A collection of guides and examples for the Gemma open models from Google.
A minimal and universal controller for FLUX.1.
AI agent for building React Native apps
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
π«πππΆππ πΉπππ πΎππππ π·πππππππ πππ π¨πππππππ π«πππΆππ π¬ππππππππ [π©πππππππ ππ π¨π ππππππ ]
LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
The Open Cookbook for Top-Tier Code Large Language Model
Train and Deploy an ML REST API to predict crypto prices, in 10 steps
Document to Markdown OCR library with Llama 3.2 vision
Books, Presentations, Workshops, Notebook Labs, and Model Zoo for Software Engineers and Data Scientists wanting to learn the TF.Keras Machine Learning framework
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos