Lists (1)
Sort Name ascending (A-Z)
Stars
Open-source live-chat, email support, omni-channel desk. An alternative to Intercom, Zendesk, Salesforce Service Cloud etc. 🔥💬
A modular graph-based Retrieval-Augmented Generation (RAG) system
LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
repository for 360 panorama image generation based on Stable Diffusion
Official Implementation of Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
CUDA accelerated rasterization of gaussian splatting
GaussianOcc: Fully Self-supervised and Efficient 3D Occupancy Estimation with Gaussian Splatting
ModelScope-Agent: An agent framework connecting models in ModelScope with the world
[ECCV 2024] Official implementation of NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
Inference and training library for high-quality TTS models.
Zero-Shot Speech Editing and Text-to-Speech in the Wild
An automation tool that enumerates subdomains then filters out xss, sqli, open redirect, lfi, ssrf and rce parameters and then scans for vulnerabilities.
Reverse Engineering: Decompiling Binary Code with Large Language Models
Garnet is a remote cache-store from Microsoft Research that offers strong performance (throughput and latency), scalability, storage, recovery, cluster sharding, key migration, and replication feat…
V3D: Video Diffusion Models are Effective 3D Generators
DeepSeek-VL: Towards Real-World Vision-Language Understanding
Upload a photo of your room to generate your dream room with AI.
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
SpeechGPT Series: Speech Large Language Models
A fancy self-hosted monitoring tool
Code for the paper: "ODIN: A Single Model for 2D and 3D Segmentation" (CVPR 2024)