Stars
NativeLink is an open source high-performance build cache and remote execution server, compatible with Bazel, Buck2, Reclient, and other RBE-compatible build systems. It offers drastically faster b…
Implementation of popular deep learning networks with TensorRT network definition API
EAD(Entity Association Diagram) is a tool to make the development process of a Ruby on Rails project faster and easier. It is supporting all associations except has_and_belongs_to_many.
Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".
NVR with realtime local object detection for IP cameras
An algorithm for reconstructing the radiance field of a large-scale scene from a single casually captured video.
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted fo…
Scenic: A Jax Library for Computer Vision Research and Beyond
New generated dataset for fight detection in surveillance cameras.
Video embeddings for retrieval with natural language queries
Large-scale video retrieval using image queries.
☁️ Build multimodal AI applications with cloud-native stack
[CVPR 2023 - Highlight] Accelerated Coordinate Encoding (ACE): Learning to Relocalize in Minutes using RGB and Poses
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson.
Official Code for DragGAN (SIGGRAPH 2023)
SRS is a simple, high-efficiency, real-time media server supporting RTMP, WebRTC, HLS, HTTP-FLV, HTTP-TS, SRT, MPEG-DASH, and GB28181.
Official Implementation for "TEXTure: Text-Guided Texturing of 3D Shapes"
[ICCV2023] Delicate Textured Mesh Recovery from NeRF via Adaptive Surface Refinement
[ICCV'23 Workshop] SAM3D: Segment Anything in 3D Scenes
Interact with your documents using the power of GPT, 100% privately, no data leaks
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Sparsity-aware deep learning inference runtime for CPUs
Teaching robots to respond to open-vocab queries with CLIP and NeRF-like neural fields
Text2Room generates textured 3D meshes from a given text prompt using 2D text-to-image models (ICCV2023).
Mask3D predicts accurate 3D semantic instances achieving state-of-the-art on ScanNet, ScanNet200, S3DIS and STPLS3D.