Stars
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Official Repository of ChatCaptioner
Command-line program to download videos from YouTube.com and other video sites
Code for CVPR 2022 paper "Generating Useful Accident-Prone Driving Scenarios via a Learned Traffic Prior"
High resolution image to image translation using multi-scale gradients
Official implementation of "A Shared Representation for Photorealistic Driving Simulators" in PyTorch.
Spherical mercator tile and coordinate utilities
Semantic segmentation on aerial and satellite imagery. Extracts features such as: buildings, parking lots, roads, water, clouds
A curated list of papers & resources linked to 3D reconstruction from images.
A series of convenience functions to make basic image processing operations such as translation, rotation, resizing, skeletonization, and displaying Matplotlib images easier with OpenCV and Python.
Animate elements as they scroll into view.
An implementation of a sequence to sequence neural network using an encoder-decoder
This GitHub provides different DeepFakes Detectors using facial regions and considering three different state-of-the-art fake detection systems.
Official repo for paper "AgileGAN: Stylizing Portraits by Inversion-Consistent Transfer Learning"
DECA: Detailed Expression Capture and Animation (SIGGRAPH 2021)
StyleFlow: Attribute-conditioned Exploration of StyleGAN-generated Images using Conditional Continuous Normalizing Flows (ACM TOG 2021)
Simple, robust LastFM API client (for public data)
OpenFace – a state-of-the art tool intended for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation.
Missing the daily emails? Now you can make as many as you want!
This program show you IMSI numbers of cellphones around you.