mdvdv
  • ITMO University
  • Saint Petersburg

Highlights

  • Pro

Organizations

@expertspec


The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."

Python 1,510 117 Updated Nov 28, 2024

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.

Python 3,910 271 Updated Oct 5, 2024
Python 363 40 Updated Dec 13, 2024

Hazy/Dusty Image Synthesis

Python 48 6 Updated Apr 1, 2024

Towards Simulating Foggy and Hazy Images and Evaluating their Authenticity

Python 73 21 Updated Dec 11, 2020

This is a simulator that generates foggy, rainy, smoky and cloudy images from a clear remote sensing image.

Python 12 2 Updated Dec 24, 2024

Image composition toolbox: everything you want to know about image composition or object insertion

Python 567 36 Updated Dec 14, 2024

Focus on prompting and generating

Python 42,193 6,081 Updated Aug 21, 2024

A unified ensemble framework for PyTorch to improve the performance and robustness of your deep learning model.

Python 1,100 96 Updated Jun 16, 2024

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 61,419 6,555 Updated Dec 27, 2024

[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model…

Python 727 45 Updated Sep 8, 2024

High-resolution models for human tasks.

Python 4,695 268 Updated Nov 18, 2024

Deep Learning for Speech

Jupyter Notebook 81 8 Updated Nov 25, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 40,380 4,284 Updated Jul 28, 2024

Foundational Model for Speech Recognition Tasks

Python 141 10 Updated Dec 13, 2024

A flexible, free and unlimited Python tool to translate between different languages in a simple way using multiple translators.

Python 1,646 188 Updated Jul 23, 2024

Open-set detection using Wasserstein Distance and Spectral Normalisation

Python 2 Updated Jul 4, 2024

[CVPR 2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale

Python 1,114 86 Updated Oct 21, 2024

A neural network training framework within a task-based parallel programming paradigm

C++ 48 6 Updated Dec 11, 2024

Official PyTorch implementation of "TinySAM: Pushing the Envelope for Efficient Segment Anything Model"

Python 408 24 Updated Oct 5, 2024

Gaze estimation using MPIIGaze and MPIIFaceGaze

Python 315 69 Updated Jun 29, 2024

Repository of a data modeling and analysis tool based on Bayesian networks

Python 124 18 Updated Oct 14, 2024

INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processin…

650 42 Updated Dec 25, 2024

CVPR 2023-2024 Papers: Dive into advanced research presented at the leading computer vision conference. Keep up to date with the latest developments in computer vision and deep learning. Code inclu…

Python 434 29 Updated Jul 15, 2024

ICCV 2023 Papers: Discover cutting-edge research from ICCV 2023, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ suppo…

Python 948 41 Updated Sep 1, 2024

Faster Whisper transcription with CTranslate2

Python 13,153 1,104 Updated Dec 23, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 73,424 8,766 Updated Dec 1, 2024

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Jupyter Notebook 8,373 1,147 Updated Nov 13, 2024