- ITMO University
- Saint Petersburg
Lists (4)
💻 Computer Vision
Libraries and tools for working with Computer Vision
🛠️ Development
🚀 Machine Learning
Machine Learning Frameworks and Hyperparameter Tuning
💬 Natural Language Processing
Libraries and tools for working with Natural Language Processing
Stars
The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
Towards Simulating Foggy and Hazy Images and Evaluating their Authenticity
This is a simulator that generates foggy, rainy, smoky and cloudy images over a clear remote sensing image.
Image composition toolbox: everything you want to know about image composition or object insertion
A unified ensemble framework for PyTorch to improve the performance and robustness of your deep learning model.
The most powerful and modular diffusion model GUI, API and backend with a graph/nodes interface.
[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model…
High-resolution models for human tasks.
Deep Learning for Speech
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Foundational Model for Speech Recognition Tasks
A flexible, free and unlimited Python tool to translate between different languages in a simple way using multiple translators.
Open-set detection using Wasserstein Distance and Spectral Normalisation
[CVPR 2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale
A neural network training framework within a task-based parallel programming paradigm
Official PyTorch implementation of "TinySAM: Pushing the Envelope for Efficient Segment Anything Model"
Gaze estimation using MPIIGaze and MPIIFaceGaze
Repository of a data modeling and analysis tool based on Bayesian networks
INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processin…
CVPR 2023-2024 Papers: Dive into advanced research presented at the leading computer vision conference. Keep up to date with the latest developments in computer vision and deep learning. Code inclu…
ICCV 2023 Papers: Discover cutting-edge research from ICCV 2023, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ suppo…
Faster Whisper transcription with CTranslate2
Robust Speech Recognition via Large-Scale Weak Supervision
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node