Stars
(Pattern Recognition) PyTorch implementation of “HTR-VT: Handwritten Text Recognition with Vision Transformer”
Handwritten Text Recognition and Character Detection
ECCV24 "ReMamber: Referring Image Segmentation with Mamba Twister" official repository.
Official repo for Images that Sound: special spectrograms, generated by diffusion models, that can be viewed as images and played as sound
BBDM: Image-to-image Translation with Brownian Bridge Diffusion Models
This repo contains the official PyTorch implementation of: Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation
SEAN: Image Synthesis with Semantic Region-Adaptive Normalization (CVPR 2020, Oral)
StarGAN v2 - Official PyTorch Implementation (CVPR 2020)
[ICCV 2023] Video Background Music Generation: Dataset, Method and Evaluation
[CVPR'23] MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
Official PyTorch implementation of SynDiff described in the paper (https://arxiv.org/abs/2207.08208).
A collection of resources and papers on Diffusion Models
CoMoGAN: continuous model-guided image-to-image translation. CVPR 2021 oral.
[ECCV 2020] In-Domain GAN Inversion for Real Image Editing (PyTorch code)
Notes, exercises, old exams from the universities where I studied
This folder contains the final project of "Virtual and Augmented Reality Systems" (second year of the Master's degree in Computer Engineering). The project involves creating a demo of the game "Boc…
This repository contains the PyTorch code for our ICIAP 2021 paper “Avoiding Shortcuts in Unpaired Image-to-Image Translation”.
A Mask R-CNN Keras implementation with Modanet annotations on the Paperdoll dataset
Unsupervised Semantic Segmentation by Distilling Feature Correspondences
A PyTorch implementation of "Unsupervised Attention-Guided Image-to-Image Translation"