Lists (4)
Sort Name ascending (A-Z)
Stars
A collection of deep learning based RGB-T-Fusion methods, codes, and datasets. The main directions involved are Multispectral Pedestrian Detection, RGB-T Aerial Object Detection, RGB-T Semantic Seg…
VMamba: Visual State Space Models,code is based on mamba
This repository contains the PyTorch implementation of the CVPR'2024 paper (Highlight), IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object Detection.
The official version of the paper "MMI-Det: Exploring Multi-Modal Integration for Visible and Infrared Object Detection"
Official implementation for “MambaDFuse: A Mamba-based Dual-phase Model for Multi-modality Image Fusion.”
Pan-Mamba: Effective Pan-Sharpening with State Space Model
This is official Pytorch implementation of "SwinFusion: Cross-domain Long-range Learning for General Image Fusion via Swin Transformer"
Code for the paper: "FusionMamba: Efficient Image Fusion with State Space Model", TGRS, 2024.
Code of paper 'Stochastic Layer-Wise Shuffle: A Good Practice to Improve Vision Mamba Training'
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
Codes for SAMF: Small-Area-Aeare Multi-Focus Image Fusion for Object Detection (ICASSP 2024 Oral)
FusionMamba: Dynamic Feature Enhancement for Multimodal Image Fusion with Mamba
the official pytorch implementation of “Mamba-YOLO:SSMs-based for Object Detection”
mujianyu / TwoStream_Yolov8
Forked from ultralytics/ultralyticsNEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
Small Object Detection Algorithm Incorporating Swin Transformer for Tea Buds
Use visible and infrared images to train the network. This method is better to face the dark environment.
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
[CVPR2022] End-to-End Reconstruction-Classification Learning for Face Forgery Detection
[ICCV 2023] Official implementation of the paper: "DIRE for Diffusion-Generated Image Detection"
Instant voice cloning by MIT and MyShell. Audio foundation model.
A curated list of articles and codes related to face forgery generation and detection.
你是否曾经幻想过与自己的虚拟人交互?现在,使用PaddleAvatar,您可以将自己的图像、音频和视频转化为一个逼真的数字人视频,与其进行人机交互。 PaddleAvatar是一种基于PaddlePaddle深度学习框架的数字人生成工具,基于Paddle的许多套件,它可以将您的数字图像、音频和视频合成为一个逼真的数字人视频。除此之外,PaddleAvatar还支持进一步的开发,例如使用自然语…
A Tensorflow implementation of AnimeGAN for fast photo animation ! This is the Open source of the paper 「AnimeGAN: a novel lightweight GAN for photo animation」, which uses the GAN framwork to trans…
PyTorch implementation of AnimeGANv2
Use AnimeGANv3 to make your own animation works, including turning photos or videos into anime.