Stars
AAAI 2025 - Yuan: Yielding Unblemished Aesthetics through A Unified Network for Visual Imperfections Removal in Generated Images
This is the official code implement for AAAI 2025 paper ``Defeasible Visual Entailment: Benchmark, Evaluator, and Reward-Driven Optimization''.
[TMLR 2024] Efficient Large Language Models: A Survey
[AAAI 2025] Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback
Source code for AAAI'25 paper "Component-Level Segmentation for Oracle Bone Inscription Decipherment"
[AAAI'2025] The official implementation code of SIGMA
[AAAI2025] FedCFA: Alleviating Simpson’s Paradox in Model Aggregation with Counterfactual Federated Learning
[AAAI 2025] PAT: Pruning-Aware Tuning for Large Language Models
AAAI 2025 (Oral), BrainGuard: Privacy-Preserving Multisubject Image Reconstructions from Brain Activities
[AAAI 2025] Official implementation of the paper "EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation"
This repo is the official implementation of "Retrieval-Augmented Dynamic Prompt Tuning for Incomplete Multimodal Learning" accepted by AAAI 2025.
Official implementation of the paper "Attentive Eraser: Unleashing Diffusion Model’s Object Removal Potential via Self-Attention Redirection Guidance" (AAAI 2025 Oral)
【AAAI2025】MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt
[AAAI 2025] Official implementation of the paper "Exploring Semantic Consistency and Style Diversity for Domain Generalized Semantic Segmentation"
【AAAI2025】DeMo: Decoupled Feature-Based Mixture of Experts for Multi-Modal Object Re-Identification
[AAAI 2025] Depth-Centric Dehazing and Depth-Estimation from Real-World Hazy Driving Video
An official implementation of "Re-Attentional Controllable Video Diffusion Editing" in PyTorch. (AAAI 2025)
[AAAI 2025] Motion Prior Knowledge Learning with Homogeneous Language Descriptions for Moving Infrared Small Target Detection
✨ [AAAI 2025] Queryable Prototype Multiple Instance Learning with Vision-Language Models for Incremental Whole Slide Image Classification
RaynorLEE / CATS
Forked from IDEA-FinAI/CATS[AAAI2025] Offical code implementation of "Context-aware Inductive Knowledge Graph Completion with Latent Type Constraints and Subgraph Reasoning"
Start building LLM-empowered multi-agent applications in an easier way.
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
Official repository accompanying a CVPR 2022 paper EMOCA: Emotion Driven Monocular Face Capture And Animation. EMOCA takes a single image of a face as input and produces a 3D reconstruction. EMOCA …