-
CogVLM2 Public
Forked from THUDM/CogVLM2GPT4V-level open-source multi-modal model based on Llama3-8B
Python Apache License 2.0 UpdatedSep 3, 2024 -
mamba-clip Public
Forked from raytrun/mamba-clipCLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation
Python UpdatedAug 15, 2024 -
MoVA Public
Forked from TempleX98/MoVAMoVA: Adapting Mixture of Vision Experts to Multimodal Context
Python Apache License 2.0 UpdatedAug 14, 2024 -
PVLR Public
Forked from sejong-rcv/PVLR[ACM MM 2024] Probabilistic Vision-Language Representation for Weakly Supervised Temporal Action Localization
Python UpdatedAug 13, 2024 -
VMamba Public
Forked from MzeroMiko/VMambaVMamba: Visual State Space Models,code is based on mamba
Python MIT License UpdatedAug 4, 2024 -
MambaVision Public
Forked from NVlabs/MambaVisionOfficial PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone
Python Other UpdatedAug 1, 2024 -
alpaca-lora Public
Forked from tloen/alpaca-loraInstruct-tune LLaMA on consumer hardware
Jupyter Notebook Apache License 2.0 UpdatedJul 29, 2024 -
RWKV-LM Public
Forked from BlinkDL/RWKV-LMRWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…
Python Apache License 2.0 UpdatedJul 23, 2024 -
-
UniAD Public
Forked from OpenDriveLab/UniAD[CVPR'23 Best Paper Award] Planning-oriented Autonomous Driving
Python Apache License 2.0 UpdatedJul 11, 2024 -
UniMD Public
Forked from yingsen1/UniMDUniMD: Towards Unifying Moment retrieval and temporal action Detection
Python UpdatedJul 5, 2024 -
-
ShareGPT4V Public
Forked from ShareGPT4Omni/ShareGPT4V[ECCV 2024] ShareGPT4V: Improving Large Multi-modal Models with Better Captions
Python UpdatedJul 1, 2024 -
Efficient-Multimodal-LLMs-Survey Public
Forked from swordlidev/Efficient-Multimodal-LLMs-SurveyEfficient Multimodal Large Language Models: A Survey
Apache License 2.0 UpdatedMay 31, 2024 -
CogVLM Public
Forked from THUDM/CogVLMa state-of-the-art-level open visual language model | 多模态预训练模型
Python Apache License 2.0 UpdatedMay 29, 2024 -
BLIP Public
Forked from salesforce/BLIPPyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Jupyter Notebook BSD 3-Clause "New" or "Revised" License UpdatedMay 20, 2024 -
MoE-LLaVA Public
Forked from PKU-YuanGroup/MoE-LLaVAMixture-of-Experts for Large Vision-Language Models
Python Apache License 2.0 UpdatedMay 15, 2024 -
CVPR2024-TSPNet Public
Forked from zyxia1009/CVPR2024-TSPNet(CVPR2024) Realigning Confidence with Temporal Saliency Information for Point-level Weakly-Supervised Temporal Action Localization
Python MIT License UpdatedMay 11, 2024 -
LoRA Public
Forked from microsoft/LoRACode for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Python MIT License UpdatedMay 2, 2024 -
-
ECCV2022-DELU Public
Forked from MengyuanChen21/ECCV2022-DELU[ECCV 2022] Dual-Evidential Learning for Weakly-supervised Temporal Action Localization
Python MIT License UpdatedApr 19, 2024 -
-
DS-GCN Public
Forked from davelailai/DS-GCNThe implement for Dynamic Semantic-based Graph Convolution Network for Skeleton-based Human Action Recognition
Python UpdatedApr 17, 2024 -
SGRE Public
Forked from jasonseu/SGREThe official code of Semantic-Guided Representation Enhancement for Multi-Label Image Classification, TCSVT 2024.
Python UpdatedApr 10, 2024 -
ViFi-CLIP Public
Forked from muzairkhattak/ViFi-CLIP[CVPR 2023] Official repository of paper titled "Fine-tuned CLIP models are efficient video learners".
Python MIT License UpdatedApr 3, 2024 -
-
RGBX_Semantic_Segmentation Public
Forked from huaaaliu/RGBX_Semantic_SegmentationPython MIT License UpdatedApr 1, 2024 -
Vitis-AI Public
Forked from Xilinx/Vitis-AIVitis AI is Xilinx’s development stack for AI inference on Xilinx hardware platforms, including both edge devices and Alveo cards.
Python Apache License 2.0 UpdatedMar 15, 2024 -
VadCLIP Public
Forked from nwpu-zxr/VadCLIPVadCLIP official Pytorch implementation
Python Apache License 2.0 UpdatedMar 10, 2024 -
CLIP-FSAR Public
Forked from alibaba-mmai-research/CLIP-FSARCode for our IJCV 2023 paper "CLIP-guided Prototype Modulating for Few-shot Action Recognition".
Python Apache License 2.0 UpdatedMar 7, 2024