-
University of Technology Sydney
- Sydney
-
22:28
(UTC -12:00)
Lists (1)
Sort Name ascending (A-Z)
Stars
UnifiedQA: Crossing Format Boundaries With a Single QA System
Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Contextualized Perturbation for Textual Adversarial Attack, NAACL 2021
NeuralTalk is a Python+numpy project for learning Multimodal Recurrent Neural Networks that describe images with sentences.
I decide to sync up this repo and self-critical.pytorch. (The old master is in old master branch for archive)
Show and Tell : A Neural Image Caption Generator
Show-and-Fool: Adversarial Examples for Image Captioning task
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
LAVIS - A One-stop Library for Language-Vision Intelligence
Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.
Simple image captioning model
Codes for reproducing the adversarial attacks on image captioning systems in “Attacking Visual Language Grounding with Adversarial Examples: A Case Study on Neural Image Captioning,” ACL 2018
Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
Efficient Image Captioning code in Torch, runs on GPU
Summarization Attack via Paraphrasing