Skip to content
View lianqi1008's full-sized avatar
  • Beijing Jiaotong University
  • Beijing, China
  • 12:18 (UTC +08:00)

Block or report lianqi1008

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Extreme Image Compression using Fine-tuned VQGAN Models (DCC 2024)

Jupyter Notebook 7 1 Updated Sep 21, 2024

TensorFlow implementation of EGIC (EGIC: Enhanced Low-Bit-Rate Generative Image Compression Guided by Semantic Segmentation, ECCV 2024)

Python 11 Updated Nov 14, 2024

Official implementation of "Controlling Rate, Distortion, and Realism: Towards a Single Comprehensive Neural Image Compression Model" (WACV2024).

Python 13 Updated Jun 19, 2024

An open-source implementaion for fine-tuning Qwen2-VL series by Alibaba Cloud.

Python 162 17 Updated Dec 12, 2024
Python 16 Updated Mar 12, 2024

✨✨ MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?

Python 88 6 Updated Nov 22, 2024

Accepted by IJCAI-24 Survey Track

Python 173 4 Updated Aug 25, 2024

中国大模型

5,680 476 Updated Nov 30, 2024

Awesome-LLM: a curated list of Large Language Model

19,667 1,623 Updated Dec 26, 2024

Pytorch implementation of the paper "You Can Mask More For Extremely Low-Bitrate Image Compression".

Python 37 Updated Apr 27, 2024

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 3,918 236 Updated Dec 4, 2024

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Python 1,382 106 Updated Oct 8, 2024

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

17,096 1,625 Updated Sep 19, 2024

✨✨Latest Advances on Multimodal Large Language Models

13,285 840 Updated Dec 26, 2024

Vector (and Scalar) Quantization, in Pytorch

Python 2,762 224 Updated Dec 3, 2024
Python 3,165 278 Updated Oct 16, 2024

[CVPR2023] All in One: Exploring Unified Video-Language Pre-training

Python 280 17 Updated Mar 25, 2023

ArcFace unofficial Implemented in Tensorflow 2.0+ (ResNet50, MobileNetV2). "ArcFace: Additive Angular Margin Loss for Deep Face Recognition" Published in CVPR 2019. With Colab.

Python 263 60 Updated Nov 21, 2022

Research code for pixel-based encoders of language (PIXEL)

Python 332 33 Updated Mar 6, 2024

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 2,477 162 Updated Dec 20, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 6,578 514 Updated Dec 25, 2024

(TPAMI 2024) A Survey on Open Vocabulary Learning

871 50 Updated Dec 10, 2024

ICCV 2023 Paper Global Features are All You Need for Image Retrieval and Reranking Official Repository

Python 212 15 Updated Sep 14, 2023

MLCD & UNICOM : Large-Scale Visual Representation Model

Python 470 21 Updated Dec 27, 2024

All-In-One VLM: Image + Video + Transfer to Other Languages / Domains (TPAMI 2023)

Python 149 13 Updated Aug 22, 2024

X-VLM: Multi-Grained Vision Language Pre-Training (ICML 2022)

Python 461 51 Updated Nov 25, 2022

TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.

Python 1,508 144 Updated Dec 23, 2024

A Pytorch Implementation of a continuously rate adjustable learned image compression framework.

Python 64 5 Updated Apr 6, 2023

Repository of the NeurIPS'22 paper "Selective compression learning of latent representations for variable-rate image compression" pytorch implementation

Python 5 Updated Sep 22, 2023

A collection of tools for neural compression enthusiasts.

Python 520 43 Updated Sep 20, 2024
Next