$ conda install pytorch==1.11.0 torchvision==0.12.0 torchaudio==0.11.0 cudatoolkit=11.3 -c pytorch
$ pip install ftfy regex tqdm torchinfo
$ pip install git+https://github.com/openai/CLIP.git
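To confirm the environment is set up correctly, the following quick check (independent of this repo) loads a CLIP backbone; "ViT-B/32" is just an example checkpoint:

# Sanity-check the install.
import torch
import clip

print(torch.__version__)          # expect 1.11.0
print(torch.cuda.is_available())  # expect True with cudatoolkit 11.3 and a GPU
print(clip.available_models())    # lists the downloadable CLIP checkpoints

# Loading a small backbone confirms the CLIP install works end to end.
device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-B/32", device=device)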
Dataset:
Dataset Link
*Note: you only need the left color images of the object data set (12 GB) and the training labels of the object data set (5 MB).
# Organize the files into the following structure:
kitti_dataset
├── testing
│   └── image_2   # only the testing image files
└── training
    ├── image_2   # only the training image files
    └── label_2   # only the label .txt files
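If you want to sanity-check the layout before training, here is a minimal sketch; the root path is an assumption about your setup, so adjust it as needed:

# Count the files in each expected folder.
from pathlib import Path

root = Path("kitti_dataset")  # adjust to wherever you unpacked the dataset

for split, subdirs in [("testing", ["image_2"]), ("training", ["image_2", "label_2"])]:
    for sub in subdirs:
        folder = root / split / sub
        count = len(list(folder.glob("*"))) if folder.is_dir() else 0
        print(f"{folder}: {count} files")
# The full KITTI object set has 7481 training and 7518 testing frames.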
# Before running, edit the dataset path in the script (Line 4: kitti_label_file_path) to match your setup.
python text_generation.py
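text_generation.py presumably converts the KITTI label files into text prompts for CLIP; its internals are not shown here. For reference, each line of a label_2 file starts with the object class, followed by truncation, occlusion, alpha, the 2D box, 3D dimensions, location, and rotation. An illustrative (not the repo's actual) parsing sketch:

# Illustrative only: build a naive text prompt per image from its label file.
from pathlib import Path

label_dir = Path("kitti_dataset/training/label_2")  # adjust to your layout

for label_file in sorted(label_dir.glob("*.txt"))[:3]:
    # The first whitespace-separated field of each line is the object class.
    classes = {line.split()[0] for line in label_file.read_text().splitlines()}
    classes.discard("DontCare")  # KITTI's ignore-region placeholder
    prompt = "a photo of " + " and ".join(sorted(classes)).lower()
    print(label_file.stem, "->", prompt)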
# Replace "../KITTI_DATASET_ROOT/training/image_2/" with the path to your training image_2 folder.
# Full fine-tuning of the whole model
python train.py --kitti_image_file_path "../KITTI_DATASET_ROOT/training/image_2/"
# Fine-tuning with an adapter
python train.py --adapter --kitti_image_file_path "../KITTI_DATASET_ROOT/training/image_2/"
# Fine-tuning with VPT (set --vpt_version to 1 or 2)
python train.py --prompt --vpt_version 1 --kitti_image_file_path "../KITTI_DATASET_ROOT/training/image_2/"
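For context: --adapter follows the AIM-style recipe of freezing the CLIP backbone and training only small bottleneck modules, while --prompt (VPT) instead prepends learnable prompt tokens to the ViT token sequence; the two --vpt_version values presumably correspond to the paper's shallow (input-only) and deep (per-layer) variants, though the exact mapping is this repo's choice. A minimal adapter sketch (the repo's module may differ):

# Minimal bottleneck adapter sketch; the repo's implementation may differ.
import torch.nn as nn

class Adapter(nn.Module):
    """Down-project, nonlinearity, up-project, added back residually."""
    def __init__(self, dim: int, bottleneck: int = 64):
        super().__init__()
        self.down = nn.Linear(dim, bottleneck)  # compress features
        self.act = nn.GELU()
        self.up = nn.Linear(bottleneck, dim)    # restore dimensionality
        nn.init.zeros_(self.up.weight)          # start as a near-identity map
        nn.init.zeros_(self.up.bias)

    def forward(self, x):
        return x + self.up(self.act(self.down(x)))

# Typical usage: freeze the backbone, then train only the adapter weights.
# for p in clip_model.parameters():
#     p.requires_grad = False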
# Replace "../KITTI_DATASET_ROOT/training/image_2/" with the path to your training image_2 folder.
# Full fine-tuning of the whole model
python test.py --kitti_image_file_path "../KITTI_DATASET_ROOT/training/image_2/"
# Fine-tuning with an adapter
python test.py --adapter --kitti_image_file_path "../KITTI_DATASET_ROOT/training/image_2/"
# Fine-tuning with VPT (set --vpt_version to 1 or 2)
python test.py --prompt --vpt_version 1 --kitti_image_file_path "../KITTI_DATASET_ROOT/training/image_2/"
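test.py presumably scores each image by comparing its CLIP image embedding with the generated class text embeddings; below is a generic sketch of that CLIP-style evaluation, with hypothetical prompts and an example file name:

# Generic CLIP classification sketch; test.py's exact pipeline may differ.
import torch
import clip
from PIL import Image

device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-B/32", device=device)

# Hypothetical KITTI-style prompts; the repo's generated texts may differ.
prompts = ["a photo of a car", "a photo of a pedestrian", "a photo of a cyclist"]
text = clip.tokenize(prompts).to(device)
image = preprocess(Image.open("000000.png")).unsqueeze(0).to(device)  # example file

with torch.no_grad():
    image_feat = model.encode_image(image)
    text_feat = model.encode_text(text)
    # Cosine similarity: L2-normalize, then scaled dot product and softmax.
    image_feat = image_feat / image_feat.norm(dim=-1, keepdim=True)
    text_feat = text_feat / text_feat.norm(dim=-1, keepdim=True)
    probs = (100.0 * image_feat @ text_feat.T).softmax(dim=-1)

print(dict(zip(prompts, probs.squeeze(0).tolist())))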
This repo builds on CLIP, AIM, and VPT. Thanks for their wonderful work.