CLIP Image Vectorizer with BentoML and Milvus

This project offers an API for vectorizing images using OpenAI's CLIP model via BentoML. By sending images to the API, you receive a vector representation that encodes meaningful features of the image.

In addition, this project integrates Milvus for efficient vector storage and search, alongside MinIO and etcd to provide a more robust, complete system.

Features

REST API for image and text vectorization.
Powered by OpenAI CLIP, served through BentoML.
Docker Compose integration for a full-stack setup with Milvus, MinIO, and etcd.
Python scripts for:
- Importing image vectors into Milvus.
- Performing similarity searches on image vectors.

Getting Started

1. Set Up the Environment

Create and activate a virtual environment:

python3 -m venv .venv
source .venv/bin/activate

Install dependencies:

pip install transformers Pillow torch bentoml

Running BentoML Locally

Start BentoML Service

Launch the BentoML service:
```
bentoml serve service:vectorization
```

Verify the service:

Health Check:
```
curl -v http://127.0.0.1:3000/livez
```
Metrics:
```
curl http://127.0.0.1:3000/metrics
```

Vectorize Inputs

Vectorize a string:

curl -X POST -H "Content-Type: text/plain" -d "dog" http://127.0.0.1:3000/vectorize_text

Vectorize an image:

curl -X POST -H "Content-Type: image/jpeg" --data-binary @image.jpg http://127.0.0.1:3000/vectorize_image

The response will be a JSON object containing the image vector.

Building and Running the Container

Build and tag the BentoML container:

bentoml build
bentoml containerize clip_image_vectorizer:latest -t bentoml:latest

Run the container:

docker run --rm -p 3000:3000 bentoml:latest

Full Stack: BentoML + Milvus

Run the Stack

Use Docker Compose to start all services:
```
docker compose up -d
```

Verify service statuses:

etcd:

curl -X GET "http://127.0.0.1:2379/health"

Milvus:

curl -X GET "http://127.0.0.1:9091/api/v1/health"

List Milvus collections:

curl -X GET "http://127.0.0.1:9091/api/v1/collections"

Image Vector Management with Milvus

1. Import Vectors

Create a folder /images and add image files (.jpg, .jpeg).
Run the Python import script:
```
python3 milvus_import.py
```
This script:
- Creates a Milvus collection.
- Imports vectors of the images in /images into Milvus.

2. Search for Similar Images

Update the search term in milvus_search.py.

Run the search script:

python3 milvus_search.py

Example output:

Using L2 (Euclidean distance) for search...
Search results:
ID: image1.jpg, Score: 171.71
ID: image2.jpeg, Score: 174.32
...

Lower scores indicate closer matches.

Supported Image Formats

This service supports the following image formats via Pillow:

Format	File Extensions	Description
JPEG	`.jpg`, `.jpeg`	Common format, widely used for photographs.
PNG	`.png`	Lossless compression, supports transparency.
BMP	`.bmp`	Bitmap image format, uncompressed.
GIF	`.gif`	Supports animation; processes the first frame.
TIFF	`.tiff`, `.tif`	Flexible format with compression options.
WEBP	`.webp`	Modern image format for web usage.
HDR	`.hdr`	High Dynamic Range images.
TGA	`.tga`	Common in video games and graphics.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
bentoml		bentoml
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
milvus_import.py		milvus_import.py
milvus_list.py		milvus_list.py
milvus_search.py		milvus_search.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CLIP Image Vectorizer with BentoML and Milvus

Features

Getting Started

1. Set Up the Environment

Running BentoML Locally

Start BentoML Service

Vectorize Inputs

Building and Running the Container

Full Stack: BentoML + Milvus

Run the Stack

Image Vector Management with Milvus

1. Import Vectors

2. Search for Similar Images

Supported Image Formats

About

Languages

License

gordonmurray/bentoml-image-vectorization

Folders and files

Latest commit

History

Repository files navigation

CLIP Image Vectorizer with BentoML and Milvus

Features

Getting Started

1. Set Up the Environment

Running BentoML Locally

Start BentoML Service

Vectorize Inputs

Building and Running the Container

Full Stack: BentoML + Milvus

Run the Stack

Image Vector Management with Milvus

1. Import Vectors

2. Search for Similar Images

Supported Image Formats

About

Topics

Resources

License

Stars

Watchers

Forks

Languages