Multimodal search lets you use one type of data (in this case, text) to search another type of data (in this case, images). This example leverages core Jina technologies that make it simpler to build and run your search, including:
- DocumentArray, which lets us concurrently process Documents and push/pull them between machines. Useful for creating embeddings on a remote machine with a GPU, then indexing and querying locally
- Jina Hub Executors, so we don't have to manually integrate deep learning models
- Jina Client, so we don't have to worry about how best to format the REST request
- PQLite, which lets us pre-filter results by season, price, rating, etc. (see the sketch below)
The front-end is built in Streamlit.
We've got a live demo for you to play with.
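For example, the PQLite pre-filtering mentioned above might look roughly like this at query time. This is a minimal sketch, not code from this example: the host/port are placeholders, and the `filter` parameter name with its MongoDB-style operators is an assumption based on PQLite's query language.

```python
from docarray import Document
from jina import Client

# Hypothetical host/port; substitute whatever your Flow actually exposes
client = Client(host='localhost', port=12345, protocol='http')

# Text query, pre-filtered to cheap summer items. The 'filter' parameter
# and its MongoDB-style operators are assumptions based on PQLite's
# query language, not taken verbatim from this example's code.
results = client.post(
    on='/search',
    inputs=Document(text='floral summer dress'),
    parameters={'filter': {'season': {'$eq': 'Summer'}, 'price': {'$lte': 100}}},
)

for match in results[0].matches[:3]:
    print(match.uri)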
There are multiple ways you can run this:
- Deploy on JCloud
- Run with Docker-Compose
- Run on bare metal
- Clone this repo:

```shell
git clone https://github.com/jina-ai/example-multimodal-fashion-search.git
```

- Download data:

```shell
python ./get_data.py
```
JCloud lets you run the fashion backend Jina Flow on the cloud, without having to use your own compute.
```shell
pip install jcloud
cd backend
jc login
jc deploy jcloud
```
After that you can use Jina Client to connect and search/index your data.
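As a rough sketch of what that looks like (the host below is a placeholder for the endpoint `jc deploy` prints, and the image path is hypothetical):

```python
from docarray import Document, DocumentArray
from jina import Client

# Placeholder endpoint: jc deploy prints the real address for your Flow
client = Client(host='grpcs://<your-flow-id>.wolf.jina.ai')

# Index an image Document (hypothetical path)
client.post(on='/index', inputs=DocumentArray([Document(uri='data/images/0001.jpg')]))

# Search the index with text
results = client.post(on='/search', inputs=Document(text='red sneakers'))
for match in results[0].matches[:5]:
    print(match.uri)
```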
This will spin up:

- Indexer: saves embeddings and metadata to `/backend/workspace`. You can tweak how many Documents to index in `docker-compose.yml`. You can also comment out the `backend-index` section in `docker-compose.yml` if you've already indexed and don't want to re-index (see the sketch below).
- Searcher: searches the embeddings/metadata stored on disk.
- Frontend: Streamlit frontend to make the user experience easier.

```shell
docker-compose up
```
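For orientation, the indexing service in `docker-compose.yml` looks roughly like the sketch below. The service layout, paths, and arguments here are illustrative assumptions, so check the actual file:

```yaml
services:
  backend-index:                              # comment out this whole service to skip re-indexing
    build: ./backend
    command: python app.py -t index -n 1000   # tweak -n to change how many Documents get indexed
    volumes:
      - ./backend/workspace:/workspace        # embeddings and metadata persist here
```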
```shell
pip install -r requirements.txt
```

Then, in the `backend` directory:

- Build your index:

```shell
python app.py -t index -n 1000 # index 1000 images
```

- Open up the RESTful interface for searching/indexing (see the curl sketch below):

```shell
python app.py -t serve
```
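With the server running, you can also hit the REST endpoint directly. A minimal sketch, assuming the gateway uses Jina's HTTP protocol and listens on port 12345 (check `app.py` for the actual port):

```shell
curl -X POST http://localhost:12345/search \
  -H 'Content-Type: application/json' \
  -d '{"data": [{"text": "blue dress"}]}'
```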
To open the frontend, go to the `frontend` directory and run:

```shell
streamlit run frontend.py
```
- Index using the small dataset, then swap out the `data` directory for that of the hi-res dataset for nicer-looking results (see the sketch below).
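A minimal sketch of that swap, assuming the hi-res images live in a sibling `data-hires` directory (the actual directory names may differ):

```shell
mv data data-small    # keep the small dataset around
mv data-hires data    # assumed name of the hi-res dataset directory
```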
This is because you're trying to index data that's already been indexed. The database we use has a `UNIQUE` constraint, which means it won't index duplicate data. You can fix this by:

- Deleting `backend/workspace` (this will delete your entire index; see the one-liner below)
- Commenting out the `backend-index` section from `docker-compose.yml`
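For the first option:

```shell
rm -rf backend/workspace   # warning: permanently deletes your entire index
```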