Minivec

A minimal document store with semantic retrieval based upon embedding vectors and reranking

Getting started

podman run -it --rm -p 3000:3000 docker.io/pqnet/minivec

or using docker:

docker run -it --rm -p 3000:3000 pqnet/minivec

add new documents to the store with a HTTP POST:

curl -H "Content-Type: application/json" -d  '{ "documents": [{ "content": "hello world", "metadata":{} }]}' localhost:3000/api/documents

And search them using HTTP GET:

curl 'localhost:3000/api/documents?q=hello'

Persistence

By default models are downloaded in the /models directory and the database is saved in the /app/.data directory (inside the container). To allow re-using model cache, or to persist the saved vectors between runs, you can map host directories or mount named volumes at these paths, e.g.

podman run -it --rm -p 3000:3000 -v minivec-models-cache:/models -v minivec-data:/app/.data docker.io/pqnet/minivec

(similarly with docker)

docker run -it --rm -p 3000:3000 -v minivec-models-cache:/models -v minivec-data:/app/.data pqnet/minivec

Configuration

Use environment variables to configure which models to load. see nitro.config.ts for a full list of the usable variables

Model choice

bge-m3 (for embedding) and bge-reranker-v2-m3 (for reranking) are automatically downloaded and used by the container. It is possible to choose different models by specifying a local file name, an http/https URL or an huggingface repository to download the models automatically. See https://node-llama-cpp.withcat.ai/guide/downloading-models for the list of compatible URL schemes and parameters.

For a simpler deployment in cloud environments and similar we recommend creating a custom container and copy your models inside of it. It is also possible to copy a database file in the container image and disable write operations.

FROM docker.io/pqnet/minivec:latest

# Add models to the image
COPY my-embedding-model.gguf /models/my-embedding-model.gguf
COPY my-ranking-model.gguf /models/my-ranking-model.gguf
ENV NITRO_LOCAL_MODELS_EMBED_MODEL_FILE=my-embedding-model.gguf
ENV NITRO_LOCAL_MODELS_RANK_MODEL_FILE=my-ranking-model.gguf

# preload a database file, and disable writing to the store (for inference loads)
COPY db.sqlite3 /app/.data/db.sqlite3
ENV NITRO_DISABLE_WRITE=true

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
server		server
.gitignore		.gitignore
Containerfile		Containerfile
README.md		README.md
nitro.config.ts		nitro.config.ts
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Minivec

Getting started

Persistence

Configuration

Model choice

About

Uh oh!

Releases

Packages

Uh oh!

Languages

pqnet/minivec

Folders and files

Latest commit

History

Repository files navigation

Minivec

Getting started

Persistence

Configuration

Model choice

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages