Heisenberg

Heisenberg is a simple vector database optimised for LLMs.

Why do we need vector databases? Simple, it is the language of AI.

Vectors represent complex data such as images, sounds or text documents as a list of numbers - vectors. By comparing distances between vectors we can find instances of data which are similar.

Use cases

Image search. Say you have a database of vectorised images. You can take a text prompt, turn it in to a vector then using Heisenberg find the images which have the closest match to the prompt.
Semantic search. Conventional text search finds phrases or words which closely match the prompt. Heisenberg however is better suited to finding pieces of texts which are conceptually related i.e. "movies" -> "The Matrix", "Guardians of the Galaxy" etc.
LLM memory. LLMs themselves can be pretty stupid. Just like humans, they need memory. By combining semantic search, Heisenberg can facilitate more complex LLM usage (built in LLM modules are coming soon).

Currently Heisenberg is still under development so it is not recommended to use for production.

Example

Normally to convert words in to vectors you utilise an embedding model. You can utilise the OpenAI api for example to obtain vectors for a word. Here we will use a simpler model.

Consider a two dimensional vector [x, y] where x represents gender i.e. 0 is male and 1 is female and y represents power where 1 is the most powerful.

We can represent the following words as such:

func getWords() map[string][]float32 {
    words := make(map[string][]float32)
    m["man"] := [0, 0.5]
    m["woman"] := [1, 0.5]
    m["person"] := [0.5, 0.5]
    m["king"] := [1, 0]
    m["queen"] := [0, 1]
    m["servant"] := [0.5, 0]
    m["maid"] := [1, 0]
}

This gives us a semantic map of words which are closely related. We can then search through them as so:

import (
    "fmt"
    "heisenberg/core"
    "heisenberg/utils"
)


func main() {
    words := getWords()
    h := NewHeisenberg("./path/to/heisenberg/folder")
    
    // Collections are an isolated search group of vectors.
    // To create a new collection specify the name, the size of vectors to be stored and the similarity metric.
    if err := h.NewCollection("people", 2, utils.Cosine); err != nil {
        panic(err)
    }

    for k, v := range words {
        // Store the vectors in the collection "people"
        // Each vector corresponds to a key (the word in this case)
        // You can retrieve a vector using the key via h.Get(collection, key)
        if err := h.Put("people", k, v, {}); err != nil {
            panic(err)
        }
    }

    // Find the three closest words to man
    results, err := h.Search("people", words["man"], 3)
    if err != nil {
        panic(err)
    }

    fmt.Println("Words closest to man: ")
    for result := range results {
        fmt.Println("%s", result.key)
    }
}

Contact/Contribute

Our plan is to turn Heisenberg into an intelligent memory source for complex AI systems. If you're interested in becoming a part of our mission join our discord community here where you can connect with us.

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
clients		clients
core		core
hnsw		hnsw
log		log
math		math
server		server
utils		utils
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
go.mod		go.mod
go.sum		go.sum
main.go		main.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Heisenberg

Use cases

Example

Contact/Contribute

About

Releases

Packages

Languages

aldarisbm/heisenberg

Folders and files

Latest commit

History

Repository files navigation

Heisenberg

Use cases

Example

Contact/Contribute

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages