Skip to content

Latest commit

 

History

History

langchain

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 
 
 

Langhchain Integration for Indexify

Indexify complements LangChain by providing a robust platform for indexing large volume of multi-modal content such as PDFs, raw text, audio and video. It provides a retriever API to retrieve context for LLMs.

You can use our LangChain retriever from our repo located in indexify_langchain/retriever.py to begin retrieving your data.

Below is an example

# setup your indexify client
from indexify.client import IndexifyClient
client = IndexifyClient()


# add docs
from indexify.repository import Document
repository = client.get_repository("default")

repository.bind_extractor(
    "diptanu/minilm-l6-extractor",
    "minilm",
)

repository.add_documents(
    [
        Document(
            text="Indexify is amazing!",
            metadata={"url": "https://github.com/tensorlakeai/indexify"},
        ),
        Document(
            text="Indexify is also a retrieval service for LLM agents!",
            metadata={"url": "https://github.com/tensorlakeai/indexify"},
        )
    ]
)


# implement retriever from indexify repo
from retriever import IndexifyRetriever

params = {"repository_name": "default", "name": "minilm-embedding", "top_k": 3}
retriever = IndexifyRetriever(client=client, params=params)

docs = retriever.get_relevant_documents("indexify")