Search in Docs
A web-based interface to search local documents for answers using Llama 2 or other models supported by LangChain. It can run on the CPU, on the GPU, or in a mixed CPU/GPU mode.
Based on Llama-2-Open-Source-LLM-CPU-Inference and this article.
The updates in this repo are:
- New web UI that lets you run search after search without having to reload the entire model for every query (see the sketch below).
- Ability to expand the references and go to the source file at the specific page (works with PDFs by attaching `#page=<page>`).
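To make these two points concrete, here is a minimal sketch of a Flask app that loads the model and the document index once at startup and attaches `#page=<page>` anchors to PDF sources. It assumes the LangChain + CTransformers + FAISS stack of the original CPU-inference repo; the paths, the `/search` route, and the settings are illustrative, not this repo's actual code.

```python
# Sketch: keep the model and index in memory so every query skips the load step.
from flask import Flask, jsonify, request
from langchain.chains import RetrievalQA
from langchain.embeddings import HuggingFaceEmbeddings
from langchain.llms import CTransformers
from langchain.vectorstores import FAISS

# Heavy objects are built once, at import time (assumed paths and settings).
llm = CTransformers(
    model="models/llama-2-7b-chat.ggmlv3.q8_0.bin",  # MODEL_BIN_PATH in config.yml
    model_type="llama",                              # MODEL_TYPE
    config={"gpu_layers": 30},                       # GPU_LAYERS; 0 = CPU only
)
embeddings = HuggingFaceEmbeddings(model_name="sentence-transformers/all-MiniLM-L6-v2")
db = FAISS.load_local("vectorstore/db_faiss", embeddings)  # index built by db_build.py
qa = RetrievalQA.from_chain_type(
    llm=llm, retriever=db.as_retriever(), return_source_documents=True
)

app = Flask(__name__)

@app.route("/search")
def search():
    result = qa({"query": request.args["q"]})
    # PDF viewers open at the cited page when "#page=<n>" (1-based) is appended;
    # PDF loaders store a 0-based "page" in the chunk metadata, hence the +1.
    sources = [
        f"{doc.metadata['source']}#page={doc.metadata.get('page', 0) + 1}"
        for doc in result["source_documents"]
    ]
    return jsonify(answer=result["result"], sources=sources)
```

Because everything above the route is created once, `flask run` pays the model-loading cost a single time; each request then only does retrieval plus inference.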
- Install the required packages (`pip install -r requirements.txt`). If you want to use CUDA (GPUs), make sure to install PyTorch with CUDA support.
- The `data` directory needs to hold all the files used for indexing and searching. Replace the file `Uruguay - Wikipedia.pdf` with your `.txt` or `.pdf` files.
- Download the LLM into `models/`. The original repo suggests downloading them from this huggingface to run the models on the CPU. If you don't know which one to use, start with this one that worked for me.
- Tweak a few parameters in `config.yml` (see the example config after this list):
  - `MODEL_BIN_PATH`: The path to the model.
  - `DEVICE`: If you want to use the GPU, set to `cuda`, otherwise use `cpu`.
  - `GPU_LAYERS`: Try a few different values (higher is better). Depending on the VRAM of your GPU, you may be able to fit more layers on the GPU. If the GPU runs out of VRAM, the setup will fail with an error, and you can lower the number. My setup has a `3060Ti` and runs 30 layers correctly.
  - `MODEL_TYPE`: If running a non-llama model, change the name here.
- Generate your indexed database of the documents: run `python db_build.py` (a sketch of this step follows the list).
- Run the command-line mode with `python endless.py` or the web UI with `flask run`.
- If using the web UI, go to `localhost:5000`.
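For reference, here is what `config.yml` might look like. Only the four keys come from this README; the values are illustrative examples, not the repo's defaults.

```yaml
# Illustrative config.yml -- values are examples, not defaults.
MODEL_BIN_PATH: models/llama-2-7b-chat.ggmlv3.q8_0.bin
MODEL_TYPE: llama   # change for non-llama models
DEVICE: cuda        # or cpu
GPU_LAYERS: 30      # layers offloaded to the GPU; lower this on out-of-VRAM errors
```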
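The indexing step run by `python db_build.py` roughly amounts to the following, assuming the same LangChain stack as the original CPU-inference repo; the loader choice, chunk sizes, and output path are assumptions.

```python
# Sketch of the indexing step: read data/, chunk, embed, persist a FAISS index.
from langchain.document_loaders import DirectoryLoader, PyPDFLoader
from langchain.embeddings import HuggingFaceEmbeddings
from langchain.text_splitter import RecursiveCharacterTextSplitter
from langchain.vectorstores import FAISS

# Load every PDF under data/ (a second pass with TextLoader would cover .txt files).
documents = DirectoryLoader("data/", glob="*.pdf", loader_cls=PyPDFLoader).load()

# Overlapping chunks let answers cite a specific page and passage.
splitter = RecursiveCharacterTextSplitter(chunk_size=500, chunk_overlap=50)
chunks = splitter.split_documents(documents)

# Embed and save; the query side reloads this index with FAISS.load_local.
embeddings = HuggingFaceEmbeddings(model_name="sentence-transformers/all-MiniLM-L6-v2")
FAISS.from_documents(chunks, embeddings).save_local("vectorstore/db_faiss")
```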
Note: Run the command-line app first to test the initial setup. Once it works (the model loads properly on the GPU), you can try the web UI.
Read more at the original repo, which contains the single-use command-line interface.