This software joins your llama.cpp server to the KoboldAI Horde as a Scribe worker, performing distributed text generation.
It is a fork of KoboldAI-Horde-Bridge.
See this reddit post: with this trick, older Pascal GPUs (GTX 10x0, P40, K80) are almost twice as fast, particularly at long contexts.
Compile llama.cpp with `make LLAMA_CUBLAS=1 LLAMA_CUDA_FORCE_MMQ=1` to get a Pascal-optimized server binary.
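A minimal build sketch, assuming the older Makefile-based build of llama.cpp (newer releases use CMake instead):

```
# Fetch llama.cpp and build the server binary with the Pascal-friendly flags.
git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp
make LLAMA_CUBLAS=1 LLAMA_CUDA_FORCE_MMQ=1
```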
- Launch the llama.cpp server, for example:

  ```
  server -m /path/to/model.gguf -ngl 100 -c 2048
  ```
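  To confirm the server is responding before attaching the bridge, you can query its completion endpoint (a quick sanity check; 127.0.0.1:8080 is assumed as the default host and port):

  ```
  curl http://127.0.0.1:8080/completion -H "Content-Type: application/json" \
      -d '{"prompt": "Hello", "n_predict": 8}'
  ```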
- Obtain a Horde API key
- Copy `clientData_template.py` to `clientData.py` and customize the configuration (a filled-in sketch follows this list):
  - `kai_url`: llama.cpp server endpoint (the default is fine if it runs on the same machine)
  - `kai_name`: Horde worker name
  - `api_key`: Horde API key
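  A sketch of what the filled-in `clientData.py` might look like; the variable names come from the list above, and the values are illustrative only:

  ```
  # clientData.py - bridge configuration (illustrative values)
  kai_url = "http://127.0.0.1:8080"  # llama.cpp server endpoint (assumed default port)
  kai_name = "MyLlamaWorker"         # worker name shown on the Horde
  api_key = "0000000000"             # your Horde API key
  ```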
- Run `bridge.py`

Note that for quick testing, you can provide these settings via the CLI instead:

```
bridge.py -k <kai_url> -a <api_key> -n <kai_name>
```
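For example, using the illustrative values from the configuration sketch above (the key is a placeholder):

```
bridge.py -k http://127.0.0.1:8080 -a 0000000000 -n MyLlamaWorker
```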