LLM-Utils

Utility library, scripts, tokenization implementation and comparisons, and a barebones gradio UI for chatting to various LLM endpoints.

Includes

llm_utils module with utilities for tokenization, API calling e.g. OpenAI, Together.AI and OpenAI python library use
Tokenization that's more correct than that provided by the majority of OSS libraries, together with notebooks exploring differences and verifying correctness. The whole issue of prompt formatting is currently a mess and can impact model inference performance.
A minimal Gradio UI in ui for OpenAI library compatible endpoints and those that provide a REST API meaning that many of the open source and community models can be used with e.g. vLLM

Name		Name	Last commit message	Last commit date
Latest commit History 93 Commits
.github/workflows		.github/workflows
llm_utils		llm_utils
models		models
tests		tests
ui		ui
.gitignore		.gitignore
README.md		README.md
docker-compose-no-ncr.yml		docker-compose-no-ncr.yml
docker-compose.yml		docker-compose.yml
hf_inference_quant_flashattn.ipynb		hf_inference_quant_flashattn.ipynb
llm_api_calling.ipynb		llm_api_calling.ipynb
llm_api_endpoint_prompting.ipynb		llm_api_endpoint_prompting.ipynb
pytest.ini		pytest.ini
requirements-test.txt		requirements-test.txt
requirements.txt		requirements.txt

Provide feedback