GitHub - FateScript/token_visualizer: Token level visualization tools for large language models

Introduction

token_visualizer is a token-level visualization tool for large language models (LLMs).

Quick start

Installation

Install from source

Run the following commands to install the package from source.

git clone git@github.com:FateScript/token_visualizer.git
cd token_visualizer
pip install -v -e .  # or  python3 setup.py develop

Check installation

The installation succeeded if the following command prints the path of the installed token_visualizer package without raising an import error:

python3 -c "import token_visualizer; print(token_visualizer.__file__)"

Visualization demo

Inference

Start demo

Run the following command to start the inference visualizer.

python3 visualizer.py

The command starts an OpenAIProxy model. To use it without errors, fill in the values of BASE_URL and OPENAI_KEY.
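As a rough pre-flight check, the sketch below reports which of the two settings are missing before you start the demo. It assumes the values are exposed as environment variables; how visualizer.py actually reads BASE_URL and OPENAI_KEY is not stated here, so treat `missing_settings` as a hypothetical helper and not part of token_visualizer.

```python
import os

# Setting names taken from the README; reading them from the process
# environment is an assumption made for this sketch.
REQUIRED_SETTINGS = ("BASE_URL", "OPENAI_KEY")


def missing_settings(env=None):
    """Return the names of required settings that are unset or empty."""
    env = os.environ if env is None else env
    return [name for name in REQUIRED_SETTINGS if not env.get(name)]
```

Calling `missing_settings()` with no argument inspects the current environment; an empty list means both values are present.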

token_visualizer also supports OpenAIModel and the HuggingFace TransformerModel, both defined in models.py. Feel free to modify the code to switch models.

Demo gif

After you enter a prompt, you will see the large language model's answer together with a token-level visualization of that answer.

The redder the color of the token, the lower the corresponding probability. The greener the color of the token, the higher the corresponding probability.
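The red-to-green mapping described above can be sketched as a linear interpolation over the token probability. This is only an illustration: the project credits its color algorithm to a post by thesephist, which may differ, and the names `prob_to_rgb` and `render_tokens_html` are hypothetical.

```python
from __future__ import annotations

import html


def prob_to_rgb(prob: float) -> tuple[int, int, int]:
    """Map a probability in [0, 1] to a color between red (0.0) and green (1.0)."""
    p = min(max(prob, 0.0), 1.0)  # clamp out-of-range inputs
    red = round(255 * (1.0 - p))  # low probability -> more red
    green = round(255 * p)        # high probability -> more green
    return (red, green, 0)


def render_tokens_html(tokens: list[str], probs: list[float]) -> str:
    """Wrap each token in a <span> colored by its probability."""
    spans = []
    for token, prob in zip(tokens, probs):
        r, g, b = prob_to_rgb(prob)
        spans.append(
            f'<span style="color: rgb({r}, {g}, {b})">{html.escape(token)}</span>'
        )
    return "".join(spans)
```

A linear ramp is the simplest choice; a real implementation might work on log-probabilities or use a perceptual color space so that mid-range probabilities are easier to tell apart.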

Perplexity

Start demo

Run the following command to start the perplexity visualizer, then click the ppl tab.

python3 visualizer.py

Demo gif

After you enter some text, you will see its perplexity and a token-level visualization of the text.
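For reference, perplexity is conventionally defined as the exponential of the mean negative log-probability of the tokens. The sketch below implements that standard definition; whether token_visualizer computes it in exactly this way (for example, which tokens it includes) is an assumption, and `perplexity` is a hypothetical helper name.

```python
import math


def perplexity(token_logprobs):
    """Compute perplexity from per-token natural-log probabilities."""
    if not token_logprobs:
        raise ValueError("need at least one token log-probability")
    # Mean negative log-likelihood over all tokens.
    mean_nll = -sum(token_logprobs) / len(token_logprobs)
    return math.exp(mean_nll)
```

For example, if every token has probability 0.25, `perplexity([math.log(0.25)] * 3)` is 4 up to floating-point error: the model is as uncertain as a uniform choice among four options.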

Tokenizer

Start demo

Run the following command to start the interactive tokenizer encoding web demo.

python3 visual_tokenizer.py

Demo gif

You can select which tokenizer to interact with and enter the text to encode. For special strings

TODO

Support ppl visualization.
Select transformers/openai/TGI with cli.
Support OpenAI tokenizer visualization.
Support TGI inference visualization.
Support multi-turn chat visualization.
Support dark mode.

Related projects/websites

Acknowledgement

Front-end settings are taken from https://perplexity.vercel.app/
The color algorithm comes from a post by thesephist.
