VISIT: Visualizing and Interpreting the Semantic Information Flow of Transformers

This repository contains the code for the paper VISIT: Visualizing and Interpreting the Semantic Information Flow of Transformers.

Try our demo:

By using this notebook you can create dynamic plots that reflect the forward passes of GPTs from a semantic perspective. These plots illustrate the information flow within the models and provide insights into the impact of each component on the semantic information flow.

Our implementation currently works with multi-head attention decoders such as OpenAI's GPT-2 and EleutherAI's GPT-J (both from HuggingFace).

Feel free to open an issue if you find any problems or contact us to discuss any related topics.

Requirements:

Tested with Python 3.9.7. After cloning the repository, run the following command in your Python environment to install the required packages:

pip install -r requirements.txt

Generate Flow Graphs:

To generate the flow graphs, you can either use the interactive notebooks we provide (including our demo on Colab; this version shows the code explicitly, unlike the form-like UI version at the top of the page), or convert them into .py files using jupyter nbconvert <The relevant notebook>.ipynb --to python and run them from the command line.

To run the code from the command line, use the following template:

python generate_flow_graph.py --model_name <The model name or path> \
    --line <The input line> \
    --graph_config_path <The path to the graph config file> \
    --layers_to_check <The layers to check>

For example:

python generate_flow_graph.py --model_name "gpt2-medium" \
    --line "The capital of Japan is the city of" \
    --graph_config_path "flow_graph_configs/flow_graph_config_basic.json" \
    --layers_to_check "[10,14]"

or (using a color-blind-friendly configuration):

python generate_flow_graph.py --model_name "gpt2-xl" \
    --line "Lionel Messi plays for" \
    --graph_config_path "flow_graph_configs/flow_graph_config_basic_color_palette2.json" \
    --layers_to_check "[15,16]"

To generate a graph for the GPT-J model, pass --model_revision "float16":

python generate_flow_graph.py --model_name "EleutherAI/gpt-j-6B" \
    --model_revision "float16" \
    --line "The capital of Japan is the city of" \
    --graph_config_path "flow_graph_configs/flow_graph_config_gptj.json" \
    --layers_to_check "[10,14]"
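
The invocations above all share the same flags, so they can also be assembled from Python when you want to script several runs. The following is a minimal sketch, not part of the repository: the helper name build_flow_graph_command is hypothetical, and it only uses the flags shown in the examples above. It assumes the script expects --layers_to_check as a bracketed list without spaces, as in the examples.

```python
import json
import shlex
from typing import List, Optional, Sequence


def build_flow_graph_command(
    model_name: str,
    line: str,
    graph_config_path: str,
    layers_to_check: Sequence[int],
    model_revision: Optional[str] = None,
) -> List[str]:
    """Return the argv list for one generate_flow_graph.py run (hypothetical helper)."""
    cmd = ["python", "generate_flow_graph.py", "--model_name", model_name]
    if model_revision is not None:
        # Only the GPT-J example in this README passes a revision.
        cmd += ["--model_revision", model_revision]
    cmd += [
        "--line", line,
        "--graph_config_path", graph_config_path,
        # Serialize the layers without spaces, matching the "[10,14]" form used above.
        "--layers_to_check", json.dumps(list(layers_to_check), separators=(",", ":")),
    ]
    return cmd


cmd = build_flow_graph_command(
    "gpt2-medium",
    "The capital of Japan is the city of",
    "flow_graph_configs/flow_graph_config_basic.json",
    [10, 14],
)
# Print a shell-quoted version that can be pasted into a terminal.
print(shlex.join(cmd))
```

To actually run the command, pass the list to subprocess.run(cmd, check=True) from the repository root.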

Note that the graph can be displayed in your IDE or in an available browser, and can also be saved to an HTML file.

GREAT NEWS! We support GPT-Neo and Llama2-7B!

Please make sure to use the correct graph config file for each model. If you are using the Llama 2 model from HuggingFace, also make sure you have been granted access to it.

python generate_flow_graph.py --model_name "EleutherAI/gpt-neo-1.3B" \
    --line "The capital of Japan is the city of" \
    --graph_config_path "flow_graph_configs/flow_graph_config_gpt_neo.json" \
    --layers_to_check "[10,14,18]"

python generate_flow_graph.py --model_name "meta-llama/Llama-2-7b-chat-hf" \
    --line "The capital of Japan is the city of" \
    --graph_config_path "flow_graph_configs/flow_graph_config_llama2_7B.json" \
    --layers_to_check "[10,14,18]"
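
Since each model family needs its own graph config file, a small lookup can help avoid mismatches. The sketch below simply records the model/config pairings used in the example commands of this README. The names EXAMPLE_CONFIGS and example_config_for are hypothetical, and matching by name prefix is an assumption that may not hold for models outside these examples (a differently named variant could match the wrong entry).

```python
from typing import Optional

# Model-name prefix -> config file, copied from the example commands in this README.
EXAMPLE_CONFIGS = {
    "gpt2": "flow_graph_configs/flow_graph_config_basic.json",
    "EleutherAI/gpt-j": "flow_graph_configs/flow_graph_config_gptj.json",
    "EleutherAI/gpt-neo": "flow_graph_configs/flow_graph_config_gpt_neo.json",
    "meta-llama/Llama-2-7b": "flow_graph_configs/flow_graph_config_llama2_7B.json",
}


def example_config_for(model_name: str) -> Optional[str]:
    """Return the config path this README pairs with the model family, or None."""
    for prefix, path in EXAMPLE_CONFIGS.items():
        if model_name.startswith(prefix):
            return path
    # Unknown family: the caller has to pick or write a config manually.
    return None


print(example_config_for("EleutherAI/gpt-neo-1.3B"))
```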

More models:

Our tool should be able to handle any GPT-like model (an autoregressive decoder with multi-head self-attention). Please check out the flow_graph_configs folder for examples of how to configure the tool for other models. If you have any questions, please contact us.
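
When adapting a config to a new model, it can help to first compare what the existing config files contain. The following sketch lists the top-level keys of every JSON file in a folder. It makes no claim about what those keys are; it only assumes that each config is a JSON object, which the .json extension suggests but this README does not state. The function name summarize_configs is hypothetical.

```python
import json
from pathlib import Path
from typing import Dict, List


def summarize_configs(folder: str = "flow_graph_configs") -> Dict[str, List[str]]:
    """Map each *.json file in `folder` to its sorted top-level keys."""
    summary: Dict[str, List[str]] = {}
    for path in sorted(Path(folder).glob("*.json")):
        with path.open(encoding="utf-8") as handle:
            data = json.load(handle)
        # Assumption: configs are JSON objects; anything else gets an empty key list.
        summary[path.name] = sorted(data) if isinstance(data, dict) else []
    return summary
```

Calling summarize_configs() from the repository root returns a dictionary you can print or diff to see which keys differ between, for example, the GPT-2 and GPT-J configs.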

How to Cite

@article{katz2023visit,
      title={VISIT: Visualizing and Interpreting the Semantic Information Flow of Transformers}, 
      author={Shahar Katz and Yonatan Belinkov},
      year={2023},
      eprint={2305.13417},
      archivePrefix={arXiv},
}

