
ChartMoE

Mixture of Diversely Aligned Expert Connector for Chart Understanding

Zhengzhuo Xu1,2*, Bowen Qu1,3*, Yiyan Qi1*, Sinan Du2, Chengjin Xu1, Chun Yuan2, Jian Guo1,4

1 International Digital Economy Academy (IDEA), 2 Tsinghua University, 3 Peking University,

4 Hong Kong University of Science and Technology, Guangzhou

ICLR 2025 Oral

(* equal contribution)

arXiv Project Page Hugging Face Model Hugging Face Dataset

If you have any questions, feel free to contact us 📧.

ChartMoE is a multimodal large language model with a Mixture-of-Experts connector for advanced chart 1) understanding, 2) replotting, 3) editing, 4) highlighting, and 5) transformation.
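
The connector idea above can be pictured as several MLP projectors plus a learned router over visual tokens. Below is a minimal, hypothetical PyTorch sketch (class and hyperparameter names are ours, not the repo's): each expert projects vision features into the LLM embedding space, and the router selects the top-k experts per token. ChartMoE's actual connector additionally initializes its experts from diversely aligned projectors, as described in the paper.

```python
import torch
import torch.nn as nn

class MoEConnectorSketch(nn.Module):
    """Hypothetical sketch of an MoE connector: each expert is an MLP
    mapping vision features to the LLM embedding space; a linear router
    picks the top-k experts per visual token."""
    def __init__(self, vision_dim, llm_dim, num_experts=4, top_k=2):
        super().__init__()
        self.experts = nn.ModuleList(
            nn.Sequential(
                nn.Linear(vision_dim, llm_dim),
                nn.GELU(),
                nn.Linear(llm_dim, llm_dim),
            )
            for _ in range(num_experts)
        )
        self.router = nn.Linear(vision_dim, num_experts)
        self.top_k = top_k
        self.llm_dim = llm_dim

    def forward(self, x):  # x: (num_tokens, vision_dim)
        weights, idx = self.router(x).topk(self.top_k, dim=-1)
        weights = weights.softmax(dim=-1)  # normalize over the chosen experts
        out = x.new_zeros(x.size(0), self.llm_dim)
        for k in range(self.top_k):
            for e, expert in enumerate(self.experts):
                mask = idx[:, k] == e  # tokens whose k-th choice is expert e
                if mask.any():
                    out[mask] += weights[mask, k].unsqueeze(-1) * expert(x[mask])
        return out
```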

News

  • 2025.2.16: ChartMoE-Data has been released at 🤗. Please download it following our instructions.
  • 2025.2.15: Training code and recipes are released! Please refer to 📖!
  • 2025.2.11: 🎉🎉🎉 ChartMoE is selected as an ICLR 2025 Oral (top 1.8%)!
  • 2025.1.23: 🎉🎉🎉 ChartMoE is accepted to ICLR 2025!
  • 2024.9.10: We release ChartMoE!

Training of ChartMoE

Please refer to the 📖 training README!

Download and Organize the ChartMoE-Data

🤗 ChartMoE-Data has been released! You can download it by running:

cd chartmoe/train
python scripts/chartmoe_data_download.py

Datasets will appear at chartmoe/train/data.

Then, please unzip the two archives:

unzip ChartMoE-Align.zip
unzip SFT.zip

Note that the ChartY_replot subset of ChartMoE-Align contains higher-quality data with bilingual text, so it may be a good choice to sample more heavily from ChartY_replot.
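
If you do want to weight ChartY_replot more heavily, one simple approach is to oversample it when composing the training mix. A hypothetical sketch (the function and argument names are ours; pass in the sample lists loaded from the unzipped annotation files):

```python
import random

def build_mix(chartY_replot, other_align, replot_ratio=2.0, seed=0):
    """Compose an alignment mix that oversamples ChartY_replot.
    replot_ratio=2.0 draws twice as many ChartY_replot samples as the
    subset contains; the other subsets are used once each."""
    rng = random.Random(seed)
    n_replot = int(len(chartY_replot) * replot_ratio)
    mix = rng.choices(chartY_replot, k=n_replot) + list(other_align)
    rng.shuffle(mix)
    return mix
```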

Installation

Step 1. Create a conda environment and activate it.

conda create -n chartmoe_env python=3.9
conda activate chartmoe_env

Step 2. Install PyTorch (we use PyTorch 2.1.0 with CUDA 12.1)

pip install torch==2.1.0 torchvision==0.16.0 torchaudio==2.1.0 --index-url https://download.pytorch.org/whl/cu121

Step 3. Install required packages

pip install -r requirements.txt

Step 4. Install ChartMoE as an editable package

pip install -e .

Step 5. (Optional) Install Flash-Attn (requires CUDA > 11.7)

pip install flash-attn==2.7.0.post2

Flash-Attn brings ~30% acceleration in training and ~20% in evaluation in our experiments.

P.S.: If you cannot install flash-attn, please set attn_implementation to eager in ChartMoE's config.json.
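
For example, the fallback can be applied with a small script; the commented-out path below assumes the default download location from the Quick Start section:

```python
import json

def set_eager_attention(config_path):
    """Fall back to the eager attention implementation when flash-attn
    is unavailable, by editing the model's config.json in place."""
    with open(config_path) as f:
        cfg = json.load(f)
    cfg["attn_implementation"] = "eager"  # key name as used in config.json
    with open(config_path, "w") as f:
        json.dump(cfg, f, indent=2)

# Assumed default checkpoint location; adjust to your setup:
# set_eager_attention("chartmoe/train/ckpt/chartmoe/config.json")
```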

Quick Start

Hugging Face Download Script for ChartMoE

Note: Flash-Attn support for ChartMoE was added on Feb. 15. If you downloaded ChartMoE before this date, please re-download it for the acceleration.

Run:

cd chartmoe/train
python scripts/chartmoe_download.py

Then, ChartMoE will appear at chartmoe/train/ckpt/chartmoe.

Customize the weight path of ChartMoE

Set your own ChartMoE_HF_PATH. We suggest using the absolute path of chartmoe/train/ckpt/chartmoe.
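
For example (assuming the default checkpoint location from the download script; how ChartMoE_HF_PATH is consumed depends on your checkout's configuration):

```python
import os

# Assumption: the default download location from the Quick Start section.
# Use an absolute path so scripts work regardless of the working directory.
ChartMoE_HF_PATH = os.path.abspath("chartmoe/train/ckpt/chartmoe")
```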

Code Demo

from chartmoe import ChartMoE_Robot
import torch

# Load the model and pick an example chart with a question.
robot = ChartMoE_Robot()
image_path = "examples/bar2.png"
question = "Redraw the chart with python matplotlib, giving the code to highlight the column corresponding to the year in which the student got the highest score (painting it red). Please keep the same colors and legend as the input chart."

# Run inference under mixed precision; `history` carries the dialogue state
# across turns (empty string for the first turn).
history = ""
with torch.cuda.amp.autocast():
    response, history = robot.chat(image_path=image_path, question=question, history=history)

print(response)

Evaluation

ChartQA

Customize the path of ChartQA:

Set your own ChartQA_ROOT (containing test_human.json and test_augmented.json) and ChartQA_TEST_IMG_ROOT (containing the test images).

w/ PoT:

CUDA_VISIBLE_DEVICES=0 python chartmoe/eval_ChartQA.py --save_path ./results/chartqa_results_pot --pot

w/o PoT:

CUDA_VISIBLE_DEVICES=0 python chartmoe/eval_ChartQA.py --save_path ./results/chartqa_results
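
ChartQA is conventionally scored with relaxed accuracy: a numeric prediction counts as correct if it is within 5% of the ground truth, while non-numeric answers must match exactly. A standalone sketch of that metric (the evaluation script's exact implementation may differ in details such as case handling):

```python
def relaxed_accuracy(pred, target, tolerance=0.05):
    """ChartQA-style relaxed match: numeric answers may deviate from the
    target by up to `tolerance` (5% by default); non-numeric answers must
    match exactly (here, case-insensitively)."""
    try:
        p, t = float(pred), float(target)
    except (TypeError, ValueError):
        return str(pred).strip().lower() == str(target).strip().lower()
    if t == 0:
        return p == 0
    return abs(p - t) / abs(t) <= tolerance
```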

MME

Run chartmoe/eval_MME.ipynb for MME scores.

WebUI Demo

CUDA_VISIBLE_DEVICES=0 python gradio_demo.py 

FAQs

Q1: CLIP: Input image size (490x490) doesn't match model (336x336)

A1: Please downgrade your transformers version according to requirements.txt.

Acknowledgement

Thanks to InternLM-XComposer2 and CuMo for releasing their model weights and source code! And thanks to MMC and ChartGemma for releasing their high-quality instruction-tuning data!

Citation

If you find our idea or code inspiring, please cite our paper:

@article{ChartMoE,
    title={ChartMoE: Mixture of Expert Connector for Advanced Chart Understanding},
    author={Zhengzhuo Xu and Bowen Qu and Yiyan Qi and Sinan Du and Chengjin Xu and Chun Yuan and Jian Guo},
    journal={ArXiv},
    year={2024},
    volume={abs/2409.03277},
}

This code is partially based on ChartBench; if you use our code, please also cite:

@article{ChartBench,
    title={ChartBench: A Benchmark for Complex Visual Reasoning in Charts},
    author={Zhengzhuo Xu and Sinan Du and Yiyan Qi and Chengjin Xu and Chun Yuan and Jian Guo},
    journal={ArXiv},
    year={2023},
    volume={abs/2312.15915},
}