🌐 Homepage | 🏆 Leaderboard | 🤗 TextAtlas5M | 🤗 TextAtlasEval | 📖 TextAtlas arXiv
This repo contains the evaluation code for the paper "TextAtlas5M: A Large-scale Dataset for Dense Text Image Generation".
- [2025-2-18]: Our evaluation code is now available! 🌟
- [2025-2-13]: Released TextAtlasEval & TextAtlas5M version 1.0 🔥
conda create -n TextAtlasEval python=3.9
conda activate TextAtlasEval
pip install -r requirements.txt
TextAtlas is designed to challenge and evaluate text-rich image generation. For more details and to access the dataset, please refer to our Hugging Face page:
Please refer to our evaluation folders for detailed information on evaluating with the TextAtlasEval benchmark:
The TextAtlas annotation documentation is available on Hugging Face:
- Main version: Contains image paths and pre-integrated prompts, making it suitable for direct training or evaluation.
- Meta data: Includes all the data from the main version, along with intermediate results such as bounding boxes (bbox), font size, and other related information, which can be used for further data analysis or processing.
{
"image_path": "0000089b-f1ce-41cf-9cd8-688856822244.png",
"annotation": "In an opulent boutique, a sleek white digital display contrasts sharply with meticulously arranged merchandise and luxurious decor, creating a striking visual focal point. digital display with the text : ''Amidst the opulent ambiance of the upscale boutique, a sleek white digital display stands out as a striking contrast to the meticulously arranged merchandise and sumptuous luxury decor''"
}
| entry | description |
|---|---|
| image_path | str, the image name |
| annotation | str, the full description |
More examples with images can be found in 🤗 TextAtlas5M and 🤗 TextAtlasEval.
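As a rough illustration of the main-version record format described above, the sketch below checks that a record carries the two documented string fields and pulls out the text to be rendered. The helper names `validate_record` and `extract_quoted_text` are hypothetical and not part of this repo. The sample annotation is a shortened, made-up string, and the pattern of wrapping the target text in doubled single quotes (`''...''`) is an assumption based on the single example shown here; other subsets may format it differently.

```python
import json
import re

# The two fields documented for the main version, both strings.
REQUIRED_FIELDS = ("image_path", "annotation")


def validate_record(record):
    """Return the record if both documented fields are present strings (hypothetical helper)."""
    for field in REQUIRED_FIELDS:
        if field not in record:
            raise ValueError(f"missing field: {field}")
        if not isinstance(record[field], str):
            raise ValueError(f"field {field} must be a string")
    return record


def extract_quoted_text(annotation):
    """Return all spans wrapped in doubled single quotes.

    Assumption: the text to render is delimited by ''...'' as in the
    example record; this is not a documented guarantee.
    """
    return re.findall(r"''(.+?)''", annotation, flags=re.DOTALL)


# Illustrative record: real image_path from the example, shortened made-up annotation.
sample = {
    "image_path": "0000089b-f1ce-41cf-9cd8-688856822244.png",
    "annotation": "A boutique interior. digital display with the text : ''Grand opening today''",
}

# Round-trip through JSON to mimic reading a record from an annotation file.
record = validate_record(json.loads(json.dumps(sample)))
target_texts = extract_quoted_text(record["annotation"])
print(record["image_path"], target_texts)
```

The extracted strings could then serve as the reference text when comparing against OCR output of a generated image, though the evaluation folders remain the authoritative source for how scoring is actually done.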
In addition to the data from the main version, the meta data includes intermediate results retained while processing the different subsets, such as bounding boxes (bbox), font size, and other processing details, which are useful for further analysis.
Please refer to the TextAtlas Detailed Annotation for more comprehensive details on the meta annotations.
If you find our work useful, please consider citing:
@article{wang2025textatlas5m,
title={TextAtlas5M: A Large-scale Dataset for Dense Text Image Generation},
author={Wang, Alex Jinpeng and Mao, Dongxing and Zhang, Jiawei and Han, Weiming and Dong, Zhuobai and Li, Linjie and Lin, Yiqi and Yang, Zhengyuan and Qin, Libo and Zhang, Fuwei and others},
journal={arXiv preprint arXiv:2502.07870},
year={2025}
}