Skip to content

A Large-scale Dataset for training and evaluating model's ability on Dense Text Image Generation

Notifications You must be signed in to change notification settings

CSU-JPG/TextAtlas

Repository files navigation

TextAtlas5M

A Large-scale Dataset for Dense Text Image Generation

🌐 Homepage | 🏆 Leaderboard | 🤗 TextAtlas5M | 🤗 TextAtlasEval | 📖 TextAtlas arXiv

This repo contains the evaluation code for the paper "TextAtlas5M: A Large-scale Dataset for Dense Text Image Generation"

Updates

  • [2025-2-18]: Our evaluation code is now availble! 🌟
  • [2025-2-13]: released TextAtlasEval & TextAtlas5M version 1.0 🔥

Table of Contents

Setup

conda create -n TextAtlasEval python=3.9 
conda activate TextAtlasEval

pip install -r requirements.txt

Accessing Datasets

TextAtlas was meticulously designed to challenge and evaluate text-rich image generation. For more detailed information and accessing our dataset, please refer to our Huggingface page:

Evaluation

Please refer to our evaluation folders for detailed information on evaluating with TextAtlasEval benchmark:

Data Format

The TextAtlas annotation documentation is available in huggingface:

  • main version: Contains image paths and pre-integrated prompts, making it suitable for direct training or evaluation.
  • Meta data: Includes all the data from main version, along with additional intermediate results such as bounding boxes (bbox), font size, and other related information, which can be used for further data analysis or processing.

Example

{
  "image_path": "0000089b-f1ce-41cf-9cd8-688856822244.png",
  "annotation": "In an opulent boutique, a sleek white digital display contrasts sharply with meticulously arranged merchandise and luxurious decor, creating a striking visual focal point. digital display with the text : ''Amidst the opulent ambiance of the upscale boutique, a sleek white digital display stands out as a striking contrast to the meticulously arranged merchandise and sumptuous luxury decor''"
}
entry description
image_path str, The image name
annotatoin str, Full Description

More Examples with images can be founded in 🤗 TextAtlas5M and 🤗 TextAtlasEval

For Metadata

In addition to the data from main version, meta data includes intermediate results retained during the processing of different subsets. These results provide useful metadata for further analysis, such as bounding boxes (bbox), font size, and other processing details.

Please refer to the TextAtlas Detailed Annotation for more comprehensive details on the meta annotations.

Citation

If you found our work useful, please consider citing:

@article{wang2025textatlas5m,
  title={TextAtlas5M: A Large-scale Dataset for Dense Text Image Generation},
  author={Wang, Alex Jinpeng and Mao, Dongxing and Zhang, Jiawei and Han, Weiming and Dong, Zhuobai and Li, Linjie and Lin, Yiqi and Yang, Zhengyuan and Qin, Libo and Zhang, Fuwei and others},
  journal={arXiv preprint arXiv:2502.07870},
  year={2025}
}

Releases

No releases published

Packages

No packages published

Languages