TextAtlas5M

A Large-scale Dataset for Dense Text Image Generation

🌐 Homepage | 🏆 Leaderboard | 🤗 TextAtlas5M | 🤗 TextAtlasEval | 📖 TextAtlas arXiv

This repo contains the evaluation code for the paper "TextAtlas5M: A Large-scale Dataset for Dense Text Image Generation"

Updates

[2025-2-18]: Our evaluation code is now availble! 🌟
[2025-2-13]: released TextAtlasEval & TextAtlas5M version 1.0 🔥

Setup

conda create -n TextAtlasEval python=3.9 
conda activate TextAtlasEval

pip install -r requirements.txt

Accessing Datasets

TextAtlas was meticulously designed to challenge and evaluate text-rich image generation. For more detailed information and accessing our dataset, please refer to our Huggingface page:

Evaluation

Please refer to our evaluation folders for detailed information on evaluating with TextAtlasEval benchmark:

TextAtlas Evaluation

Data Format

The TextAtlas annotation documentation is available in huggingface:

main version: Contains image paths and pre-integrated prompts, making it suitable for direct training or evaluation.
Meta data: Includes all the data from main version, along with additional intermediate results such as bounding boxes (bbox), font size, and other related information, which can be used for further data analysis or processing.

Example

{
  "image_path": "0000089b-f1ce-41cf-9cd8-688856822244.png",
  "annotation": "In an opulent boutique, a sleek white digital display contrasts sharply with meticulously arranged merchandise and luxurious decor, creating a striking visual focal point. digital display with the text : ''Amidst the opulent ambiance of the upscale boutique, a sleek white digital display stands out as a striking contrast to the meticulously arranged merchandise and sumptuous luxury decor''"
}

entry	description
`image_path`	`str`, The image name
`annotatoin`	`str`, Full Description

More Examples with images can be founded in 🤗 TextAtlas5M and 🤗 TextAtlasEval

For Metadata

In addition to the data from main version, meta data includes intermediate results retained during the processing of different subsets. These results provide useful metadata for further analysis, such as bounding boxes (bbox), font size, and other processing details.

Please refer to the TextAtlas Detailed Annotation for more comprehensive details on the meta annotations.

Citation

If you found our work useful, please consider citing:

@article{wang2025textatlas5m,
  title={TextAtlas5M: A Large-scale Dataset for Dense Text Image Generation},
  author={Wang, Alex Jinpeng and Mao, Dongxing and Zhang, Jiawei and Han, Weiming and Dong, Zhuobai and Li, Linjie and Lin, Yiqi and Yang, Zhengyuan and Qin, Libo and Zhang, Fuwei and others},
  journal={arXiv preprint arXiv:2502.07870},
  year={2025}
}

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
detialed_annotation		detialed_annotation
evaluation		evaluation
README.md		README.md
data-display-overall-w-ann-v2.svg		data-display-overall-w-ann-v2.svg
intro-dataset.svg		intro-dataset.svg
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TextAtlas5M

A Large-scale Dataset for Dense Text Image Generation

Updates

Table of Contents

Setup

Accessing Datasets

Evaluation

Data Format

Example

For Metadata

Citation

About

Releases

Packages

Languages

CSU-JPG/TextAtlas

Folders and files

Latest commit

History

Repository files navigation

TextAtlas5M

A Large-scale Dataset for Dense Text Image Generation

Updates

Table of Contents

Setup

Accessing Datasets

Evaluation

Data Format

Example

For Metadata

Citation

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages