I created a dataset consisting of 10,000 captcha images of different colors, fonts with random alignment.
The captcha contains 10-character long strings, each of size 282x90.
Here is the dataset: https://www.kaggle.com/aadhavvignesh/captcha-images
You can use this for text identification using PyTorch, or feel free to use it in any way!