Urdu Handwriting Dataset for Demographic Traits Classification

Mirza, Ghulam Ali; Mustufa, Syed Ghulam; Rehman, Huma

Urdu Handwriting Dataset for Demographic Traits Classification

Authors: Ghulam Ali Mirza
Syed Ghulam Mustufa
Huma Rehman
Publication date
Publisher
Doi

Abstract

Urdu Handwriting Dataset for Demographic Traits Classification was developed in Bahria University, Islamabad, Pakistan as a part of the bachelor's degree final year thesis/project. This is a unique dataset which is the first of its kind. The dataset is composed of 1000 unique handwriting images each taken from unique individuals. It can be seen in the title, the handwriting samples are specifically in Urdu Language. Urdu Handwriting Dataset is made for the Classification of Demographic Traits problem due to which it consists of the demographic information of each individual. Following are the demographic traits that are covered in the dataset: Gender (Male, Female) Handedness (Left, Right) Age-Group (15-20,21-30,31-40,41-50,51-up) Province (Balochistan, Sindh, Punjab, kpk, gilgit-baltistan, none) Occupation (Student, Employee, Both, None) Education (Primary(Below Matriculation), Matriculation, Intermediate, Bachelors, Masters, PHD, None

Similar works

Full text

Available Versions

ZENODO

oai:zenodo.org:2573099

Last time updated on 09/07/2019