Urdu Handwriting Dataset for Demographic Traits Classification

Abstract

Urdu Handwriting Dataset for Demographic Traits Classification was developed in Bahria University, Islamabad, Pakistan as a part of the bachelor's degree final year thesis/project. This is a unique dataset which is the first of its kind. The dataset is composed of 1000 unique handwriting images each taken from unique individuals. It can be seen in the title, the handwriting samples are specifically in Urdu Language. Urdu Handwriting Dataset is made for the Classification of Demographic Traits problem due to which it consists of the demographic information of each individual. Following are the demographic traits that are covered in the dataset: Gender (Male, Female) Handedness (Left, Right) Age-Group (15-20,21-30,31-40,41-50,51-up) Province (Balochistan, Sindh, Punjab, kpk, gilgit-baltistan, none) Occupation (Student, Employee, Both, None) Education (Primary(Below Matriculation), Matriculation, Intermediate, Bachelors, Masters, PHD, None

    Similar works

    Full text

    thumbnail-image

    Available Versions

    Last time updated on 09/07/2019