Search CORE

77 research outputs found

Probabilistic Archetypal Analysis

Author: Eugster Manuel J. A.
Seth Sohan
Publication venue
Publication date: 07/04/2014
Field of study

Archetypal analysis represents a set of observations as convex combinations of pure patterns, or archetypes. The original geometric formulation of finding archetypes by approximating the convex hull of the observations assumes them to be real valued. This, unfortunately, is not compatible with many practical situations. In this paper we revisit archetypal analysis from the basic principles, and propose a probabilistic framework that accommodates other observation types such as integers, binary, and probability vectors. We corroborate the proposed methodology with convincing real-world applications on finding archetypal winter tourists based on binary survey data, archetypal disaster-affected countries based on disaster count data, and document archetypes based on term-frequency data. We also present an appropriate visualization tool to summarize archetypal analysis solution better.Comment: 24 pages; added literature review and visualizatio

arXiv.org e-Print Archive

CiteSeerX

New generalized crystallographic descriptors for structural machine learning

Author: Cumby James
Seth Sohan
Zhang Ruizhi
Publication venue: 'International Union of Crystallography (IUCr)'
Publication date: 22/08/2021
Field of study

Edinburgh Research Explorer

Grouped representation of interatomic distances as a similarity measure for crystal structures

Author: Cumby James
Seth Sohan
Zhang Rui-zhi
Publication venue: 'Royal Society of Chemistry (RSC)'
Publication date: 24/11/2022
Field of study

Edinburgh Research Explorer

Archetypal Analysis for Nominal Observations

Author: Eugster Manuel
Seth Sohan
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 19/08/2015
Field of study

Crossref

Edinburgh Research Explorer

Census-Independent Population Estimation using Representation Learning

Author: Diallo Mamadou S.
Neal Isaac
Seth Sohan
Watmough Gary
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 25/03/2022
Field of study

Knowledge of population distribution is critical for building infrastructure, distributing resources, and monitoring the progress of sustainable development goals. Although censuses can provide this information, they are typically conducted every 10 years with some countries having forgone the process for several decades. Population can change in the intercensal period due to rapid migration, development, urbanisation, natural disasters, and conflicts. Census-independent population estimation approaches using alternative data sources, such as satellite imagery, have shown promise in providing frequent and reliable population estimates locally. Existing approaches, however, require significant human supervision, for example annotating buildings and accessing various public datasets, and therefore, are not easily reproducible. We explore recent representation learning approaches, and assess the transferability of representations to population estimation in Mozambique. Using representation learning reduces required human supervision, since features are extracted automatically, making the process of population estimation more sustainable and likely to be transferable to other regions or countries. We compare the resulting population estimates to existing population products from GRID3, Facebook (HRSL) and WorldPop. We observe that our approach matches the most accurate of these maps, and is interpretable in the sense that it recognises built-up areas to be an informative indicator of population

PubMed Central

Edinburgh Research Explorer