Search CORE

8,213 research outputs found

Analysis of large data sets using formal concept lattices

Author: Andrews Simon
Orphanides Constantinos
Publication venue: 'University of Seville'
Publication date: 01/01/2010
Field of study

Sheffield Hallam University Research Archive

A conceptual approach to gene expression analysis enhanced by visual analytics

Author: Andrews Simon
Aufaire Marie-Aude
Burger Albert
McLeod Kenneth
Melo Cassio
Orphanides Constantinos
Publication venue: ACM New York
Publication date: 01/01/2013
Field of study

The analysis of gene expression data is a complex task for biologists wishing to understand the role of genes in the formation of diseases such as cancer. Biologists need greater support when trying to discover, and comprehend, new relationships within their data. In this paper, we describe an approach to the analysis of gene expression data where overlapping groupings are generated by Formal Concept Analysis and interactively analyzed in a tool called CUBIST. The CUBIST workflow involves querying a semantic database and converting the result into a formal context, which can be simplified to make it manageable, before it is visualized as a concept lattice and associated charts

CiteSeerX

Heriot Watt Pure

Crossref

Sheffield Hallam University Research Archive

Recommended from our members

Incremental learning of independent, overlapping, and graded concept descriptions with an instance-based process framework

Author: Aha David W.
Publication venue: eScholarship, University of California
Publication date: 23/05/1989
Field of study

Supervised learning algorithms make several simplifying assumptions concerning the characteristics of the concept descriptions to be learned. For example, concepts are often assumed to be (1) defined with respect to the same set of relevant attributes, (2) disjoint in instance space, and (3) have uniform instance distributions. While these assumptions constrain the learning task, they unfortunately limit an algorithm's applicability. We believe that supervised learning algorithms should learn attribute relevancies independently for each concept, allow instances to be members of any subset of concepts, and represent graded concept descriptions. This paper introduces a process framework for instance-based learning algorithms that exploit only specific instance and performance feedback information to guide their concept learning processes. We also introduce Bloom, a specific instantiation of this framework. Bloom is a supervised, incremental, instance-based learning algorithm that learns relative attribute relevancies independently for each concept, allows instances to be members of any subset of concepts, and represents graded concept memberships. We describe empirical evidence to support our claims that Bloom can learn independent, overlapping, and graded concept descriptions

eScholarship - University of California

Recommended from our members

Combining numeric and symbolic learning techniques

Author: Granger.Jr. Richard H.
Schlimmer Jeffrey C.
Publication venue: eScholarship, University of California
Publication date: 01/01/1985
Field of study

Incremental learning from examples in a noisy domain is a difficult problem in Machine Learning. In this paper we divide the task into two subproblems and present a combination of numeric and symbolic approaches that yields robust learning of boolean characterizations. Our method has been implemented in a computer program, and we plot its empirical learning performance in the presence of varying amounts of noise

eScholarship - University of California

F-formation Detection: Individuating Free-standing Conversational Groups in Images

Author: Chiara Bassetti
Chiara Bassetti
Chris Russell
Francesco Setti
Marco Cristani
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 09/09/2014
Field of study

Detection of groups of interacting people is a very interesting and useful task in many modern technologies, with application fields spanning from video-surveillance to social robotics. In this paper we first furnish a rigorous definition of group considering the background of the social sciences: this allows us to specify many kinds of group, so far neglected in the Computer Vision literature. On top of this taxonomy, we present a detailed state of the art on the group detection algorithms. Then, as a main contribution, we present a brand new method for the automatic detection of groups in still images, which is based on a graph-cuts framework for clustering individuals; in particular we are able to codify in a computational sense the sociological definition of F-formation, that is very useful to encode a group having only proxemic information: position and orientation of people. We call the proposed method Graph-Cuts for F-formation (GCFF). We show how GCFF definitely outperforms all the state of the art methods in terms of different accuracy measures (some of them are brand new), demonstrating also a strong robustness to noise and versatility in recognizing groups of various cardinality.Comment: 32 pages, submitted to PLOS On

arXiv.org e-Print Archive

CiteSeerX

Directory of Open Access Journals

PubMed Central

Catalogo dei prodotti della ricerca

Adapted K-Nearest Neighbors for Detecting Anomalies on Spatio–Temporal Traffic Flow

Author: Belhadi Asma
Cano Alberto
Djenouri Djamel
Djenouri Youcef
Lin Chun Wei
Publication venue: VCU Scholars Compass
Publication date: 01/01/2019
Field of study

Outlier detection is an extensive research area, which has been intensively studied in several domains such as biological sciences, medical diagnosis, surveillance, and traffic anomaly detection. This paper explores advances in the outlier detection area by finding anomalies in spatio-temporal urban traffic flow. It proposes a new approach by considering the distribution of the flows in a given time interval. The flow distribution probability (FDP) databases are first constructed from the traffic flows by considering both spatial and temporal information. The outlier detection mechanism is then applied to the coming flow distribution probabilities, the inliers are stored to enrich the FDP databases, while the outliers are excluded from the FDP databases. Moreover, a k-nearest neighbor for distance-based outlier detection is investigated and adopted for FDP outlier detection. To validate the proposed framework, real data from Odense traffic flow case are evaluated at ten locations. The results reveal that the proposed framework is able to detect the real distribution of flow outliers. Another experiment has been carried out on Beijing data, the results show that our approach outperforms the baseline algorithms for high-urban traffic flow

VCU Scholars Compass

NORA - Norwegian Open Research Archives