3 research outputs found
DHLP 1&2: Giraph based distributed label propagation algorithms on heterogeneous drug-related networks
Background and Objective: Heterogeneous complex networks are large graphs
consisting of different types of nodes and edges. The knowledge extraction from
these networks is complicated. Moreover, the scale of these networks is
steadily increasing. Thus, scalable methods are required. Methods: In this
paper, two distributed label propagation algorithms for heterogeneous networks,
namely DHLP-1 and DHLP-2 have been introduced. Biological networks are one type
of the heterogeneous complex networks. As a case study, we have measured the
efficiency of our proposed DHLP-1 and DHLP-2 algorithms on a biological network
consisting of drugs, diseases, and targets. The subject we have studied in this
network is drug repositioning but our algorithms can be used as general methods
for heterogeneous networks other than the biological network. Results: We
compared the proposed algorithms with similar non-distributed versions of them
namely MINProp and Heter-LP. The experiments revealed the good performance of
the algorithms in terms of running time and accuracy.Comment: Source code available for Apache Giraph on Hadoo
A parameter-free label propagation algorithm using bipartite heterogeneous networks for text classification
A bipartite heterogeneous network is one of the simplest ways to represent a textual document collection. In such case, the network consists of two types of vertices, representing documents and terms, and links connecting terms to the documents. Transductive algorithms are usually applied to perform classi cation of networked objects. This type of classi cation is usually applied when few labeled examples are available, which may be worthwhile for practical situations. Nevertheless, for existing transductive algorithms users have to set several parameters that signi cantly affect the classi cation accuracy. In this paper, we propose a parameter-free algorithm for transductive classi cation of textual data, referred to as LPBHN (Label Propagation using Bipartite Heterogeneous Networks). LPBHN uses a bipartite heterogeneous network to perform the classi cátion task. The proposed algorithm presents accuracy equivalente or higher than state-of-the-art algorithms for transductive classi cation in heterogeneous or homogeneous networks
Using Social Media Websites to Support Scenario-Based Design of Assistive Technology
Indiana University-Purdue University Indianapolis (IUPUI)Having representative users, who have the targeted disability, in accessibility
studies is vital to the validity of research findings. Although it is a widely accepted tenet
in the HCI community, many barriers and difficulties make it very resource-demanding
for accessibility researchers to recruit representative users. As a result, researchers recruit
non-representative users, who do not have the targeted disability, instead of
representative users in accessibility studies. Although such an approach has been widely
justified, evidence showed that findings derived from non-representative users could be
biased and even misleading. To address this problem, researchers have come up with
different solutions such as building pools of users to recruit from. But still, the data is not
widely available and needs a lot of effort and resource to build and maintain.
On the other hand, online social media websites have become popular in the last
decade. Many online communities have emerged that allow online users to discuss
health-related subjects, exchange useful information, or provide emotional support. A
large amount of data accumulated in such online communities have gained attention from
researchers in the healthcare domain. And many researches have been done based on data
from social media websites to better understand health problems to improve the wellbeing
of people.
Despite the increasing popularity, the value of data from social media websites for
accessibility research remains untapped. Hence, my work aims to create methods that
could extract valuable information from data collected on social media websites for accessibility practitioners to support their design process. First, I investigate methods that
enable researchers to effectively collect representative data from social media websites.
More specifically, I look into machine learning approaches that could allow researchers
to automatically identify online users who have disabilities (representative users).
Second, I investigate methods that could extract useful information from user-generated
free-text using techniques drawn from the information extraction domain. Last, I explore
how such information should be visualized and presented for designers to support the
scenario-based design process in accessibility studies