Scatter networks: a new approach for analysing information scatter

Author: Adamic L A
Adar E
Bates M J
Bhavnani S K Adamic L A
Bradford S C
K Suresh
Kleinberg J M
Koren Y
Lada A Adamic
Maslov S
Milgram S
Over P
Page L Brin S Motwani R Winograd T
Tenopir C
Xiaolin Shi
Zipf G K
Publication venue: 'IOP Publishing'
Publication date: 01/07/2007

Information on any given topic is often scattered across the Web. Previously, this scatter has been characterized through the inequality of distribution of facts (i.e. pieces of information) across webpages. Such an approach conceals how specific facts (e.g. rare facts) occur in specific types of pages (e.g. fact-rich pages). To reveal such regularities, we construct bipartite networks, consisting of two types of vertices: the facts contained in webpages and the webpages themselves. Such a representation enables the application of a series of network analysis techniques, revealing structural features such as connectivity, robustness and clustering. Not only does network analysis yield new insights into information scatter, but we also illustrate the benefit of applying new and existing analysis techniques directly to a bipartite network as opposed to its one-mode projection. We discuss the implications of each network feature for users' ability to find comprehensive information online. Finally, we compare the bipartite graph structure of webpages and facts with the hyperlink structure between the webpages. Peer Reviewed. http://deepblue.lib.umich.edu/bitstream/2027.42/58170/2/njp7_7_231.pd
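As a rough illustration of the representation described in this abstract, the sketch below builds a tiny fact-webpage bipartite network and its one-mode projection onto pages. The page and fact names, the helper functions, and the shared-fact edge weighting are all assumptions made for illustration; they are not taken from the paper.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical toy data: each webpage maps to the set of facts it contains.
page_facts = {
    "page_a": {"f1", "f2", "f3"},
    "page_b": {"f1", "f2"},
    "page_c": {"f1"},
    "page_d": {"f4"},
}


def fact_degrees(page_facts):
    """Degree of each fact vertex: the number of pages that contain it."""
    degrees = defaultdict(int)
    for facts in page_facts.values():
        for fact in facts:
            degrees[fact] += 1
    return dict(degrees)


def project_onto_pages(page_facts):
    """One-mode projection: link two pages if they share at least one fact.

    The edge weight is the number of shared facts (an assumed weighting).
    """
    weights = {}
    for p, q in combinations(sorted(page_facts), 2):
        shared = len(page_facts[p] & page_facts[q])
        if shared:
            weights[(p, q)] = shared
    return weights


degrees = fact_degrees(page_facts)
projection = project_onto_pages(page_facts)
```

In this toy data the bipartite form shows that the rare fact `f3` occurs only on the fact-rich `page_a`. The projection keeps only how many facts two pages share, not which ones, so that detail is lost.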

Crossref

Deep Blue Documents at the University of Michigan

Discovering Sets of Key Players in Social Networks

Author: Ortiz-Arroyo Daniel
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010

VBN

Exploiting Social Annotation for Automatic Resource Discovery

Author: Lerman Kristina
Plangprasopchok Anon
Publication date: 01/01/2007

Information integration applications, such as mediators or mashups, that require access to information resources currently rely on users manually discovering and integrating them in the application. Manual resource discovery is a slow process, requiring the user to sift through results obtained via keyword-based search. Although search methods have advanced to include evidence from a document's contents, its metadata, and the contents and link structure of the referring pages, they still do not adequately cover information sources, often called "the hidden Web", that dynamically generate documents in response to a query. The recently popular social bookmarking sites, which allow users to annotate and share metadata about various information sources, provide rich evidence for resource discovery. In this paper, we describe a probabilistic model of the user annotation process in the social bookmarking system del.icio.us. We then use the model to automatically find resources relevant to a particular information domain. Our experimental results on data obtained from del.icio.us show this approach to be a promising method for helping automate the resource discovery task. Comment: 6 pages, submitted to AAAI07 workshop on Information Integration on the We

arXiv.org e-Print Archive

CiteSeerX

Proceedings of the 2nd Computer Science Student Workshop: Microsoft Istanbul, Turkey, April 9, 2011

Publication venue: 'Sabanci University Information Center'
Publication date: 01/01/2011

Sabanci University Research Database

Design issues for agent-based resource locator systems

Author: Gary Wills
Harith Alani
Ronald Ashri
Richard Crowder
Yannis Kalfoglou
Sanghee Kim
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2002

While knowledge is viewed by many as an asset, it is often difficult to locate particular items within a large electronic corpus. This paper presents an agent-based framework for the location of resources to resolve a specific query, and considers the associated design issues. Aspects of the work presented complement current research into both expertise finders and recommender systems. The essential issues for the proposed design are scalability, together with the ability to learn and adapt to changing resources. As knowledge is often implicit within electronic resources, and therefore difficult to locate, we have proposed the use of ontologies to extract the semantics and infer meaning to obtain the results required. We explore the use of communities of practice, applying ontology-based networks and e-mail message exchanges to aid the resource discovery process.

CiteSeerX

Crossref

Southampton (e-Prints Soton)

Open Research Online (The Open University)