The Extraction of Community Structures from Publication Networks to Support Ethnographic Observations of Field Differences in Scientific Communication
The scientific community of researchers in a research specialty is an
important unit of analysis for understanding the field-specific shaping of
scientific communication practices. These scientific communities are, however,
a challenging unit of analysis to capture and compare because they overlap,
have fuzzy boundaries, and evolve over time. We describe a network analytic
approach that reveals the complexities of these communities through examination
of their publication networks in combination with insights from ethnographic
field studies. We suggest that the structures revealed indicate overlapping
sub- communities within a research specialty and we provide evidence that they
differ in disciplinary orientation and research practices. By mapping the
community structures of scientific fields we aim to increase confidence about
the domain of validity of ethnographic observations as well as of collaborative
patterns extracted from publication networks, thereby enabling the systematic
study of field differences. The network analytic methods presented include
methods to optimize the delineation of a bibliographic data set in order to
adequately represent a research specialty, and methods to extract community
structures from this data. We demonstrate the application of these methods in a
case study of two research specialties in the physical and chemical sciences.
Comment: Accepted for publication in JASIS
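The community-extraction step this abstract describes can be loosely illustrated with a label-propagation pass over a co-author graph. The graph below, its author names, and the single bridge edge are invented for illustration; this is a minimal sketch of the general technique, not the delineation or extraction method of the case study itself.

```python
# Toy label propagation on an invented co-author network: two densely
# connected author groups joined by one bridge edge (a3 -- b1).
from collections import Counter

graph = {  # adjacency lists; all authors are hypothetical
    "a1": ["a2", "a3"], "a2": ["a1", "a3"], "a3": ["a1", "a2", "b1"],
    "b1": ["b2", "b3", "a3"], "b2": ["b1", "b3"], "b3": ["b1", "b2"],
}

labels = {node: node for node in graph}  # start: each author is its own community
for _ in range(10):  # a few synchronous sweeps suffice on a toy graph
    new = {}
    for node, neigh in graph.items():
        counts = Counter(labels[n] for n in neigh)
        best = max(counts.values())
        # deterministic tie-break: smallest label among the most frequent
        new[node] = min(l for l, c in counts.items() if c == best)
    labels = new

communities = {}
for node, lab in labels.items():
    communities.setdefault(lab, set()).add(node)
print(sorted(len(c) for c in communities.values()))  # [3, 3]
```

The bridge edge is outvoted by each group's internal edges, so the two author groups settle into separate communities.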
Searching for temporal patterns in the time series of publications of authors in a research specialty
In this paper we report results of our investigation of temporal patterns in the publication activity of authors in a research specialty. We base our analysis on Web of Science data for a field in the physical and chemical sciences from 1991-2012. We determine the research groups in the field by clustering the co-author network and generate our sample for this analysis by selecting the most productive author of each co-author cluster to represent the activity of that group. Whereas a statistical time series analysis did not reveal any specific patterns, a time series clustering approach generated a grouping of time series that correlates with the structural network position ('node role') of the respective authors in the clustered co-author network. This work is part of a long-term research project employing a mix of qualitative and network analytic methods to investigate field-specific differences in collaborative patterns.
Peer Reviewed
http://deepblue.lib.umich.edu/bitstream/2027.42/111080/1/meet14505101039.pd
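Time series clustering of the kind this abstract mentions typically compares the shape of each author's yearly output rather than its volume. A minimal sketch of that preprocessing step, with invented authors and publication counts (not the Web of Science data of the study):

```python
# Hypothetical sketch: z-normalise yearly publication counts so that a
# distance-based clustering compares the shape of a series, not its volume.
import math

def znorm(series):
    """Z-normalise a series (zero mean, unit variance)."""
    n = len(series)
    mean = sum(series) / n
    std = math.sqrt(sum((x - mean) ** 2 for x in series) / n)
    if std == 0:
        return [0.0] * n
    return [(x - mean) / std for x in series]

def dist(a, b):
    """Euclidean distance between two equal-length series."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

# Yearly paper counts per representative author (invented data).
series = {
    "A": [1, 2, 4, 8, 9],    # steadily growing output
    "B": [2, 4, 8, 16, 18],  # same growth shape at twice the volume
    "C": [9, 8, 4, 2, 1],    # declining output
}
norm = {k: znorm(v) for k, v in series.items()}

# After z-normalisation, A and B coincide in shape, while C is far from both.
print(dist(norm["A"], norm["B"]) < dist(norm["A"], norm["C"]))  # True
```

Any clustering over such a distance matrix would group A with B and separate C, regardless of the absolute publication counts.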
Proceedings of the NODALIDA 2011 Workshop
Constraint Grammar Applications.
Editors: Eckhard Bick, Kristin Hagen, Kaili Müürisep, Trond Trosterud.
NEALT Proceedings Series, Vol. 14 (2011), vi+69 pp.
© 2011 The editors and contributors.
Published by
Northern European Association for Language
Technology (NEALT)
http://omilia.uio.no/nealt
Electronically published at
Tartu University Library (Estonia)
http://hdl.handle.net/10062/19231
Intelligent Information Access to Linked Data - Weaving the Cultural Heritage Web
The subject of the dissertation is an information alignment experiment between two cultural heritage information systems (ALAP): the Perseus Digital Library and Arachne. In modern societies, information integration is gaining importance for many tasks such as business decision making or even catastrophe management. It is beyond doubt that the information available in digital form can offer users new ways of interaction. Also, in the humanities and cultural heritage communities, more and more information is being published online. But in many situations the way that information has been made publicly available is disruptive to the research process due to its heterogeneity and distribution. Therefore, integrated information will be a key factor in pursuing successful research, and the need for information alignment is widely recognized.
ALAP is an attempt to integrate information from Perseus and Arachne, not only on a schema level, but to also perform entity resolution. To that end, technical peculiarities and philosophical implications of the concepts of identity and co-reference are discussed. Multiple approaches to information integration and entity resolution are discussed and evaluated. The methodology that is used to implement ALAP is mainly rooted in the fields of information retrieval and knowledge discovery.
First, an exploratory analysis was performed on both information systems to get a first impression of the data. After that, (semi-)structured information from both systems was extracted and normalized. Then, a clustering algorithm was used to reduce the number of entity comparisons needed. Finally, a thorough matching was performed within the different clusters. ALAP helped with identifying challenges and highlighted the opportunities that arise in the attempt to align cultural heritage information systems.
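The normalize / cluster / match pipeline described above can be sketched in miniature. The record fields, blocking key, similarity measure, and threshold below are all invented stand-ins for illustration, not the methods actually used in ALAP:

```python
# Hypothetical entity-resolution sketch: normalise records from two
# systems, block them to cut down pairwise comparisons, then match
# within each block.
from collections import defaultdict

def normalise(record):
    """Lowercase and trim every field value."""
    return {k: v.strip().lower() for k, v in record.items()}

def block_key(record):
    # Cheap blocking key: first letter of the title + first two date digits.
    return (record["title"][0], record["date"][:2])

def similarity(a, b):
    """Jaccard overlap of title tokens, as a toy matcher."""
    ta, tb = set(a["title"].split()), set(b["title"].split())
    return len(ta & tb) / len(ta | tb)

# Invented records standing in for the two systems' exports.
perseus = [{"title": "Bust of Pericles", "date": "0430 BC"}]
arachne = [{"title": "bust of pericles ", "date": "0430 bc"},
           {"title": "Red-figure amphora", "date": "0510 bc"}]

blocks = defaultdict(lambda: ([], []))
for r in map(normalise, perseus):
    blocks[block_key(r)][0].append(r)
for r in map(normalise, arachne):
    blocks[block_key(r)][1].append(r)

# Only records sharing a block are ever compared.
matches = [(a, b) for left, right in blocks.values()
           for a in left for b in right if similarity(a, b) > 0.8]
print(len(matches))  # 1: the two Pericles records align
```

Blocking trades a small risk of missed matches for a large reduction in comparisons, which is exactly why a clustering step precedes the thorough matching.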
Improving Search via Named Entity Recognition in Morphologically Rich Languages – A Case Study in Urdu
University of Minnesota Ph.D. dissertation. February 2018. Major: Computer Science. Advisors: Vipin Kumar, Blake Howald. 1 computer file (PDF); xi, 236 pages.
Search is not a solved problem, even in the world of Google's and Bing's state-of-the-art engines. Google and similar search engines are keyword based. Keyword-based searching suffers from the vocabulary mismatch problem: the terms in the document and in the user's information request don't overlap, for example "cars" and "automobiles". This phenomenon is called synonymy. Similarly, the user's term may be polysemous: a user is inquiring about a river's bank, but documents about financial institutions are matched. Vocabulary mismatch is exacerbated when the search occurs in a Morphologically Rich Language (MRL). Concept search techniques like dimensionality reduction do not improve search in MRLs. Names frequently occur in news text and determine the "what," "where," "when," and "who" of the news text. Named Entity Recognition (NER) attempts to recognize names in text automatically, but these techniques are far from mature in MRLs, especially in Arabic-script languages. Urdu is one of the focus MRLs of this dissertation, among Arabic, Farsi, Hindi, and Russian, but it lacks the enabling technologies for NER and search. A corpus, a stop-word generation algorithm, a light stemmer, a baseline, and a NER algorithm are created so that NER-aware search can be accomplished for Urdu. This dissertation demonstrates that NER-aware search on Arabic, Russian, Urdu, and English shows significant improvement over the baseline. Furthermore, it highlights the challenges of research in low-resource MRLs.
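The role of a light stemmer in reducing vocabulary mismatch can be illustrated with a toy example. The suffix list below is an invented English stand-in, not the dissertation's Urdu stemmer, and the matching is deliberately simplistic:

```python
# Hypothetical illustration: a light stemmer strips a few common suffixes
# so that different surface forms of a word map to one index term.
SUFFIXES = ("ations", "ation", "ings", "ing", "es", "s")

def light_stem(token):
    """Strip the first matching suffix, keeping a stem of >= 3 characters."""
    for suf in SUFFIXES:
        if token.endswith(suf) and len(token) - len(suf) >= 3:
            return token[: len(token) - len(suf)]
    return token

def index_terms(text):
    """Lowercase, tokenize on whitespace, and stem each token."""
    return {light_stem(t) for t in text.lower().split()}

doc = "automation of search rankings"
query = "automate search ranking"
# Without stemming, "rankings" and "ranking" are distinct index terms and
# miss each other; with the light stemmer both reduce to "rank".
print(index_terms(doc) & index_terms(query))
```

In a morphologically rich language the number of surface forms per lemma is far larger, which is why even this light form of conflation matters for recall.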
Fusing Automatically Extracted Annotations for the Semantic Web
This research focuses on the problem of semantic data fusion. Although various solutions have been developed in the research communities focusing on databases and formal logic, the choice of an appropriate algorithm is non-trivial because the performance of each algorithm and its optimal configuration parameters depend on the type of data to which the algorithm is applied. In order to be reusable, the fusion system must be able to select appropriate techniques and use them in combination.
Moreover, because of the varying reliability of data sources and of the algorithms performing fusion subtasks, uncertainty is an inherent feature of semantically annotated data and has to be taken into account by the fusion system. Finally, the issue of schema heterogeneity can have a negative impact on fusion performance. To address these issues, we propose KnoFuss: an architecture for Semantic Web data integration based on the principles of problem-solving methods. Algorithms dealing with different fusion subtasks are represented as components of a modular architecture, and their capabilities are described formally. This allows the architecture to select appropriate methods and configure them depending on the processed data. In order to handle uncertainty, we propose a novel algorithm based on Dempster-Shafer belief propagation. KnoFuss employs this algorithm to reason about uncertain data and method results in order to refine the fused knowledge base. Tests show that these solutions lead to improved fusion performance. Finally, we addressed the problem of data fusion in the presence of schema heterogeneity. We extended the KnoFuss framework to exploit the results of automatic schema alignment tools and proposed our own schema matching algorithm aimed at facilitating data fusion in the Linked Data environment. We conducted experiments with this approach and obtained a substantial improvement in performance in comparison with public data repositories.
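The uncertainty-handling idea rests on Dempster-Shafer theory, whose core operation is Dempster's rule of combination. A minimal sketch of that rule, applied to an invented co-reference question (the frame of discernment and all mass values below are assumptions, not KnoFuss's actual model):

```python
# Hypothetical sketch of Dempster's rule of combination: two sources assign
# belief mass to hypotheses about whether two instances co-refer; the
# combined mass renormalises away the conflicting portion.

def combine(m1, m2):
    """Dempster's rule for two mass functions over set-valued hypotheses."""
    combined = {}
    conflict = 0.0
    for h1, v1 in m1.items():
        for h2, v2 in m2.items():
            inter = h1 & h2
            if inter:
                combined[inter] = combined.get(inter, 0.0) + v1 * v2
            else:
                conflict += v1 * v2  # mass assigned to disjoint hypotheses
    k = 1.0 - conflict
    return {h: v / k for h, v in combined.items()}

# Frame of discernment: are the two instances the same entity or not?
SAME, DIFF = frozenset({"same"}), frozenset({"different"})
BOTH = SAME | DIFF  # mass on the full frame expresses ignorance

src_a = {SAME: 0.6, BOTH: 0.4}              # e.g. a string matcher
src_b = {SAME: 0.7, DIFF: 0.1, BOTH: 0.2}   # e.g. a structural matcher

fused = combine(src_a, src_b)
print(round(fused[SAME], 3))  # 0.872
```

Two weakly confident but agreeing sources thus yield a combined belief stronger than either alone, which is the behaviour a fusion system wants when refining a knowledge base.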