Search CORE

25,769 research outputs found

Quantitative Perspectives on Fifty Years of the Journal of the History of Biology

Author: Bottino Erin
Damerow Julia L.
Laubichler Manfred D.
Peirson B. R. Erick
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

Journal of the History of Biology provides a fifty-year long record for examining the evolution of the history of biology as a scholarly discipline. In this paper, we present a new dataset and preliminary quantitative analysis of the thematic content of JHB from the perspectives of geography, organisms, and thematic fields. The geographic diversity of authors whose work appears in JHB has increased steadily since 1968, but the geographic coverage of the content of JHB articles remains strongly lopsided toward the United States, United Kingdom, and western Europe and has diversified much less dramatically over time. The taxonomic diversity of organisms discussed in JHB increased steadily between 1968 and the late 1990s but declined in later years, mirroring broader patterns of diversification previously reported in the biomedical research literature. Finally, we used a combination of topic modeling and nonlinear dimensionality reduction techniques to develop a model of multi-article fields within JHB. We found evidence for directional changes in the representation of fields on multiple scales. The diversity of JHB with regard to the representation of thematic fields has increased overall, with most of that diversification occurring in recent years. Drawing on the dataset generated in the course of this analysis, as well as web services in the emerging digital history and philosophy of science ecosystem, we have developed an interactive web platform for exploring the content of JHB, and we provide a brief overview of the platform in this article. As a whole, the data and analyses presented here provide a starting-place for further critical reflection on the evolution of the history of biology over the past half-century.Comment: 45 pages, 14 figures, 4 table

arXiv.org e-Print Archive

Woods Hole Open Access Server

MPG.PuRe

Exploratory topic modeling with distributional semantics

Author: A Treisman
DA Keim
DM Blei
J Risch
L Barth
M Bostock
S Fortunato
S Lohmann
S Palmer
Y Bengio
Publication venue
Publication date: 16/07/2015
Field of study

As we continue to collect and store textual data in a multitude of domains, we are regularly confronted with material whose largely unknown thematic structure we want to uncover. With unsupervised, exploratory analysis, no prior knowledge about the content is required and highly open-ended tasks can be supported. In the past few years, probabilistic topic modeling has emerged as a popular approach to this problem. Nevertheless, the representation of the latent topics as aggregations of semi-coherent terms limits their interpretability and level of detail. This paper presents an alternative approach to topic modeling that maps topics as a network for exploration, based on distributional semantics using learned word vectors. From the granular level of terms and their semantic similarity relations global topic structures emerge as clustered regions and gradients of concepts. Moreover, the paper discusses the visual interactive representation of the topic map, which plays an important role in supporting its exploration.Comment: Conference: The Fourteenth International Symposium on Intelligent Data Analysis (IDA 2015

arXiv.org e-Print Archive

Crossref

Recommended from our members

Geovisualization of household energy consumption characteristics

Author: Dykes J.
Goodwin S.
Publication venue
Publication date: 01/01/2012
Field of study

A vast amount of quantitative data is available within the energy sector, however, there is limited understanding of the relationships between neighbourhoods, demographic characteristics and domestic energy consumption habits. We report upon research that will combine datasets relating to energy consumption, saving and loss with geodemographics to enable better understanding of energy user types. A novel interactive interface is planned to evaluate the performance of these energy-based classifications. The research aims to help local governments and the energy industry in targeting households and populations for new energy saving schemes and in improving efforts to promote sustainable energy consumption. Energy based neighbourhood classifications will also promote consumption awareness amongst domestic users. This poster describes the research methodology, data sources and visualization requirements

City Research Online

Mapping crime: Understanding Hotspots

Author: Cameron J
Chainey S
Eck J
Wilson R
Publication venue: National Institute of Justice
Publication date: 01/08/2005
Field of study

UCL Discovery

Recommended from our members

Mediating geovisualization to potential users and prototyping a geovisualization application

Author: Dykes J.
Lloyd D.
Radburn R.
Publication venue
Publication date: 01/01/2008
Field of study

City Research Online

The structure of R&D collaboration networks in the European Framework Programmes

Author: Barber Michael J.
Roediger-Schluga Thomas
Publication venue
Publication date
Field of study

Using a large and novel data source, we study the structure of R&D collaboration net-works in the first five EU Framework Programmes (FPs). The networks display proper-ties typical for complex networks, including scale-free degree distributions and the small-world property. Structural features are common across FPs, indicating similar network formation mechanisms despite changes in governance rules. Several findings point towards the existence of a stable core of interlinked actors since the early FPs with integration increasing over time. This core consists mainly of universities and research organisations. We observe assortative mixing by degree of projects, but not by degree of organisations. Unexpectedly, we find only weak association between central projects and project size, suggesting that different types of projects attract different groups of actors. In particular, large projects appear to have included few of the pivotal actors in the networks studied. Central projects only partially mirror funding priorities, indicating field-specific differences in network structures. The paper concludes with an agenda for future research.R&D collaboration, EU Framework Programmes, Complex Networks, Small World Effect, Centrality Measures, European Research Area

Research Papers in Economics

Characterizing the Landscape of Musical Data on the Web: State of the Art and Challenges

Author: d'Aquin Mathieu
Daga Enrico
Daquino Marilena
Gangemi Aldo
Holland Simon
Laney Robin
Mulholland Paul
Penuela Albert Merono
Publication venue
Publication date: 01/01/2017
Field of study

Musical data can be analysed, combined, transformed and exploited for diverse purposes. However, despite the proliferation of digital libraries and repositories for music, infrastructures and tools, such uses of musical data remain scarce. As an initial step to help fill this gap, we present a survey of the landscape of musical data on the Web, available as a Linked Open Dataset: the musoW dataset of catalogued musical resources. We present the dataset and the methodology and criteria for its creation and assessment. We map the identified dimensions and parameters to existing Linked Data vocabularies, present insights gained from SPARQL queries, and identify significant relations between resource features. We present a thematic analysis of the original research questions associated with surveyed resources and identify the extent to which the collected resources are Linked Data-ready

VU Research Portal

Open Research Online (The Open University)

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

Visual and interactive exploration of point data

Author: Tobon Carolina
Publication venue: Centre for Advanced Spatial Analysis (UCL)
Publication date: 01/01/2001
Field of study

Point data, such as Unit Postcodes (UPC), can provide very detailed information at fine scales of resolution. For instance, socio-economic attributes are commonly assigned to UPC. Hence, they can be represented as points and observable at the postcode level. Using UPC as a common field allows the concatenation of variables from disparate data sources that can potentially support sophisticated spatial analysis. However, visualising UPC in urban areas has at least three limitations. First, at small scales UPC occurrences can be very dense making their visualisation as points difficult. On the other hand, patterns in the associated attribute values are often hardly recognisable at large scales. Secondly, UPC can be used as a common field to allow the concatenation of highly multivariate data sets with an associated postcode. Finally, socio-economic variables assigned to UPC (such as the ones used here) can be non-Normal in their distributions as a result of a large presence of zero values and high variances which constrain their analysis using traditional statistics. This paper discusses a Point Visualisation Tool (PVT), a proof-of-concept system developed to visually explore point data. Various well-known visualisation techniques were implemented to enable their interactive and dynamic interrogation. PVT provides multiple representations of point data to facilitate the understanding of the relations between attributes or variables as well as their spatial characteristics. Brushing between alternative views is used to link several representations of a single attribute, as well as to simultaneously explore more than one variable. PVT’s functionality shows how the use of visual techniques embedded in an interactive environment enable the exploration of large amounts of multivariate point data

CiteSeerX

UCL Discovery