1st INCF Workshop on Sustainability of Neuroscience Databases

Author: Jaap van Pelt
Jack Van Horn
Publication date: 17/06/2008

The goal of the workshop was to discuss issues related to the sustainability of neuroscience databases, identify problems and propose solutions, and formulate recommendations to the INCF. The report summarizes the discussions of invited participants from the neuroinformatics community as well as from other disciplines where sustainability issues have already been approached. The recommendations for the INCF involve rating, ranking, and supporting database sustainability.

Crossref

Nature Precedings

Identification-method research for open-source software ecosystems

Author: Liao Zhifang
Liu Hui
Liu Shengzong
Wang Ningwei
Zhang Qi
Zhang Yan
Publication venue: MDPI AG
Publication date: 01/02/2019

In recent years, open-source software (OSS) development has grown, with many developers around the world working on different OSS projects. A variety of open-source software ecosystems have emerged, for instance, GitHub, StackOverflow, and SourceForge. One of the most typical social-programming and code-hosting sites, GitHub, has amassed numerous open-source-software projects and developers in the same virtual collaboration platform. Since GitHub itself is a large open-source community, it hosts a collection of software projects that are developed together and coevolve. The great challenge here is how to identify the relationship between these projects, i.e., project relevance. Software-ecosystem identification is the basis of other studies in the ecosystem. Therefore, how to extract useful information in GitHub and identify software ecosystems is particularly important, and it is also a research area in symmetry. In this paper, a Topic-based Project Knowledge Metrics Framework (TPKMF) is proposed. By collecting the multisource dataset of an open-source ecosystem, project-relevance analysis of the open-source software is carried out on the basis of software-ecosystem identification. Then, we used our Spectral Clustering algorithm based on Core Project (CP-SC) to identify software-ecosystem projects and further identify software ecosystems. We verified that most software ecosystems usually contain a core software project, and most other projects are associated with it. Furthermore, we analyzed the characteristics of the ecosystem, and we also found that interactive information has a greater impact on project relevance. Finally, we summarize the Topic-based Project Knowledge Metrics Framework.

Multidisciplinary Digital Publishing Institute

Directory of Open Access Journals

ResearchOnline@GCU

Knowledge-based Biomedical Data Science 2019

Author: Callahan Tiffany J.
Hunter Lawrence E.
Pielke-Lombardo Harrison
Tripodi Ignacio J.
Publication date: 08/10/2019

Knowledge-based biomedical data science (KBDS) involves the design and implementation of computer systems that act as if they knew about biomedicine. Such systems depend on formally represented knowledge in computer systems, often in the form of knowledge graphs. Here we survey the progress in the last year in systems that use formally represented knowledge to address data science problems in both clinical and biological domains, as well as on approaches for creating knowledge graphs. Major themes include the relationships between knowledge graphs and machine learning, the use of natural language processing, and the expansion of knowledge-based approaches to novel domains, such as Chinese Traditional Medicine and biodiversity. Comment: Manuscript 43 pages with 3 tables; Supplemental material 43 pages with 3 tables

arXiv.org e-Print Archive

Constructing a biodiversity terminological inventory

Author: A Cockburn
A Henriksson
Axel J. Soto
B Boyle
C Carpineto
CD Manning
CS Parr
CW Dunnett
D Koning
D Patterson
E Pafilis
EV Berghe
G Miller
Georgios Kontonatsios
GF Guala
GH Golub
J Bobadilla
J Mitchell
JZ Wang
K Erk
K Frantzi
LM Akella
M Ashburner
M Batet
M Gerner
M Strube
N Gwinn
N Naderi
Nhung T. H. Nguyen
O Bodenreider
O Levy
P Thompson
PD Cantino
PD Turney
PR Leary
R Pivovarov
Riza Batista-Navarro
RL Pyle
Robert Guralnick
S Clark
S Harispe
Sophia Ananiadou
T Mikolov
T Pedersen
T Rees
WE Winkler
WN Lee
WW Cohen
X Wang
Y Bengio
Y Roskov
Y Sasaki
Y Tsuruoka
Publication venue: Public Library of Science (PLoS)
Publication date: 01/04/2017

The increasing growth of literature in biodiversity presents challenges to users who need to discover pertinent information in an efficient and timely manner. In response, text mining techniques offer solutions by facilitating the automated discovery of knowledge from large textual data. An important step in text mining is the recognition of concepts via their linguistic realisation, i.e., terms. However, a given concept may be referred to in text using various synonyms or term variants, making search systems likely to overlook documents mentioning lesser-known variants that are nonetheless relevant to a query term. Domain-specific terminological resources, which include term variants, synonyms and related terms, are thus important in supporting semantic search over large textual archives. This article describes the use of text mining methods for the automatic construction of a large-scale biodiversity term inventory. The inventory consists of names of species, amongst which naming variations are prevalent. We apply a number of distributional semantic techniques on all of the titles in the Biodiversity Heritage Library, to compute semantic similarity between species names and support the automated construction of the resource. With the construction of our biodiversity term inventory, we demonstrate that distributional semantic models are able to identify semantically similar names that are not yet recorded in existing taxonomies. Such methods can thus be used to update existing taxonomies semi-automatically by deriving semantically related taxonomic names from a text corpus and allowing expert curators to validate them. We also evaluate our inventory as a means to improve search by facilitating automatic query expansion. Specifically, we developed a visual search interface that suggests semantically related species names, which are available in our inventory but not always in other repositories, to incorporate into the search query.
An assessment of the interface by domain experts reveals that our query expansion based on related names is useful for increasing the number of relevant documents retrieved. Its exploitation can benefit both users and developers of search engines and text mining applications.

Crossref

Directory of Open Access Journals

Edge Hill University Research Information Repository

The University of Manchester - Institutional Repository

FigShare

Scientific Literacy in the digital age: tools, environments and resources for co-inquiry

Author: Achmetova Almira
Baildinova Klara
Kozhanova Saule
Kuanysheva Anar
Maukayeva Saule
Nuralinova Gulnar
Orazalina Ainash
Tusupova Karlygash
Zhanaspayev Marat
Zhunusova Aigul
Publication date: 01/01/2013

This paper describes some European and international projects to promote Scientific Literacy in the digital age as well as technologies, environments and resources for co-inquiry. The aim of this research is also to describe computer applications, software tools and environments that were designed to support processes of collaborative inquiry learning to promote Scientific Literacy. These tools are analyzed by describing their interfaces and functionalities. The outcomes of this descriptive research point out some effects on student learning and the competences developed, as known from the literature. This paper argues for the importance of promoting scientific citizenship not only through schools and universities (formal learning), but also through non-credit online courses and community-based learning programmes (non-formal context), as well as through daily life activities and open educational digital materials shared through social networks (informal scenario).

Open Research Online (The Open University)

European Scientific Journal, ESJ

European Scientific Journal (European Scientific Institute)

Simple identification tools in FishBase

Author: Atanacio Rachek
Bailly Nicolas
Froese Rainer
Reyes Jr. Rodolfo
Publication venue: EUT - Edizioni Università di Trieste
Publication date: 01/01/2010

Simple identification tools for fish species were included in the FishBase information system from its inception. Early tools made use of the relational model and characters like fin ray meristics. Soon pictures and drawings were added as a further help, similar to a field guide. Later came the computerization of existing dichotomous keys, again in combination with pictures and other information, and the ability to restrict possible species by country, area, or taxonomic group. Today, www.FishBase.org offers four different ways to identify species. This paper describes these tools with their advantages and disadvantages, and suggests various options for further development. It explores the possibility of a holistic and integrated computer-aided strategy.

OceanRep

OpenstarTs

Training and hackathon on building biodiversity knowledge graphs

Author: Baskauf Steven J.
Compson Zacchaeus
Lujan-Toro Beatriz
Macklin James
Page Roderic D.M.
Pender Jocelyn
Sachs Joel
Publication venue: Pensoft Publishers
Publication date: 01/01/2019

Knowledge graphs have the potential to unite disconnected digitized biodiversity data, and there are a number of efforts underway to build biodiversity knowledge graphs. More generally, the recent popularity of knowledge graphs, driven in part by the advent and success of the Google Knowledge Graph, has breathed life into the ongoing development of semantic web infrastructure and prototypes in the biodiversity informatics community. We describe a one-week training event and hackathon that focused on applying three specific knowledge graph technologies (the Neptune graph database, Metaphactory, and Wikidata) to a diverse set of biodiversity use cases. We give an overview of the training, the projects that were advanced throughout the week, and the critical discussions that emerged. We believe that the main barriers towards adoption of biodiversity knowledge graphs are the lack of understanding of knowledge graphs and the lack of adoption of shared unique identifiers. Furthermore, we believe an important advancement in the outlook of knowledge graph development is the emergence of Wikidata as an identifier broker and as a scoping tool. To remedy the current barriers towards biodiversity knowledge graph development, we recommend continued discussions at workshops and at conferences, which we expect to increase awareness and adoption of knowledge graph technologies.

ZENODO

Enlighten

ARPHA OAI-PMH Endpoint

ARPHA Preprints

Liberating host-virus knowledge from biological dark data

Author: Agosti D
Bastos-Silveira C
Bertolino S
Franz NM
Groom QJ
Guidoti M
Paul D
Penev L
Poelen JH
Reeder DM
Sen A
Simmons NB
Sterner B
Upham NS
Vanhove MPM
Publication venue: Elsevier BV
Publication date: 01/01/2021

Institutional Research Information System University of Turin