    A hybrid human and machine resource curation pipeline for the Neuroscience Information Framework

    Get PDF
    The breadth of information resources available to researchers on the Internet continues to expand, particularly in light of recently implemented data-sharing policies required by funding agencies. However, the nature of dense, multifaceted neuroscience data and the design of contemporary search engine systems make efficient, reliable and relevant discovery of such information a significant challenge. This challenge is specifically pertinent for online databases, whose dynamic content is ‘hidden’ from search engines. The Neuroscience Information Framework (NIF; http://www.neuinfo.org) was funded by the NIH Blueprint for Neuroscience Research to address the problem of finding and utilizing neuroscience-relevant resources such as software tools, data sets, experimental animals and antibodies across the Internet. From the outset, NIF sought to provide an accounting of available resources while developing technical solutions to finding, accessing and utilizing them. The curators, therefore, are tasked with identifying and registering resources, examining data, writing configuration files to index and display data, and keeping the contents current. In the initial phases of the project, all aspects of the registration and curation processes were manual. However, as the number of resources grew, manual curation became impractical. This report describes our experiences and successes with developing automated resource discovery and semiautomated type characterization with text-mining scripts that facilitate curation team efforts to discover, integrate and display new content. We also describe the DISCO framework, a suite of automated web services that significantly reduces the manual effort of periodically checking for resource updates. Lastly, we discuss DOMEO, a semi-automated annotation tool that improves the discovery and curation of resources that are not necessarily website-based (e.g. reagents, software tools). Although the ultimate goal of automation was to reduce the workload of the curators, it has resulted in valuable analytic by-products that address accessibility, use and citation of resources, and that can now be shared with resource owners and the larger scientific community.
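
    To make the update-checking idea concrete, here is a minimal sketch of the kind of periodic check a service like DISCO might perform; the registry entries, URLs and helper names are illustrative assumptions, not NIF's actual implementation.

```python
# Minimal sketch of an automated resource-update check in the spirit of
# DISCO; registry entries and polling logic are illustrative only.
import hashlib
import urllib.request

REGISTRY = {
    # resource name -> URL to poll (hypothetical entries)
    "example-neuro-db": "https://example.org/resources/neuro-db",
}

def content_fingerprint(url: str) -> str:
    """Fetch a resource page and return a hash of its content."""
    with urllib.request.urlopen(url, timeout=30) as response:
        return hashlib.sha256(response.read()).hexdigest()

def check_for_updates(known: dict[str, str]) -> list[str]:
    """Return the names of resources whose content hash has changed."""
    changed = []
    for name, url in REGISTRY.items():
        fingerprint = content_fingerprint(url)
        if known.get(name) != fingerprint:
            changed.append(name)
            known[name] = fingerprint
    return changed
```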

    An ontological approach to describing neurons and their relationships

    Get PDF
    The advancement of neuroscience, perhaps one of the most information-rich disciplines of all the life sciences, requires basic frameworks for organizing the vast amounts of data generated by the research community to promote novel insights and integrated understanding. Since Cajal, the neuron has remained a fundamental unit of the nervous system, yet even with the explosion of information technology, we still have few comprehensive or systematic strategies for aggregating cell-level knowledge. Progress toward this goal is hampered by the multiplicity of names for cells and by the lack of consensus on the criteria for defining neuron types. However, through umbrella projects like the Neuroscience Information Framework (NIF) and the International Neuroinformatics Coordinating Facility (INCF), we have the opportunity to propose and implement an informatics infrastructure for establishing common tools and approaches to describe neurons: a standard terminology for nerve cells and a database (a Neuron Registry) where these descriptions can be deposited and compared. This article provides an overview of the problem and outlines a solution approach utilizing ontological characterizations. Based on illustrative implementation examples, we also discuss the need for consensus criteria to be adopted by the research community, and considerations for future development. A scalable repository of neuron types will provide researchers with a resource that materially contributes to the advancement of neuroscience.
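
    As a toy illustration of the ontological approach, the sketch below describes neuron types as property-value pairs so that two descriptions deposited in a registry can be compared feature by feature; the property names and values are invented for illustration, not the Neuron Registry's actual schema.

```python
# Sketch: a neuron type described as property-value pairs, comparable
# feature by feature. Property names are illustrative assumptions.
from dataclasses import dataclass, field

@dataclass
class NeuronDescription:
    name: str
    properties: dict[str, str] = field(default_factory=dict)

    def shared_properties(self, other: "NeuronDescription") -> dict[str, str]:
        """Properties on which the two descriptions agree."""
        return {
            key: value
            for key, value in self.properties.items()
            if other.properties.get(key) == value
        }

basket = NeuronDescription("basket cell", {
    "soma_location": "hippocampus CA1",
    "neurotransmitter": "GABA",
    "axon_target": "pyramidal cell soma",
})
chandelier = NeuronDescription("chandelier cell", {
    "soma_location": "hippocampus CA1",
    "neurotransmitter": "GABA",
    "axon_target": "pyramidal cell axon initial segment",
})
print(basket.shared_properties(chandelier))
# {'soma_location': 'hippocampus CA1', 'neurotransmitter': 'GABA'}
```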

    The Scalable Brain Atlas: instant web-based access to public brain atlases and related content

    Get PDF
    The Scalable Brain Atlas (SBA) is a collection of web services that provide unified access to a large collection of brain atlas templates for different species. Its main component is an atlas viewer that displays brain atlas data as a stack of slices in which stereotaxic coordinates and brain regions can be selected. These are subsequently used to launch web queries to resources that require coordinates or region names as input. It supports plugins which run inside the viewer and respond when a new slice, coordinate or region is selected. It contains 20 atlas templates in six species, and plugins to compute coordinate transformations, display anatomical connectivity and fiducial points, and retrieve properties, descriptions, definitions and 3D reconstructions of brain regions. The ambition of SBA is to provide a unified representation of all publicly available brain atlases directly in the web browser, while remaining a responsive and lightweight resource that specializes in atlas comparisons, searches, coordinate transformations and interactive displays. Comment: Rolf Kötter sadly passed away on June 9th, 2010. He co-initiated this project and played a crucial role in the design and quality assurance of the Scalable Brain Atlas.
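
    The following sketch shows the general shape of a region-based web query like those the SBA viewer launches when a region is selected; the service URL and parameter names are placeholders, not actual SBA endpoints.

```python
# Sketch of launching a region-based web query as the SBA viewer does;
# the service URL and parameters below are hypothetical placeholders.
import json
import urllib.parse
import urllib.request

def query_region(service_base: str, template: str, region: str) -> dict:
    """Ask an atlas web service for data about a named brain region."""
    params = urllib.parse.urlencode({"template": template, "region": region})
    with urllib.request.urlopen(f"{service_base}?{params}", timeout=30) as resp:
        return json.load(resp)

# Hypothetical usage: retrieve properties of a region in a given template.
# info = query_region("https://example.org/sba/regioninfo", "macaque-F99", "V1")
```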

    Neuroanatomical Domain of the Foundational Model of Anatomy Ontology

    Get PDF
    Background: The diverse set of human brain structure and function analysis methods represents a difficult challenge for reconciling multiple views of neuroanatomical organization. While different views of organization are expected and valid, no widely adopted approach exists to harmonize different brain labeling protocols and terminologies. Our approach uses the natural organizing framework provided by anatomical structure to correlate terminologies commonly used in neuroimaging. Description: The Foundational Model of Anatomy (FMA) Ontology provides a semantic framework for representing the anatomical entities and relationships that constitute the phenotypic organization of the human body. In this paper we describe recent enhancements to the neuroanatomical content of the FMA that model cytoarchitectural and morphological regions of the cerebral cortex, as well as white matter structure and connectivity. This modeling effort is driven by the need to correlate and reconcile the terms used in neuroanatomical labeling protocols. By providing an ontological framework that harmonizes multiple views of neuroanatomical organization, the FMA provides developers with reusable and computable knowledge for a range of biomedical applications. Conclusions: A requirement for facilitating the integration of basic and clinical neuroscience data from diverse sources is a well-structured ontology that can incorporate, organize, and associate neuroanatomical data. We applied the ontological framework of the FMA to align the vocabularies used by several human brain atlases, and to encode emerging knowledge about structural connectivity in the brain. We highlighted several use cases of these extensions, including ontology reuse, neuroimaging data annotation, and organizing 3D brain models.
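
    A tiny hand-made example of the computable knowledge such an ontology provides: a toy partonomy (not actual FMA content) traversed to enumerate all parts of a structure.

```python
# Toy partonomy traversal; entries are illustrative, not FMA content.
PART_OF = {
    # child structure -> parent structure
    "cerebral cortex": "brain",
    "frontal lobe": "cerebral cortex",
    "precentral gyrus": "frontal lobe",
    "white matter": "brain",
}

def parts_of(structure: str) -> list[str]:
    """Return every structure recorded as part of `structure`, recursively."""
    direct = [child for child, parent in PART_OF.items() if parent == structure]
    result = []
    for child in direct:
        result.append(child)
        result.extend(parts_of(child))
    return result

print(parts_of("brain"))
# ['cerebral cortex', 'frontal lobe', 'precentral gyrus', 'white matter']
```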

    Towards structured sharing of raw and derived neuroimaging data across existing resources

    Full text link
    Data sharing efforts increasingly contribute to the acceleration of scientific discovery. Neuroimaging data is accumulating in distributed domain-specific databases, and there is currently no integrated access mechanism nor an accepted format for the critically important meta-data necessary for making use of the combined, available neuroimaging data. In this manuscript, we present work from the Derived Data Working Group, an open-access group sponsored by the Biomedical Informatics Research Network (BIRN) and the International Neuroinformatics Coordinating Facility (INCF) focused on practical tools for distributed access to neuroimaging data. The working group develops models and tools facilitating the structured interchange of neuroimaging meta-data and is making progress towards a unified set of tools for such data and meta-data exchange. We report on the key components required for integrated access to raw and derived neuroimaging data, as well as associated meta-data and provenance, across neuroimaging resources. The components include (1) a structured terminology that provides semantic context to data, (2) a formal data model for neuroimaging with robust tracking of data provenance, (3) a web service-based application programming interface (API) that provides a consistent mechanism to access and query the data model, and (4) a provenance library that can be used for the extraction of provenance data by image analysts and imaging software developers. We believe that the framework and set of tools outlined in this manuscript have great potential for solving many of the issues the neuroimaging community faces when sharing raw and derived neuroimaging data across the various existing database systems, for the purpose of accelerating scientific discovery.
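
    The sketch below illustrates the provenance-tracking idea behind components (2) and (4): each derived image records its inputs, the software that produced it, and the parameters used. The field names and tool name are illustrative assumptions, not the working group's actual data model.

```python
# Sketch of a derived-data provenance record; field names are
# illustrative assumptions, not the working group's actual model.
from dataclasses import dataclass, field

@dataclass
class ProvenanceRecord:
    output_file: str
    software: str          # tool that produced the output
    version: str           # tool version, for reproducibility
    inputs: list[str] = field(default_factory=list)
    parameters: dict[str, str] = field(default_factory=dict)

record = ProvenanceRecord(
    output_file="sub-01_desc-brain_mask.nii.gz",
    software="example-skullstrip",   # hypothetical tool name
    version="1.2.0",
    inputs=["sub-01_T1w.nii.gz"],
    parameters={"threshold": "0.5"},
)
```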

    PyXNAT: XNAT in Python

    Get PDF
    As neuroimaging databases grow in size and complexity, the time researchers spend investigating and managing the data increases at the expense of data analysis. As a result, investigators rely more and more heavily on scripting in high-level languages to automate data management and processing tasks. For this, structured and programmatic access to the data store is necessary. Web services are a first step toward this goal; however, they lack functionality and ease of use because they provide only low-level interfaces to databases. We introduce here PyXNAT, a Python module that interacts with the Extensible Neuroimaging Archive Toolkit (XNAT) through native Python calls across multiple operating systems. The choice of Python enables PyXNAT to expose the XNAT Web Services and unify their features with a higher-level and more expressive language. PyXNAT provides XNAT users direct access to all the scientific packages in Python. Finally, PyXNAT aims to be efficient and easy to use, both as a back-end library to build XNAT clients and as an alternative front-end from the command line.
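
    A minimal usage sketch, assuming access to an XNAT server; the server URL, credentials and project identifier below are placeholders.

```python
# Minimal PyXNAT sketch: connect to an XNAT server and list projects
# and subjects. Server URL and credentials are placeholders.
from pyxnat import Interface

central = Interface(
    server="https://central.xnat.org",  # any XNAT instance
    user="username",
    password="password",
)

# Enumerate accessible projects, then subjects within one of them.
for project_id in central.select.projects().get():
    print(project_id)

subjects = central.select.project("MyProject").subjects().get()
```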

    Agile in-litero experiments: how can semi-automated information extraction from neuroscientific literature help neuroscience model building?

    Get PDF
    In neuroscience, as in many other scientific domains, the primary form of knowledge dissemination is through published articles in peer-reviewed journals. One challenge for modern neuroinformatics is to design methods to make the knowledge from the tremendous backlog of publications accessible for search, analysis and integration into computational models. In this thesis, we introduce novel natural language processing (NLP) models and systems to mine the neuroscientific literature. By analogy with in vivo, in vitro and in silico experiments, we coin the term in litero experiments for the NLP methods developed in this thesis, which aim at analyzing and making accessible the extended body of neuroscientific literature. In particular, we focus on two important neuroscientific entities: brain regions and neural cells. An integrated NLP model is designed to automatically extract brain region connectivity statements from very large corpora. This system is applied to a large corpus of 25M PubMed abstracts and 600K full-text articles. Central to this system is the creation of a searchable database of brain region connectivity statements, allowing neuroscientists to gain an overview of all brain regions connected to a given region of interest. More importantly, the database enables researchers to provide feedback on connectivity results and links back to the original article sentence to provide the relevant context. The database is evaluated by neuroanatomists on real connectomics tasks (targets of the Nucleus Accumbens) and results in significant effort reduction in comparison to previous manual methods (from 1 week to 2 h). Subsequently, we introduce neuroNER to identify, normalize and compare instances of neurons in the scientific literature. Our method relies on identifying and analyzing each of the domain features used to annotate a specific neuron mention, like the morphological term 'basket' or the brain region 'hippocampus'. We apply our method to the same corpus of 25M PubMed abstracts and 600K full-text articles and find over 500K unique neuron type mentions. To demonstrate the utility of our approach, we also apply our method towards cross-comparing the NeuroLex and Human Brain Project (HBP) cell type ontologies. By decoupling a neuron mention's identity into its specific compositional features, our method can successfully identify specific neuron types even if they are not explicitly listed within a predefined neuron type lexicon, thus greatly facilitating cross-laboratory studies. In order to build such large databases, several large-scale NLP tools and infrastructures were developed: a robust pipeline to preprocess full-text PDF articles, as well as bluima, an NLP processing pipeline specialized in neuroscience to perform text mining at PubMed scale. During the development of those two NLP systems, we acknowledged the need for novel NLP approaches to rapidly develop custom text mining solutions. This led to the formalization of the agile text-mining methodology to improve the communication and collaboration between subject matter experts and text miners. Agile text mining is characterized by short development cycles, frequent task redefinition and continuous performance monitoring through integration tests. To support our approach, we developed Sherlok, an NLP framework designed for the development of agile text mining applications.
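
    To illustrate the compositional idea behind neuroNER, the sketch below decomposes a neuron mention into domain features by dictionary lookup; the tiny term lists are invented for illustration, not the system's actual lexicons.

```python
# Sketch of compositional neuron-mention analysis: map each token of a
# mention to a feature class. Term lists are illustrative only.
FEATURE_LEXICONS = {
    "morphology": {"basket", "pyramidal", "chandelier", "stellate"},
    "brain_region": {"hippocampus", "neocortex", "striatum", "CA1"},
    "neurotransmitter": {"GABAergic", "glutamatergic", "cholinergic"},
}

def decompose(mention: str) -> dict[str, list[str]]:
    """Map each token of a neuron mention to the feature class it matches."""
    features: dict[str, list[str]] = {}
    for token in mention.split():
        for feature, terms in FEATURE_LEXICONS.items():
            if token in terms:
                features.setdefault(feature, []).append(token)
    return features

print(decompose("GABAergic basket cell of the hippocampus"))
# {'neurotransmitter': ['GABAergic'], 'morphology': ['basket'],
#  'brain_region': ['hippocampus']}
```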