2,226 research outputs found

    Enabling Factor Analysis on Thousand-Subject Neuroimaging Datasets

    Full text link
    The scale of functional magnetic resonance image data is rapidly increasing as large multi-subject datasets are becoming widely available and high-resolution scanners are adopted. The inherent low-dimensionality of the information in this data has led neuroscientists to consider factor analysis methods to extract and analyze the underlying brain activity. In this work, we consider two recent multi-subject factor analysis methods: the Shared Response Model and Hierarchical Topographic Factor Analysis. We perform analytical, algorithmic, and code optimization to enable multi-node parallel implementations to scale. Single-node improvements result in 99x and 1812x speedups on these two methods, and enables the processing of larger datasets. Our distributed implementations show strong scaling of 3.3x and 5.5x respectively with 20 nodes on real datasets. We also demonstrate weak scaling on a synthetic dataset with 1024 subjects, on up to 1024 nodes and 32,768 cores

    Methodological challenges and analytic opportunities for modeling and interpreting Big Healthcare Data

    Full text link
    Abstract Managing, processing and understanding big healthcare data is challenging, costly and demanding. Without a robust fundamental theory for representation, analysis and inference, a roadmap for uniform handling and analyzing of such complex data remains elusive. In this article, we outline various big data challenges, opportunities, modeling methods and software techniques for blending complex healthcare data, advanced analytic tools, and distributed scientific computing. Using imaging, genetic and healthcare data we provide examples of processing heterogeneous datasets using distributed cloud services, automated and semi-automated classification techniques, and open-science protocols. Despite substantial advances, new innovative technologies need to be developed that enhance, scale and optimize the management and processing of large, complex and heterogeneous data. Stakeholder investments in data acquisition, research and development, computational infrastructure and education will be critical to realize the huge potential of big data, to reap the expected information benefits and to build lasting knowledge assets. Multi-faceted proprietary, open-source, and community developments will be essential to enable broad, reliable, sustainable and efficient data-driven discovery and analytics. Big data will affect every sector of the economy and their hallmark will be ‘team science’.http://deepblue.lib.umich.edu/bitstream/2027.42/134522/1/13742_2016_Article_117.pd

    A Study of Data Sharing Practices within Scholarly Research Communities

    Get PDF
    Recent literature acknowledges the importance of data and effective data management strategies to facilitate collaboration between disciplines of research. Likewise, understanding the policies and practices that support data sharing is a growing area of research in the fields of information and social studies of science. Shared data allow researchers to build on fellow researchers’ work to enrich and facilitate advancements in science. While much has been written to identify the elements that adversely affect data sharing in scholarly research, a definitive framework remains unclear. Several theories have been presented to explain this shortfall; however, the reasons are highly diverse. Some suggest the factors that impact data sharing practices include delays in the peer review process, ineffective data management practices, mistrust, financial considerations, and vague data sharing policies and procedures. Those who support data sharing have acknowledged the important role of funding agencies to leverage the sharing of data in scholarly research in return for researcher support. Likewise, advocates suggest that scientific societies should establish data sharing as standard procedure. Respected organizations such as the National Academy of Sciences (NAS), National Science Foundation (NSF), and National Institutes of Health (NIH) are tasked with developing modern strategies to ensure that policies and procedures regarding data management and dissemination meet the evolving needs and computational capabilities of the 21st century. While the NSF has proposed recent, updated regulations to guide the scientific community to adopt a culture that promotes the sharing of research data, literature suggests that regulations have been ineffective in advancing data sharing practices. The purpose of this research study was to review the NSF federal grant application process and its influence on timely data sharing practices. Most importantly, the goal of this study was to identify definitive ways in which the NSF grant application process may be improved to expedite the sharing of research data in the future

    Classification of dementias based on brain radiomics features

    Get PDF
    Dissertação de mestrado integrado em Engenharia InformáticaNeurodegenerative diseases impair the functioning of the brain and are characterized by alterations in the morphology of specific brain regions. Some of the main disorders include Alzheimer's, Parkinson's, and Huntington's diseases, and the number of cases increases exponentially since ageing is one of the main risk factors. Trying to identify the areas in which this type of disease appears is something that can have a very positive impact in this area of Medicine and can guarantee a more appropriate treatment or allow the improvement of the quality of life of patients. With the current technological advances, computer tools are capable of performing a structural or functional analysis of neuroimaging data from Magnetic Resonance Images(MRI). Therefore, Medical Informatics uses these techniques to create and manage medical neuroimaging data to improve the diagnosis and management of these patients. MRI is the image type used in the analysis of the brain area and points to a promising and reliable diagnostic tool since it allows high-quality images in various planes or strategies and MRI methods are fundamental diagnostic tools in clinical practice, allowing the diagnosis of pathologic processes such as stroke or brain tumours. However, structural MRI has limitations for the diagnosis of neurodegenerative disorders since it mainly identifies atrophy of brain regions. Currently, there is increased interest in informatics applications capable of monitoring and quantifying human brain imaging alterations, with potential for neurodegenerative disorders diagnosis and monitoring. One of these applications is Radiomics, which corresponds to a methodolog ythat allows the extraction of features from images of a given region of the brain. Specific quantitative metrics from MRI are acquired by this tool, and they correspond to a set of features, including texture, shape, among others. To standardize Radiomics application, specific libraries have been proposed to be used by the bioinformatics and biomedical communities, such as PyRadiomics, which corresponds to an open source Python package for extracting Radiomics of MRIs. Therefore, this dissertation was developed based on magnetic resonance images and the study of Deep Learning (DL) techniques to assist researchers and neuroradiologists in the diagnosis and prediction of neurodegenerative disease development. Two different main tasks were made: first, a segmentation, using FreeSurfer, of different regions of the brain and then, a model was build from radiomic features extracted from each part of the brain and interpreted for knowledge extraction.As doenças neurodegenerativas estão associadas ao funcionamento do cérebro e caracterizam-se pelo facto de serem altamente incapacitantes. São exemplos destas, as doenças de Alzheimer, Parkinson e Huntington, e o seu número de casos tem vindo a aumentar exponencialmente, uma vez que o envelhecimento é um dos principais factores de risco. Tentar identificar quais são as regiões cerebrais que permitem predizer o seu aparecimento e desenvolvimento é algo que, sendo possível, terá um impacto muito positivo nesta área da Medicina e poderá garantir um tratamento mais adequado, ou simplesmente melhorar a qualidade de vida dos pacientes. Com os avanços tecnológicos atuais, foram desenvolvidas ferramentas informáticas que são capazes de efetuar uma análise estrutural ou funcional de Ressonâncias Magnéticas (MRI), sendo essas ferramentas usadas para promover a melhoria e o conhecimento clínico. Deste modo, as constantes evoluções científicas têm realçado o papel da Informática Médica na neuroimagem para criar e gerenciar dados médicos, melhorando o diagnóstico destes pacientes. A MRI é o tipo de imagem utilizada na análise de regiões cerebrais e aponta para uma ferramenta de diagnóstico promissora e fiável, uma vez que permite obter imagens de alta qualidade em vários planos, permitindo assim, o diagnóstico de processos patológicos, tais como acidentes vasculares ou tumores cerebrais. Atualmente, existem inúmeras aplicações informáticas capazes de efetuar análises estruturais e funcionais do cérebro humano, pois é este o principal órgão afetado pelas doenças neurodegenerativas. Uma dessas aplicações é o Radiomics, que permite fazer a extração de features de imagens do cérebro. A biblioteca a utilizar será PyRadiomics, que corresponde a um package open source em Python para a extração de features Radiomics de imagens médicas. As features correspondem a características da imagem. Assim sendo, a presente dissertação foi desenvolvida com base em imagens de ressonância magnética e no estudo das técnicas de Deep Learning para investigar e auxiliar os médicos neurorradiologistas a diagnosticar e a prever o desenvolvimento de doenças neurodegenerativas. Foram feitas duas principais tarefas: primeiro, uma segmentação, utilizando o software FreeSurfer, de diferentes regiões do cérebro e, de seguida, foi construído um modelo a partir das features radiómicas extraídas de cada parte do cérebro que foi interpretado

    Emerging methods for conceptual modelling in neuroimaging

    Get PDF
    Some open theoretical questions are addressed on how the mind and brain represent and process concepts, particularly as they are instantiated in particular human languages. Recordings of neuroimaging data should provide a suitable empirical basis for investigating this topic, but the complexity and variety of language demands appropriate data-driven approaches. In this review we argue for a particular suite of methodologies, based on multivariate classification techniques which have proven to be powerful tools for distinguishing neural and cognitive states in fMRI. A combination of larger scale neuroimaging studies are introduced with different monolingual and bilingual populations, and hybrid computational analyses that use encoded implementations of existing theories of conceptual organisation to probe those data. We develop a suite of methodologies that holds the promise of being able to holistically elicit, record and model neural processing during language comprehension and production

    Multimodal population brain imaging in the UK Biobank prospective epidemiological study

    Get PDF
    Medical imaging has enormous potential for early disease prediction, but is impeded by the difficulty and expense of acquiring data sets before symptom onset. UK Biobank aims to address this problem directly by acquiring high-quality, consistently acquired imaging data from 100,000 predominantly healthy participants, with health outcomes being tracked over the coming decades. The brain imaging includes structural, diffusion and functional modalities. Along with body and cardiac imaging, genetics, lifestyle measures, biological phenotyping and health records, this imaging is expected to enable discovery of imaging markers of a broad range of diseases at their earliest stages, as well as provide unique insight into disease mechanisms. We describe UK Biobank brain imaging and present results derived from the first 5,000 participants' data release. Although this covers just 5% of the ultimate cohort, it has already yielded a rich range of associations between brain imaging and other measures collected by UK Biobank
    • …
    corecore