Adaptive smoothing as inference strategy: More specificity for unequally sized or neighboring regions
Although spatial smoothing of fMRI data can serve multiple purposes, increasing
the sensitivity of activation detection is probably its greatest benefit.
However, this increased detection power comes with a loss of specificity when
non-adaptive smoothing (i.e. the standard in most software packages) is used.
Simulation studies and analysis of experimental data were performed using the
R packages neuRosim and fmri. In these studies, we
systematically investigated the effect of spatial smoothing on the power and
number of false positives in two particular cases that are often encountered in
fMRI research: (1) Single condition activation detection for regions that differ
in size, and (2) multiple condition activation detection for neighbouring
regions. Our results demonstrate that adaptive smoothing is superior in both
cases because fewer false positives are introduced by the spatial smoothing
process compared to standard Gaussian smoothing or FDR inference of unsmoothed
data.
Darth Fader: Using wavelets to obtain accurate redshifts of spectra at very low signal-to-noise
We present the DARTH FADER algorithm, a new wavelet-based method for
estimating redshifts of galaxy spectra in spectral surveys that is particularly
adept in the very low SNR regime. We use a standard cross-correlation method to
estimate the redshifts of galaxies, with a template set built by applying PCA
to a set of simulated, noise-free spectra. Darth Fader employs wavelet
filtering both to estimate the continuum and to extract prominent line features
in each galaxy spectrum. A simple selection criterion based on the number of
features present in the spectrum is then used to clean the catalogue: galaxies
with fewer than six total features are removed as we are unlikely to obtain a
reliable redshift estimate. Applying our wavelet-based cleaning algorithm to a
simulated testing set, we successfully build a clean catalogue including
extremely low signal-to-noise data (SNR=2.0), for which we are able to obtain a
5.1% catastrophic failure rate in the redshift estimates (compared with 34.5%
prior to cleaning). We also show that for a catalogue with uniformly mixed SNRs
between 1.0 and 20.0, with realistic pixel-dependent noise, it is possible to
obtain redshifts with a catastrophic failure rate of 3.3% after cleaning (as
compared to 22.7% before cleaning). Whilst we do not test this algorithm
exhaustively on real data, we present a proof of concept of the applicability
of this method to real data, showing that the wavelet filtering techniques
perform well when applied to some typical spectra from the SDSS archive. The
Darth Fader algorithm provides a robust method for extracting spectral features
from very noisy spectra. The resulting clean catalogue gives an extremely low
rate of catastrophic failures, even when the spectra have a very low SNR. For
very large sky surveys, this technique may offer a significant boost in the
number of faint galaxies with accurately determined redshifts.
Comment: 22 pages, 15 figures. Accepted for publication in Astronomy & Astrophysics.
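The cleaning step described in this abstract can be illustrated with a minimal Python sketch. Only the six-feature cut comes from the abstract. The `Galaxy` record, the tolerance `CATASTROPHIC_THRESHOLD`, the definition of a catastrophic failure as an absolute redshift error above that tolerance, and the toy catalogue are all assumptions made for illustration, not the authors' implementation.

```python
from dataclasses import dataclass

# Only the six-feature cut is taken from the abstract.
MIN_FEATURES = 6
# Assumed tolerance on |z_est - z_true|; the abstract does not state one.
CATASTROPHIC_THRESHOLD = 0.005


@dataclass
class Galaxy:
    z_true: float    # redshift used to simulate the spectrum
    z_est: float     # redshift returned by the cross-correlation step
    n_features: int  # line features retained after wavelet filtering


def clean_catalogue(galaxies):
    """Keep galaxies with at least MIN_FEATURES detected features."""
    return [g for g in galaxies if g.n_features >= MIN_FEATURES]


def catastrophic_failure_rate(galaxies):
    """Fraction of galaxies whose estimate misses by more than the tolerance."""
    if not galaxies:
        return 0.0
    failures = sum(
        abs(g.z_est - g.z_true) > CATASTROPHIC_THRESHOLD for g in galaxies
    )
    return failures / len(galaxies)


# Toy catalogue with invented values.
catalogue = [
    Galaxy(0.50, 0.501, 9),
    Galaxy(0.80, 0.310, 3),
    Galaxy(1.20, 1.199, 7),
    Galaxy(0.30, 0.950, 6),
    Galaxy(0.65, 0.120, 2),
]
rate_before = catastrophic_failure_rate(catalogue)
rate_after = catastrophic_failure_rate(clean_catalogue(catalogue))
print(f"before: {rate_before:.2f}, after: {rate_after:.2f}")
```

In this toy catalogue the cut removes two of the three failures but keeps one, which mirrors the abstract's point that cleaning lowers the failure rate without eliminating it.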
An atlas of cortical circular RNA expression in Alzheimer disease brains demonstrates clinical and pathological associations.
Parietal cortex RNA-sequencing (RNA-seq) data were generated from individuals with and without Alzheimer disease (AD; n_control = 13; n_AD = 83) from the Knight Alzheimer Disease Research Center (Knight ADRC). Using this and an independent (Mount Sinai Brain Bank (MSBB)) AD RNA-seq dataset, cortical circular RNA (circRNA) expression was quantified in the context of AD. Significant associations were identified between circRNA expression and AD diagnosis, clinical dementia severity and neuropathological severity. It was demonstrated that most circRNA-AD associations are independent of changes in cognate linear messenger RNA expression or estimated brain cell-type proportions. Evidence was provided for circRNA expression changes occurring early in presymptomatic AD and in autosomal dominant AD. It was also observed that AD-associated circRNAs co-expressed with known AD genes. Finally, potential microRNA-binding sites were identified in AD-associated circRNAs for miRNAs predicted to target AD genes. Together, these results highlight the importance of analyzing non-linear RNAs and support future studies exploring the potential roles of circRNAs in AD pathogenesis.
Methods for Joint Normalization and Comparison of Hi-C data
The development of chromatin conformation capture technology has opened new avenues of study into the 3D structure and function of the genome. Chromatin structure is known to influence gene regulation, and differences in structure are now emerging as a mechanism of regulation between, e.g., cell differentiation and disease vs. normal states. Hi-C sequencing technology now provides a way to study the 3D interactions of the chromatin over the whole genome. However, like all sequencing technologies, Hi-C suffers from several forms of bias stemming from both the technology and the DNA sequence itself. Several normalization methods have been developed for normalizing individual Hi-C datasets, but little work has been done on developing joint normalization methods for comparing two or more Hi-C datasets. To make full use of Hi-C data, joint normalization and statistical comparison techniques are needed to carry out experiments to identify regions where chromatin structure differs between conditions.
We develop methods for the joint normalization and comparison of two Hi-C datasets, which we then extend to more complex experimental designs. Our normalization method is novel in that it makes use of the distance-dependent nature of chromatin interactions. Our modification of the Minus vs. Average (MA) plot to the Minus vs. Distance (MD) plot allows for a nonparametric data-driven normalization technique using loess smoothing. Additionally, we present a simple statistical method using Z-scores for detecting differentially interacting regions between two datasets. Our initial method was published as the Bioconductor R package HiCcompare [http://bioconductor.org/packages/HiCcompare/](http://bioconductor.org/packages/HiCcompare/).
We then further extended our normalization and comparison method for use in complex Hi-C experiments with more than two datasets and optional covariates. We extended the normalization method to jointly normalize any number of Hi-C datasets by using a cyclic loess procedure on the MD plot. The cyclic loess normalization technique can remove between-dataset biases efficiently and effectively even when several datasets are analyzed at one time. Our comparison method implements a generalized linear model-based approach for comparing complex Hi-C experiments, which may have more than two groups and additional covariates. The extended methods are also available as a Bioconductor R package [http://bioconductor.org/packages/multiHiCcompare/](http://bioconductor.org/packages/multiHiCcompare/). Finally, we demonstrate the use of HiCcompare and multiHiCcompare in several test cases on real data in addition to comparing them to other similar methods (https://doi.org/10.1002/cpbi.76).
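The two-dataset MD-plot idea in this abstract can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the HiCcompare implementation: M is assumed to be the log2 ratio of interaction frequencies (IF), D the distance in bins, and a per-distance mean of M is swapped in for the loess fit to keep the example dependency-free. The function names and toy numbers are hypothetical.

```python
import math
from collections import defaultdict
from statistics import mean, pstdev


def md_values(interactions):
    """Return (D, M) pairs, assuming M = log2(IF2 / IF1) and D = |j - i|."""
    return [(abs(j - i), math.log2(if2 / if1)) for i, j, if1, if2 in interactions]


def normalise_m(md):
    """Subtract the per-distance mean of M (a crude stand-in for a loess fit)."""
    by_distance = defaultdict(list)
    for d, m in md:
        by_distance[d].append(m)
    trend = {d: mean(ms) for d, ms in by_distance.items()}
    return [(d, m - trend[d]) for d, m in md]


def z_scores(md):
    """Z-score of each adjusted M value against all adjusted M values."""
    ms = [m for _, m in md]
    mu, sigma = mean(ms), pstdev(ms)
    return [(m - mu) / sigma if sigma > 0 else 0.0 for m in ms]


# (bin_i, bin_j, IF in dataset 1, IF in dataset 2); invented toy numbers.
interactions = [
    (0, 1, 100.0, 200.0),
    (1, 2, 80.0, 160.0),
    (2, 3, 90.0, 720.0),
    (0, 2, 40.0, 40.0),
    (1, 3, 50.0, 50.0),
]
adjusted = normalise_m(md_values(interactions))
scores = z_scores(adjusted)
```

After the distance trend is removed, the pair (2, 3) has the largest Z-score, so it would be the first candidate for a differentially interacting region. A smooth fit such as loess would borrow strength across neighbouring distances, which the per-distance mean used here does not.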
Semiparametric Estimation of Task-Based Dynamic Functional Connectivity on the Population Level
Dynamic functional connectivity (dFC) estimates time-dependent associations between pairs of brain region time series as typically acquired during functional MRI. dFC changes are most commonly quantified by pairwise correlation coefficients between the time series within a sliding window. Here, we applied a recently developed bootstrap-based technique (Kudela et al., 2017) to robustly estimate subject-level dFC and its confidence intervals in a task-based fMRI study (24 subjects who tasted their most frequently consumed beer and Gatorade as an appetitive control). We then combined information across subjects and scans utilizing semiparametric mixed models to obtain a group-level dFC estimate for each pair of brain regions, flavor, and the difference between flavors. The proposed approach relies on the estimated group-level dFC accounting for complex correlation structures of the fMRI data, multiple repeated observations per subject, experimental design, and subject-specific variability. It also provides condition-specific dFC and confidence intervals for the whole brain at the group level. As a summary dFC metric, we used the proportion of time when the estimated associations were either significantly positive or negative. For both flavors, our fully data-driven approach yielded regional associations that reflected known, biologically meaningful brain organization as shown in prior work, and that closely resembled resting-state networks (RSNs). Specifically, beer flavor-potentiated associations were detected between several reward-related regions, including the right ventral striatum (VST), lateral orbitofrontal cortex, and ventral anterior insular cortex (vAIC). The enhancement of right VST-vAIC association by a taste of beer independently validated the main activation-based finding (Oberlin et al., 2016). Most notably, our novel dFC methodology uncovered numerous associations undetected by the traditional static FC analysis.
The data-driven, novel dFC methodology presented here can be used for a wide range of task-based fMRI designs to estimate the dFC at multiple levels (group-, individual-, and task-specific), utilizing a combination of well-established statistical methods.
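The sliding-window correlation and the summary metric mentioned in this abstract can be sketched as follows. This shows only the common baseline the abstract refers to, not the bootstrap or semiparametric mixed-model steps of the study. The function names, the window width, and the toy series are assumptions, and the confidence bounds passed to `proportion_significant` are taken as given rather than estimated.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if undefined)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / sqrt(sxx * syy)


def sliding_window_correlation(x, y, width):
    """Correlation of x and y inside each window of the given width."""
    return [
        pearson(x[s:s + width], y[s:s + width])
        for s in range(len(x) - width + 1)
    ]


def proportion_significant(lower, upper):
    """Share of time points whose confidence interval excludes zero."""
    flags = [lo > 0 or hi < 0 for lo, hi in zip(lower, upper)]
    return sum(flags) / len(flags)


# Two toy regional time series with invented values.
x = [1, 2, 3, 4, 5, 4, 3, 2, 1]
y = [2, 4, 6, 8, 10, 1, 2, 3, 4]
dfc = sliding_window_correlation(x, y, width=4)
```

Here the association moves from perfectly positive in the first window to perfectly negative in the last, which is the kind of change a single static correlation over the whole series would average away.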
Epigenetics of complex traits and diseases
Thousands of genetic and epigenetic variants have been identified for many common diseases including cancer through genome-wide association studies (GWAS) and epigenome-wide association studies (EWAS). To advance the complex interpretation of both GWAS and EWAS results, I developed new software tools (FORGE2 and eFORGE) for the analysis and interpretation of GWAS and EWAS data, respectively. Both tools determine the cell type-specific regulatory component of a set of target regions (either GWAS-identified genetic variants or EWAS-identified differentially methylated positions). This is achieved by detecting enrichment of overlap with histone mark peaks or DNase I hypersensitive sites across hundreds of tissues, primary cell types, and cell lines from the ENCODE, Roadmap Epigenomics, and BLUEPRINT projects. Application of both tools to publicly available datasets identified novel disease-relevant cell types for many common diseases, a stem cell-like signature in cancer EWAS, and also demonstrated the ability to detect cell-composition effects for EWAS performed on heterogeneous tissues. To complement these bioinformatics efforts and validate selected variants predicted by FORGE2, eFORGE and additional analyses, I performed conformation capture using 4C-seq to fine-map the 3D context of the genomic regions involved, uncovering novel interactions for autoimmunity-associated variants and IKZF3.
Comparison of Gene Expression and Genome-Wide DNA Methylation Profiling between Phenotypically Normal Cloned Pigs and Conventionally Bred Controls
Animal breeding via somatic cell nuclear transfer (SCNT) has enormous potential in agriculture and biomedicine. However, concerns have been raised about whether SCNT animals are as healthy or epigenetically normal as conventionally bred ones, because the efficiency of cloning by SCNT is much lower than that of natural breeding or in vitro fertilization (IVF). We therefore conducted genome-wide gene expression and DNA methylation profiling of phenotypically normal cloned pigs and control pigs in two tissues (muscle and liver), using the Affymetrix Porcine expression array as well as modified methylation-specific digital karyotyping (MMSDK) and Solexa sequencing technology. Typical tissue-specific differences with respect to both gene expression and DNA methylation were observed in muscle and liver from cloned as well as control pigs. Gene expression profiles were highly similar between cloned pigs and controls, though a small set of genes showed altered expression. Cloned pigs showed a more divergent pattern of DNA methylation in unique sequences in both tissues. In particular, a small set of genomic sites had different DNA methylation status, with a trend towards slightly increased methylation levels in cloned pigs. Molecular network analysis of the genes that contained such differentially methylated loci revealed a significant network related to tissue development. In conclusion, our study showed that phenotypically normal cloned pigs were highly similar to conventionally bred pigs in gene expression, but moderate alterations in DNA methylation still exist, especially in certain unique genomic regions.
Multi-omics of the gut microbial ecosystem in inflammatory bowel diseases.
Inflammatory bowel diseases, which include Crohn's disease and ulcerative colitis, affect several million individuals worldwide. Crohn's disease and ulcerative colitis are complex diseases that are heterogeneous at the clinical, immunological, molecular, genetic, and microbial levels. Individual contributing factors have been the focus of extensive research. As part of the Integrative Human Microbiome Project (HMP2 or iHMP), we followed 132 subjects for one year each to generate integrated longitudinal molecular profiles of host and microbial activity during disease (up to 24 time points each; in total 2,965 stool, biopsy, and blood specimens). Here we present the results, which provide a comprehensive view of functional dysbiosis in the gut microbiome during inflammatory bowel disease activity. We demonstrate a characteristic increase in facultative anaerobes at the expense of obligate anaerobes, as well as molecular disruptions in microbial transcription (for example, among clostridia), metabolite pools (acylcarnitines, bile acids, and short-chain fatty acids), and levels of antibodies in host serum. Periods of disease activity were also marked by increases in temporal variability, with characteristic taxonomic, functional, and biochemical shifts. Finally, integrative analysis identified microbial, biochemical, and host factors central to this dysregulation. The study's infrastructure resources, results, and data, which are available through the Inflammatory Bowel Disease Multi'omics Database (http://ibdmdb.org), provide the most comprehensive description to date of host and microbial activities in inflammatory bowel diseases.
Reconfiguration of dominant coupling modes in mild traumatic brain injury mediated by δ-band activity: a resting state MEG study
During the last few years, rich-club (RC) organization has been studied as a possible brain-connectivity organization model for large-scale brain networks. At the same time, empirical and simulated data of neurophysiological models have demonstrated the significant role of intra-frequency and inter-frequency coupling among distinct brain areas. The current study investigates further the importance of these couplings using recordings of resting-state magnetoencephalographic activity obtained from 30 mild traumatic brain injury (mTBI) subjects and 50 healthy controls. Intra-frequency and inter-frequency coupling modes are incorporated in a single graph to detect group differences within individual rich-club subnetworks (type I networks) and networks connecting RC nodes with the rest of the nodes (type II networks). Our results show a higher probability of inter-frequency coupling for (δ–γ1), (δ–γ2), (θ–β), (θ–γ2), (α–γ2), (γ1–γ2) and intra-frequency coupling for (γ1–γ1) and (δ–δ) for both type I and type II networks in the mTBI group. Additionally, mTBI and control subjects can be correctly classified with high accuracy (98.6%), whereas a general linear regression model can effectively predict the subject group using the ratio of type I and type II coupling in the (δ, θ), (δ, β), (δ, γ1), and (δ, γ2) frequency pairs. These findings support the presence of an RC organization simultaneously with dominant frequency interactions within a single functional graph. Our results demonstrate a hyperactivation of intrinsic RC networks in mTBI subjects compared to controls, which can be seen as a plausible compensatory mechanism for alternative frequency-dependent routes of information flow in mTBI subjects.