61 research outputs found
Where Have All the Interactions Gone? Estimating the Coverage of Two-Hybrid Protein Interaction Maps
Yeast two-hybrid screens are an important method for mapping pairwise physical interactions between proteins. The fraction of interactions detected in independent screens can be very small, and an outstanding challenge is to determine the reason for the low overlap. Low overlap can arise from either a high false-discovery rate (interaction sets have low overlap because each set is contaminated by a large number of stochastic false-positive interactions) or a high false-negative rate (interaction sets have low overlap because each misses many true interactions). We extend capture–recapture theory to provide the first unified model for false-positive and false-negative rates for two-hybrid screens. Analysis of yeast, worm, and fly data indicates that 25% to 45% of the reported interactions are likely false positives. Membrane proteins have higher false-discovery rates on average, and signal transduction proteins have lower rates. The overall false-negative rate ranges from 75% for worm to 90% for fly, which arises from a roughly 50% false-negative rate due to statistical undersampling and a 55% to 85% false-negative rate due to proteins that appear to be systematically lost from the assays. Finally, statistical model selection conclusively rejects the Erdös-Rényi network model in favor of the power law model for yeast and the truncated power law for worm and fly degree distributions. Much as genome sequencing coverage estimates were essential for planning the human genome sequencing project, the coverage estimates developed here will be valuable for guiding future proteomic screens. All software and datasets are available in Datasets S1 and S2, Figures S1–S5, and Tables S1−S6, and are also available from our Web site, http://www.baderzone.org
Learning Nonparametric Ordinary Differential Equations from Noisy Data
Learning nonparametric systems of Ordinary Differential Equations (ODEs) from noisy data is an emerging machine learning topic. We use the well-developed theory of Reproducing Kernel Hilbert Spaces (RKHS) to define candidates for for which the solution of the ODE exists and is unique. Learning consists of solving a constrained optimization problem in an RKHS. We propose a penalty method that iteratively uses the Representer theorem and Euler approximations to provide a numerical solution. We prove a generalization bound for the distance between and its estimator. Experiments are provided for the FitzHugh-Nagumo oscillator, the Lorenz system, and for predicting the Amyloid level in the cortex of aging subjects. In all cases, we show competitive results compared with the state-of-the-art
Learning nonparametric ordinary differential equations from noisy data
Learning nonparametric systems of Ordinary Differential Equations (ODEs) dot
x = f(t,x) from noisy data is an emerging machine learning topic. We use the
well-developed theory of Reproducing Kernel Hilbert Spaces (RKHS) to define
candidates for f for which the solution of the ODE exists and is unique.
Learning f consists of solving a constrained optimization problem in an RKHS.
We propose a penalty method that iteratively uses the Representer theorem and
Euler approximations to provide a numerical solution. We prove a generalization
bound for the L2 distance between x and its estimator and provide experimental
comparisons with the state-of-the-art.Comment: 25 pages, 6 figure
Intensity Inhomogeneity Correction of SD-OCT Data Using Macular Flatspace
Images of the retina acquired using optical coherence tomography (OCT) often suffer from intensity inhomogeneity problems that degrade both the quality of the images and the performance of automated algorithms utilized to measure structural changes. This intensity variation has many causes, including off-axis acquisition, signal attenuation, multi-frame averaging, and vignetting, making it difficult to correct the data in a fundamental way. This paper presents a method for inhomogeneity correction by acting to reduce the variability of intensities within each layer. In particular, the N3 algorithm, which is popular in neuroimage analysis, is adapted to work for OCT data. N3 works by sharpening the intensity histogram, which reduces the variation of intensities within different classes. To apply it here, the data are first converted to a standardized space called macular flat space (MFS). MFS allows the intensities within each layer to be more easily normalized by removing the natural curvature of the retina. N3 is then run on the MFS data using a modified smoothing model, which improves the efficiency of the original algorithm. We show that our method more accurately corrects gain fields on synthetic OCT data when compared to running N3 on non-flattened data. It also reduces the overall variability of the intensities within each layer, without sacrificing contrast between layers, and improves the performance of registration between OCT images
Associations Between Performance on an Abbreviated CogState Battery, Other Measures of Cognitive Function, and Biomarkers in People at Risk for Alzheimer\u27s Disease
It is not known whether computerized cognitive assessments, like the CogState battery, are sensitive to preclinical cognitive changes or pathology in people at risk for Alzheimer\u27s disease(AD). In 469 late middle-aged participants from the Wisconsin Registry for Alzheimer\u27s Prevention(mean age 63.8±7 years at testing; 67% female; 39% APOE4+), we examined relationships between a CogState abbreviated battery(CAB) of seven tests and demographic characteristics, traditional paper-based neuropsychological tests as well as a composite cognitive impairment index, cognitive impairment status(determined by consensus review), and biomarkers for amyloid and tau(CSF phosphorylated-tau/Aβ42 and global PET-PiB burden) and neural injury(CSF neurofilament light protein). CSF and PET-PiB were collected in n = 71 and n = 91 participants, respectively, approximately four years prior to CAB testing. For comparison, we examined three traditional tests of delayed memory in parallel. Similar to studies in older samples, the CAB was less influenced by demographic factors than traditional tests. CAB tests were generally correlated with most paper-based cognitive tests examined and mapped onto the same cognitive domains. Greater composite cognitive impairment index was associated with worse performance on all CAB tests. Cognitively impaired participants performed significantly worse compared to normal controls on all but one CAB test. Poorer One Card Learning test performance was associated with higher levels of CSF phosphorylated-tau/Aβ42. These results support the use of the CogState battery as measures of early cognitive impairment in studies of people at risk for AD
Self-Organization in High-Density Bacterial Colonies: Efficient Crowd Control
Colonies of bacterial cells can display complex collective dynamics, frequently culminating in the formation of biofilms and other ordered super-structures. Recent studies suggest that to cope with local environmental challenges, bacterial cells can actively seek out small chambers or cavities and assemble there, engaging in quorum sensing behavior. By using a novel microfluidic device, we showed that within chambers of distinct shapes and sizes allowing continuous cell escape, bacterial colonies can gradually self-organize. The directions of orientation of cells, their growth, and collective motion are mutually correlated and dictated by the chamber walls and locations of chamber exits. The ultimate highly organized steady state is conducive to a more-organized escape of cells from the chambers and increased access of nutrients into and evacuation of waste out of the colonies. Using a computational model, we suggest that the lengths of the cells might be optimized to maximize self-organization while minimizing the potential for stampede-like exit blockage. The self-organization described here may be crucial for the early stage of the organization of high-density bacterial colonies populating small, physically confined growth niches. It suggests that this phenomenon can play a critical role in bacterial biofilm initiation and development of other complex multicellular bacterial super-structures, including those implicated in infectious diseases
The V471A polymorphism in autophagy-related gene ATG7 modifies age at onset specifically in Italian Huntington disease patients
The cause of Huntington disease (HD) is a polyglutamine repeat expansion of more than 36 units in the huntingtin protein, which is inversely correlated with the age at onset of the disease. However, additional genetic factors are believed to modify the course and the age at onset of HD. Recently, we identified the V471A polymorphism in the autophagy-related gene ATG7, a key component of the autophagy pathway that plays an important role in HD pathogenesis, to be associated with the age at onset in a large group of European Huntington disease patients. To confirm this association in a second independent patient cohort, we analysed the ATG7 V471A polymorphism in additional 1,464 European HD patients of the “REGISTRY” cohort from the European Huntington Disease Network (EHDN). In the entire REGISTRY cohort we could not confirm a modifying effect of the ATG7 V471A polymorphism. However, analysing a modifying effect of ATG7 in these REGISTRY patients and in patients of our previous HD cohort according to their ethnic origin, we identified a significant effect of the ATG7 V471A polymorphism on the HD age at onset only in the Italian population (327 patients). In these Italian patients, the polymorphism is associated with a 6-years earlier disease onset and thus seems to have an aggravating effect. We could specify the role of ATG7 as a genetic modifier for HD particularly in the Italian population. This result affirms the modifying influence of the autophagic pathway on the course of HD, but also suggests population-specific modifying mechanisms in HD pathogenesis
Identification of genetic variants associated with Huntington's disease progression: a genome-wide association study
Background Huntington's disease is caused by a CAG repeat expansion in the huntingtin gene, HTT. Age at onset has been used as a quantitative phenotype in genetic analysis looking for Huntington's disease modifiers, but is hard to define and not always available. Therefore, we aimed to generate a novel measure of disease progression and to identify genetic markers associated with this progression measure. Methods We generated a progression score on the basis of principal component analysis of prospectively acquired longitudinal changes in motor, cognitive, and imaging measures in the 218 indivduals in the TRACK-HD cohort of Huntington's disease gene mutation carriers (data collected 2008–11). We generated a parallel progression score using data from 1773 previously genotyped participants from the European Huntington's Disease Network REGISTRY study of Huntington's disease mutation carriers (data collected 2003–13). We did a genome-wide association analyses in terms of progression for 216 TRACK-HD participants and 1773 REGISTRY participants, then a meta-analysis of these results was undertaken. Findings Longitudinal motor, cognitive, and imaging scores were correlated with each other in TRACK-HD participants, justifying use of a single, cross-domain measure of disease progression in both studies. The TRACK-HD and REGISTRY progression measures were correlated with each other (r=0·674), and with age at onset (TRACK-HD, r=0·315; REGISTRY, r=0·234). The meta-analysis of progression in TRACK-HD and REGISTRY gave a genome-wide significant signal (p=1·12 × 10−10) on chromosome 5 spanning three genes: MSH3, DHFR, and MTRNR2L2. The genes in this locus were associated with progression in TRACK-HD (MSH3 p=2·94 × 10−8 DHFR p=8·37 × 10−7 MTRNR2L2 p=2·15 × 10−9) and to a lesser extent in REGISTRY (MSH3 p=9·36 × 10−4 DHFR p=8·45 × 10−4 MTRNR2L2 p=1·20 × 10−3). The lead single nucleotide polymorphism (SNP) in TRACK-HD (rs557874766) was genome-wide significant in the meta-analysis (p=1·58 × 10−8), and encodes an aminoacid change (Pro67Ala) in MSH3. In TRACK-HD, each copy of the minor allele at this SNP was associated with a 0·4 units per year (95% CI 0·16–0·66) reduction in the rate of change of the Unified Huntington's Disease Rating Scale (UHDRS) Total Motor Score, and a reduction of 0·12 units per year (95% CI 0·06–0·18) in the rate of change of UHDRS Total Functional Capacity score. These associations remained significant after adjusting for age of onset. Interpretation The multidomain progression measure in TRACK-HD was associated with a functional variant that was genome-wide significant in our meta-analysis. The association in only 216 participants implies that the progression measure is a sensitive reflection of disease burden, that the effect size at this locus is large, or both. Knockout of Msh3 reduces somatic expansion in Huntington's disease mouse models, suggesting this mechanism as an area for future therapeutic investigation
- …