1,344 research outputs found
A Nuclear Localization Signal in Herpesvirus Protein VP1-2 Is Essential for Infection via Capsid Routing to the Nuclear Pore
To initiate infection, herpesviruses must navigate to and transport their genomes across the nuclear pore. VP1-2 is a large structural protein of the virion that is conserved in all herpesviruses and plays multiple essential roles in virus replication, including roles in early entry. VP1-2 contains an N-terminal basic motif which functions as an efficient nuclear localization signal (NLS). In this study, we constructed a mutant HSV strain, K.VP1-2ΔNLS, which contains a 7-residue deletion of the core NLS at position 475. This mutant fails to spread in normal cells but can be propagated in complementing cell lines. Electron microscopy (EM) analysis of infection in noncomplementing cells demonstrated capsid assembly, cytoplasmic envelopment, and the formation of extracellular enveloped virions. Furthermore, extracellular virions isolated from noncomplementing cells had similar profiles and abundances of structural proteins. Virions containing VP1-2ΔNLS were able to enter and be transported within cells. However, further progress of infection was prevented, with at least a 500- to 1,000-fold reduction in the efficiency of initiating gene expression compared to that in the revertant. Ultrastructural and immunofluorescence analyses revealed that the K.VP1-2ΔNLS mutant was blocked at the microtubule organizing center or immediately upstream of nuclear pore docking and prior to gene expression. These results indicate that the VP1-2 NLS is not required for the known assembly functions of the protein but is a key requirement for the early routing to the nuclear pore that is necessary for successful infection. Given its conservation, we propose that this motif may also be critical for entry of other classes of herpesviruses
Proglacial erosion rates and processes in a glacierized catchment in the Swiss Alps
In the Swiss Alps, climatic changes have not only caused glacier retreat, but also likely increased sedimentation downstream of glaciers. This material either originates from below the glacier or from periglacial environments, which are exposed as glaciers retreat, and often consist of easily erodible sediment. Griesgletscher's catchment in the Swiss Alps was examined to quantify erosion in the proglacial area, possible hydrological drivers and contributions of the sub- and periglacial sources. Digital elevation models, created from annual aerial photographs, were subtracted to determine annual volume changes in the proglacial area from 1986 to 2014. These data show a strong increase in proglacial erosion in the decade prior to 2012, coincident with increasing proglacial area size. However, examination of the gradient between discharge and sediment evacuation, and modeled sediment transport, could suggest that the proglacial area began to stabilize and sediment supply is limited. The large influx of sediment into the proglacial reservoir, which is roughly 2.5 times greater than the amount of sediment eroded from the proglacial area, demonstrates the importance of subglacial erosion to the catchment's sediment budget. Although far more sediment originates subglacially, erosion rates in the proglacial area are over 50 times greater than the rest of the catchment. In turn, both sub- and periglacial processes, in addition to constraining sediment supply, must be considered for assessing future sediment dynamics as glacier area shrinks and proglacial areas grow
Pre-processing and differential expression analysis of Agilent microRNA arrays using the AgiMicroRna Bioconductor library
<p>Abstract</p> <p>Background</p> <p>The main research tool for identifying microRNAs involved in specific cellular processes is gene expression profiling using microarray technology. Agilent is one of the major producers of microRNA arrays, and microarray data are commonly analyzed by using R and the functions and packages collected in the Bioconductor project. However, an analytical package that integrates the specific characteristics of microRNA Agilent arrays has been lacking.</p> <p>Results</p> <p>This report presents the new bioinformatic tool <it>AgiMicroRNA </it>for the pre-processing and differential expression analysis of Agilent microRNA array data. The software is implemented in the open-source statistical scripting language R and is integrated in the Bioconductor project (<url>http://www.bioconductor.org</url>) under the GPL license. For the pre-processing of the microRNA signal, <it>AgiMicroRNA </it>incorporates the <it>robust multiarray average algorithm</it>, a method that produces a summary measure of the microRNA expression using a linear model that takes into account the probe affinity effect. To obtain a normalized microRNA signal useful for the statistical analysis, <it>AgiMicroRna </it>offers the possibility of employing either the processed signal estimated by the <it>robust multiarray average algorithm </it>or the processed signal produced by the Agilent image analysis software. The <it>AgiMicroRNA </it>package also incorporates different graphical utilities to assess the quality of the data. <it>AgiMicroRna </it>uses the linear model features implemented in the <it>limma </it>package to assess the differential expression between different experimental conditions and provides links to the <it>miRBase </it>for those microRNAs that have been declared as significant in the statistical analysis.</p> <p>Conclusions</p> <p><it>AgiMicroRna </it>is a rational collection of Bioconductor functions that have been wrapped into specific functions in order to ease and systematize the pre-processing and statistical analysis of Agilent microRNA data. The development of this package contributes to the Bioconductor project filling the gap in microRNA array data analysis.</p
Normalized Affymetrix expression data are biased by G-quadruplex formation
Probes with runs of four or more guanines (G-stacks) in their sequences can exhibit a level of hybridization that is unrelated to the expression levels of the mRNA that they are intended to measure. This is most likely caused by the formation of G-quadruplexes, where inter-probe guanines form Hoogsteen hydrogen bonds, which probes with G-stacks are capable of forming. We demonstrate that for a specific microarray data set using the Human HG-U133A Affymetrix GeneChip and RMA normalization there is significant bias in the expression levels, the fold change and the correlations between expression levels. These effects grow more pronounced as the number of G-stack probes in a probe set increases. Approximately 14 of the probe sets are directly affected. The analysis was repeated for a number of other normalization pipelines and two, FARMS and PLIER, minimized the bias to some extent. We estimate that ∼15 of the data sets deposited in the GEO database are susceptible to the effect. The inclusion of G-stack probes in the affected data sets can bias key parameters used in the selection and clustering of genes. The elimination of these probes from any analysis in such affected data sets outweighs the increase of noise in the signal. © 2011 The Author(s)
Modelling the spatial distribution of DEM Error
Assessment of a DEM’s quality is usually undertaken by deriving a measure of DEM accuracy – how close the DEM’s elevation values are to the true elevation. Measures such as Root Mean Squared Error and standard deviation of the error are frequently used. These measures summarise elevation errors in a DEM as a single value. A more detailed description of DEM accuracy would allow better understanding of DEM quality and the consequent uncertainty associated with using DEMs in analytical applications. The research presented addresses the limitations of using a single root mean squared error (RMSE) value to represent the uncertainty associated with a DEM by developing a new technique for creating a spatially distributed model of DEM quality – an accuracy surface. The technique is based on the hypothesis that the distribution and scale of elevation error within a DEM are at least partly related to morphometric characteristics of the terrain. The technique involves generating a set of terrain parameters to characterise terrain morphometry and developing regression models to define the relationship between DEM error and morphometric character. The regression models form the basis for creating standard deviation surfaces to represent DEM accuracy. The hypothesis is shown to be true and reliable accuracy surfaces are successfully created. These accuracy surfaces provide more detailed information about DEM accuracy than a single global estimate of RMSE
A single-sample method for normalizing and combining full-resolution copy numbers from multiple platforms, labs and analysis methods
Motivation: The rapid expansion of whole-genome copy number (CN) studies brings a demand for increased precision and resolution of CN estimates. Recent studies have obtained CN estimates from more than one platform for the same set of samples, and it is natural to want to combine the different estimates in order to meet this demand. Estimates from different platforms show different degrees of attenuation of the true CN changes. Similar differences can be observed in CNs from the same platform run in different labs, or in the same lab, with different analytical methods. This is the reason why it is not straightforward to combine CN estimates from different sources (platforms, labs and analysis methods)
Canopy nitrogen, carbon assimilation, and albedo in temperate and boreal forests: Functional relations and potential climate feedbacks
The availability of nitrogen represents a key constraint on carbon cycling in terrestrial ecosystems, and it is largely in this capacity that the role of N in the Earth\u27s climate system has been considered. Despite this, few studies have included continuous variation in plant N status as a driver of broad-scale carbon cycle analyses. This is partly because of uncertainties in how leaf-level physiological relationships scale to whole ecosystems and because methods for regional to continental detection of plant N concentrations have yet to be developed. Here, we show that ecosystem CO2 uptake capacity in temperate and boreal forests scales directly with whole-canopy N concentrations, mirroring a leaf-level trend that has been observed for woody plants worldwide. We further show that both CO2 uptake capacity and canopy N concentration are strongly and positively correlated with shortwave surface albedo. These results suggest that N plays an additional, and overlooked, role in the climate system via its influence on vegetation reflectivity and shortwave surface energy exchange. We also demonstrate that much of the spatial variation in canopy N can be detected by using broad-band satellite sensors, offering a means through which these findings can be applied toward improved application of coupled carbon cycle–climate models
Probe set algorithms: is there a rational best bet?
Affymetrix microarrays have become a standard experimental platform for studies of mRNA expression profiling. Their success is due, in part, to the multiple oligonucleotide features (probes) against each transcript (probe set). This multiple testing allows for more robust background assessments and gene expression measures, and has permitted the development of many computational methods to translate image data into a single normalized "signal" for mRNA transcript abundance. There are now many probe set algorithms that have been developed, with a gradual movement away from chip-by-chip methods (MAS5), to project-based model-fitting methods (dCHIP, RMA, others). Data interpretation is often profoundly changed by choice of algorithm, with disoriented biologists questioning what the "accurate" interpretation of their experiment is. Here, we summarize the debate concerning probe set algorithms. We provide examples of how changes in mismatch weight, normalizations, and construction of expression ratios each dramatically change data interpretation. All interpretations can be considered as computationally appropriate, but with varying biological credibility. We also illustrate the performance of two new hybrid algorithms (PLIER, GC-RMA) relative to more traditional algorithms (dCHIP, MAS5, Probe Profiler PCA, RMA) using an interactive power analysis tool. PLIER appears superior to other algorithms in avoiding false positives with poorly performing probe sets. Based on our interpretation of the literature, and examples presented here, we suggest that the variability in performance of probe set algorithms is more dependent upon assumptions regarding "background", than on calculations of "signal". We argue that "background" is an enormously complex variable that can only be vaguely quantified, and thus the "best" probe set algorithm will vary from project to project
Knowledge-based gene expression classification via matrix factorization
Motivation: Modern machine learning methods based on matrix decomposition techniques, like independent component analysis (ICA) or non-negative matrix factorization (NMF), provide new and efficient analysis tools which are currently explored to analyze gene expression profiles. These exploratory feature extraction techniques yield expression modes (ICA) or metagenes (NMF). These extracted features are considered indicative of underlying regulatory processes. They can as well be applied to the classification of gene expression datasets by grouping samples into different categories for diagnostic purposes or group genes into functional categories for further investigation of related metabolic pathways and regulatory networks.
Results: In this study we focus on unsupervised matrix factorization techniques and apply ICA and sparse NMF to microarray datasets. The latter monitor the gene expression levels of human peripheral blood cells during differentiation from monocytes to macrophages. We show that these tools are able to identify relevant signatures in the deduced component matrices and extract informative sets of marker genes from these gene expression profiles. The methods rely on the joint discriminative power of a set of marker genes rather than on single marker genes. With these sets of marker genes, corroborated by leave-one-out or random forest cross-validation, the datasets could easily be classified into related diagnostic categories. The latter correspond to either monocytes versus macrophages or healthy vs Niemann Pick C disease patients.Siemens AG, MunichDFG (Graduate College 638)DAAD (PPP Luso - Alem˜a and PPP Hispano - Alemanas
- …