4,479 research outputs found
Understand Data Preprocessing for Effective End-to-End Training of Deep Neural Networks
In this paper, we primarily focus on understanding the data preprocessing
pipeline for DNN Training in the public cloud. First, we run experiments to
test the performance implications of the two major data preprocessing methods
using either raw data or record files. The preliminary results show that data
preprocessing is a clear bottleneck, even with the most efficient software and
hardware configuration enabled by NVIDIA DALI, a high-optimized data
preprocessing library. Second, we identify the potential causes, exercise a
variety of optimization methods, and present their pros and cons. We hope this
work will shed light on the new co-design of ``data storage, loading pipeline''
and ``training framework'' and flexible resource configurations between them so
that the resources can be fully exploited and performance can be maximized
Recommended from our members
Statistical Assessment of the Global Regulatory Role of Histone Acetylation in Saccharomyces cerevisiae
BACKGROUND: Histone acetylation plays important but incompletely understood roles in gene regulation. A comprehensive understanding of the regulatory role of histone acetylation is difficult because many different histone acetylation patterns exist and their effects are confounded by other factors, such as the transcription factor binding sequence motif information and nucleosome occupancy.
RESULTS: We analyzed recent genomewide histone acetylation data using a few complementary statistical models and tested the validity of a cumulative model in approximating the global regulatory effect of histone acetylation. Confounding effects due to transcription factor binding sequence information were estimated by using two independent motif-based algorithms followed by a variable selection method. We found that the sequence information has a significant role in regulating transcription, and we also found a clear additional histone acetylation effect. Our model fits well with observed genome-wide data. Strikingly, including more complicated combinatorial effects does not improve the model's performance. Through a statistical analysis of conditional independence, we found that H4 acetylation may not have significant direct impact on global gene expression.
CONCLUSION: Decoding the combinatorial complexity of histone modification requires not only new data but also new methods to analyze the data. Our statistical analysis confirms that histone acetylation has a significant effect on gene transcription rates in addition to that attributable to upstream sequence motifs. Our analysis also suggests that a cumulative effect model for global histone acetylation is justified, although a more complex histone code may be important at specific gene loci. We also found that the regulatory roles among different histone acetylation sites have important differences.Statistic
A New Concept to Reveal Protein Dynamics Based on Energy Dissipation
Protein dynamics is essential for its function, especially for intramolecular signal transduction. In this work we propose a new concept, energy dissipation model, to systematically reveal protein dynamics upon effector binding and energy perturbation. The concept is applied to better understand the intramolecular signal transduction during allostery of enzymes. The E. coli allosteric enzyme, aspartokinase III, is used as a model system and special molecular dynamics simulations are designed and carried out. Computational results indicate that the number of residues affected by external energy perturbation (i.e. caused by a ligand binding) during the energy dissipation process shows a sigmoid pattern. Using the two-state Boltzmann equation, we define two parameters, the half response time and the dissipation rate constant, which can be used to well characterize the energy dissipation process. For the allostery of aspartokinase III, the residue response time indicates that besides the ACT2 signal transduction pathway, there is another pathway between the regulatory site and the catalytic site, which is suggested to be the β15-αK loop of ACT1. We further introduce the term “protein dynamical modules” based on the residue response time. Different from the protein structural modules which merely provide information about the structural stability of proteins, protein dynamical modules could reveal protein characteristics from the perspective of dynamics. Finally, the energy dissipation model is applied to investigate E. coli aspartokinase III mutations to better understand the desensitization of product feedback inhibition via allostery. In conclusion, the new concept proposed in this paper gives a novel holistic view of protein dynamics, a key question in biology with high impacts for both biotechnology and biomedicine
Lattice study on and X(3872)
Properties of charmonium are investigated in quenched
lattice QCD. The mass of is determined to be 3.80(3) GeV, which is
close to the mass of -wave charmonium and in agreement with
quark model predictions. The transition width of
is also obtained with a value keV. Since the possible
assignment to X(3872) has not been ruled out by experiments, our results help
to clarify the nature of X(3872).Comment: 15 pages, 8 figures. typos, grammatical errors and some references
corrected, redundant discussions deleted, conclusion does not change.
published versio
- …