145 research outputs found

    Unifying generative and discriminative learning principles

    Background. The recognition of functional binding sites in genomic DNA remains one of the fundamental challenges of genome research. During the last decades, a plethora of different and well-adapted models has been developed, but only little attention has been paid to the development of different and similarly well-adapted learning principles. Only recently was it noticed that discriminative learning principles can be superior to generative ones in diverse bioinformatics applications, too. Results. Here, we propose a generalization of generative and discriminative learning principles containing the maximum likelihood, maximum a posteriori, maximum conditional likelihood, maximum supervised posterior, generative-discriminative trade-off, and penalized generative-discriminative trade-off learning principles as special cases, and we illustrate its efficacy for the recognition of vertebrate transcription factor binding sites. Conclusions. We find that the proposed learning principle helps to improve the recognition of transcription factor binding sites, enabling better computational approaches for extracting as much information as possible from valuable wet-lab data. We make all implementations available in the open-source library Jstacs so that this learning principle can be easily applied to other classification problems in the field of genome and epigenome analysis.

    Jet energy measurement with the ATLAS detector in proton-proton collisions at √s = 7 TeV

    The jet energy scale and its systematic uncertainty are determined for jets measured with the ATLAS detector at the LHC in proton-proton collision data at a centre-of-mass energy of √s = 7 TeV corresponding to an integrated luminosity of 38 pb⁻¹. Jets are reconstructed with the anti-kt algorithm with distance parameters R = 0.4 or R = 0.6. Jet energy and angle corrections are determined from Monte Carlo simulations to calibrate jets with transverse momenta pT ≥ 20 GeV and pseudorapidities |η| < 4.5. The jet energy systematic uncertainty is estimated using the single isolated hadron response measured in situ and in test-beams, exploiting the transverse momentum balance between central and forward jets in events with dijet topologies and studying systematic variations in Monte Carlo simulations. The jet energy uncertainty is less than 2.5% in the central calorimeter region (|η| < 0.8) for jets with 60 ≤ pT < 800 GeV, and is maximally 14% for pT < 30 GeV in the most forward region 3.2 ≤ |η| < 4.5. The jet energy is validated for jet transverse momenta up to 1 TeV to the level of a few percent using several in situ techniques by comparing a well-known reference such as the recoiling photon pT, the sum of the transverse momenta of tracks associated to the jet, or a system of low-pT jets recoiling against a high-pT jet. More sophisticated jet calibration schemes are presented based on calorimeter cell energy density weighting or hadronic properties of jets, aiming for an improved jet energy resolution and a reduced flavour dependence of the jet response. The systematic uncertainty of the jet energy determined from a combination of in situ techniques is consistent with the one derived from single hadron response measurements over a wide kinematic range. The nominal corrections and uncertainties are derived for isolated jets in an inclusive sample of high-pT jets. Special cases such as event topologies with close-by jets, or selections of samples with an enhanced content of jets originating from light quarks, heavy quarks or gluons are also discussed and the corresponding uncertainties are determined. © 2013 CERN for the benefit of the ATLAS collaboration

    Measurement of the inclusive and dijet cross-sections of b-jets in pp collisions at √s = 7 TeV with the ATLAS detector

    The inclusive and dijet production cross-sections have been measured for jets containing b-hadrons (b-jets) in proton-proton collisions at a centre-of-mass energy of √s = 7 TeV, using the ATLAS detector at the LHC. The measurements use data corresponding to an integrated luminosity of 34 pb⁻¹. The b-jets are identified using either a lifetime-based method, where secondary decay vertices of b-hadrons in jets are reconstructed using information from the tracking detectors, or a muon-based method where the presence of a muon is used to identify semileptonic decays of b-hadrons inside jets. The inclusive b-jet cross-section is measured as a function of transverse momentum in the range 20 < pT < 400 GeV and rapidity in the range |y| < 2.1. The bb̄-dijet cross-section is measured as a function of the dijet invariant mass in the range 110 < m_jj < 760 GeV, the azimuthal angle difference between the two jets and the angular variable χ in two dijet mass regions. The results are compared with next-to-leading-order QCD predictions. Good agreement is observed between the measured cross-sections and the predictions obtained using POWHEG + Pythia. MC@NLO + Herwig shows good agreement with the measured bb̄-dijet cross-section. However, it does not reproduce the measured inclusive cross-section well, particularly for central b-jets with large transverse momenta. Comment: 10 pages plus author list (21 pages total), 8 figures, 1 table, final version published in European Physical Journal

    Apples and oranges: avoiding different priors in Bayesian DNA sequence analysis

    Background. One of the challenges of bioinformatics remains the recognition of short signal sequences in genomic DNA such as donor or acceptor splice sites, splicing enhancers or silencers, translation initiation sites, transcription start sites, transcription factor binding sites, nucleosome binding sites, miRNA binding sites, or insulator binding sites. During the last decade, a wealth of algorithms for the recognition of such DNA sequences has been developed and compared with the goal of improving their performance and deepening our understanding of the underlying cellular processes. Most of these algorithms are based on statistical models belonging to the family of Markov random fields such as position weight matrix models, weight array matrix models, Markov models of higher order, or moral Bayesian networks. While in many comparative studies different learning principles or different statistical models have been compared, the influence of choosing different prior distributions for the model parameters when using different learning principles has been overlooked, and has possibly led to questionable conclusions. Results. With the goal of allowing direct comparisons of different learning principles for models from the family of Markov random fields based on the same a-priori information, we derive a generalization of the commonly-used product-Dirichlet prior. We find that the derived prior behaves like a Gaussian prior close to the maximum and like a Laplace prior in the far tails. In two case studies, we illustrate the utility of the derived prior for a direct comparison of different learning principles with different models for the recognition of binding sites of the transcription factor Sp1 and human donor splice sites. Conclusions. We find that comparisons of different learning principles using the same a-priori information can lead to conclusions different from those of previous studies in which the effect resulting from different priors has been neglected. The derived prior is implemented in the open-source library Jstacs to enable an easy application to comparative studies of different learning principles in the field of sequence analysis.

    Primate TNF Promoters Reveal Markers of Phylogeny and Evolution of Innate Immunity

    Background. Tumor necrosis factor (TNF) is a critical cytokine in the immune response whose transcriptional activation is controlled by a proximal promoter region that is highly conserved in mammals and, in particular, primates. Specific single nucleotide polymorphisms (SNPs) upstream of the proximal human TNF promoter have been identified, which are markers of human ancestry. Methodology/Principal findings. Using a comparative genomics approach we show that certain fixed genetic differences in the TNF promoter serve as markers of primate speciation. We also demonstrate that distinct alleles of most human TNF promoter SNPs are identical to fixed nucleotides in primate TNF promoters. Furthermore, we identify fixed genetic differences within the proximal TNF promoters of Asian apes that do not occur in African ape or human TNF promoters. Strikingly, protein-DNA binding assays and gene reporter assays comparing these Asian ape TNF promoters to African ape and human TNF promoters demonstrate that, unlike the fixed differences that we define that are associated with primate phylogeny, these Asian ape-specific fixed differences impair transcription factor binding at an Sp1 site and decrease TNF transcription induced by bacterial stimulation of macrophages. Conclusions/significance. Here, we have presented the broadest interspecies comparison of a regulatory region of an innate immune response gene to date. We have characterized nucleotide positions in Asian ape TNF promoters that underlie functional changes in cell type- and stimulus-specific activation of the TNF gene. We have also identified ancestral TNF promoter nucleotide states in the primate lineage that correspond to human SNP alleles. These findings may reflect evolution of Asian and African apes under a distinct set of infectious disease pressures involving the innate immune response and TNF

    A Kinase-Phosphatase Network that Regulates Kinetochore-Microtubule Attachments and the SAC


    Heavy and light roles: myosin in the morphogenesis of the heart

    Myosin is an essential component of cardiac muscle, from the onset of cardiogenesis through to the adult heart. Although traditionally known for its role in energy transduction and force development, recent studies suggest that both myosin heavy-chain and myosin light-chain proteins are required for a correctly formed heart. Myosins are structural proteins that are not only expressed from early stages of heart development, but when mutated in humans they may give rise to congenital heart defects. This review will discuss the roles of myosin, specifically with regard to the developing heart. The expression of each myosin protein will be described, and the effects that altering expression has on the heart in embryogenesis in different animal models will be discussed. The human molecular genetics of the myosins will also be reviewed.

    Measurement of the cross-section for b-jets produced in association with a Z boson at √s = 7 TeV with the ATLAS detector (ATLAS Collaboration)

    A measurement is presented of the inclusive cross-section for b-jet production in association with a Z boson in pp collisions at a centre-of-mass energy of √s = 7 TeV. The analysis uses the data sample collected by the ATLAS experiment in 2010, corresponding to an integrated luminosity of approximately 36 pb⁻¹. The event selection requires a Z boson decaying into high-pT electrons or muons, and at least one b-jet, identified by its displaced vertex, with transverse momentum pT > 25 GeV and rapidity |y| < 2.1. After subtraction of background processes, the yield is extracted from the vertex mass distribution of the candidate b-jets. The ratio of this cross-section to the inclusive Z cross-section (the average number of b-jets per Z event) is also measured. Both results are found to be in good agreement with perturbative QCD predictions at next-to-leading order.

    Worldwide trends in body-mass index, underweight, overweight, and obesity from 1975 to 2016: a pooled analysis of 2416 population-based measurement studies in 128·9 million children, adolescents, and adults.

    BACKGROUND: Underweight, overweight, and obesity in childhood and adolescence are associated with adverse health consequences throughout the life-course. Our aim was to estimate worldwide trends in mean body-mass index (BMI) and a comprehensive set of BMI categories that cover underweight to obesity in children and adolescents, and to compare trends with those of adults. METHODS: We pooled 2416 population-based studies with measurements of height and weight on 128·9 million participants aged 5 years and older, including 31·5 million aged 5-19 years. We used a Bayesian hierarchical model to estimate trends from 1975 to 2016 in 200 countries for mean BMI and for prevalence of BMI in the following categories for children and adolescents aged 5-19 years: more than 2 SD below the median of the WHO growth reference for children and adolescents (referred to as moderate and severe underweight hereafter), 2 SD to more than 1 SD below the median (mild underweight), 1 SD below the median to 1 SD above the median (healthy weight), more than 1 SD to 2 SD above the median (overweight but not obese), and more than 2 SD above the median (obesity). FINDINGS: Regional change in age-standardised mean BMI in girls from 1975 to 2016 ranged from virtually no change (-0·01 kg/m² per decade; 95% credible interval -0·42 to 0·39, posterior probability [PP] of the observed decrease being a true decrease=0·5098) in eastern Europe to an increase of 1·00 kg/m² per decade (0·69-1·35, PP>0·9999) in central Latin America and an increase of 0·95 kg/m² per decade (0·64-1·25, PP>0·9999) in Polynesia and Micronesia. The range for boys was from a non-significant increase of 0·09 kg/m² per decade (-0·33 to 0·49, PP=0·6926) in eastern Europe to an increase of 0·77 kg/m² per decade (0·50-1·06, PP>0·9999) in Polynesia and Micronesia. Trends in mean BMI have recently flattened in northwestern Europe and the high-income English-speaking and Asia-Pacific regions for both sexes, southwestern Europe for boys, and central and Andean Latin America for girls. By contrast, the rise in BMI has accelerated in east and south Asia for both sexes, and southeast Asia for boys. Global age-standardised prevalence of obesity increased from 0·7% (0·4-1·2) in 1975 to 5·6% (4·8-6·5) in 2016 in girls, and from 0·9% (0·5-1·3) in 1975 to 7·8% (6·7-9·1) in 2016 in boys; the prevalence of moderate and severe underweight decreased from 9·2% (6·0-12·9) in 1975 to 8·4% (6·8-10·1) in 2016 in girls and from 14·8% (10·4-19·5) in 1975 to 12·4% (10·3-14·5) in 2016 in boys. Prevalence of moderate and severe underweight was highest in India, at 22·7% (16·7-29·6) among girls and 30·7% (23·5-38·0) among boys. Prevalence of obesity was more than 30% in girls in Nauru, the Cook Islands, and Palau; and boys in the Cook Islands, Nauru, Palau, Niue, and American Samoa in 2016. Prevalence of obesity was about 20% or more in several countries in Polynesia and Micronesia, the Middle East and north Africa, the Caribbean, and the USA. In 2016, 75 (44-117) million girls and 117 (70-178) million boys worldwide were moderately or severely underweight. In the same year, 50 (24-89) million girls and 74 (39-125) million boys worldwide were obese. INTERPRETATION: The rising trends in children's and adolescents' BMI have plateaued in many high-income countries, albeit at high levels, but have accelerated in parts of Asia, with trends no longer correlated with those of adults. FUNDING: Wellcome Trust, AstraZeneca Young Health Programme