1,651 research outputs found

    Computational discovery and analysis of metabolic pathways

    Get PDF
    Finding novel or non-standard metabolic pathways, possibly spanning multiple species, has important applications in fields such as metabolic engineering, metabolic network analysis, and metabolic network reconstruction. Traditionally, this has been a manual process, but the large volume of metabolic data now available has created a need for computational tools to automatically identify biologically relevant pathways. This thesis presents new algorithms for automatically finding biologically meaningful linear and branched metabolic pathways in multi-genome scale metabolic networks. These algorithms utilize atom mapping data, which provides the correspondence between atoms in the substrates to atoms in the products of a chemical reaction, to find pathways which conserve a given number of atoms between desired start and target compounds. The first algorithm presented identifies atom conserving linear pathways by explicitly tracking atoms during an exploration of a graph structure constructed from the atom mapping data. The explicit tracking of atoms enables finding branched pathways because it provides automatic identification of the reactions and compounds through which atoms are lost or gained. The thesis then describes two algorithmic approaches for identifying branched metabolic pathways based upon atom conserving linear pathways. One approach takes one linear pathway at a time and attempts to add branches that connect loss and gain compounds. The other approach takes a group of linear pathways and attempts to merge pathways that move mutually exclusive sets of atoms from the start to the target compounds. Comparisons to known metabolic pathways demonstrate that atom tracking causes the algorithms to avoid many unrealistic connections, often found in previous approaches, and return biologically meaningful pathways. While the theoretical complexity of finding even linear atom conserving pathways is high, by choosing the appropriate representations and heuristics, and perhaps due to the structure of the underlying data, the algorithms in this thesis have practical running times on real data. The results also demonstrate the potential of the algorithms to find novel or non-standard pathways that may span multiple organisms

    Adsorption of mono- and multivalent cat- and anions on DNA molecules

    Get PDF
    Adsorption of monovalent and multivalent cat- and anions on a deoxyribose nucleic acid (DNA) molecule from a salt solution is investigated by computer simulation. The ions are modelled as charged hard spheres, the DNA molecule as a point charge pattern following the double-helical phosphate strands. The geometrical shape of the DNA molecules is modelled on different levels ranging from a simple cylindrical shape to structured models which include the major and minor grooves between the phosphate strands. The densities of the ions adsorbed on the phosphate strands, in the major and in the minor grooves are calculated. First, we find that the adsorption pattern on the DNA surface depends strongly on its geometrical shape: counterions adsorb preferentially along the phosphate strands for a cylindrical model shape, but in the minor groove for a geometrically structured model. Second, we find that an addition of monovalent salt ions results in an increase of the charge density in the minor groove while the total charge density of ions adsorbed in the major groove stays unchanged. The adsorbed ion densities are highly structured along the minor groove while they are almost smeared along the major groove. Furthermore, for a fixed amount of added salt, the major groove cationic charge is independent on the counterion valency. For increasing salt concentration the major groove is neutralized while the total charge adsorbed in the minor groove is constant. DNA overcharging is detected for multivalent salt. Simulations for a larger ion radii, which mimic the effect of the ion hydration, indicate an increased adsorbtion of cations in the major groove.Comment: 34 pages with 14 figure

    Measurement of the t t-bar production cross section in the dilepton channel in pp collisions at sqrt(s) = 7 TeV

    Get PDF
    The t t-bar production cross section (sigma[t t-bar]) is measured in proton-proton collisions at sqrt(s) = 7 TeV in data collected by the CMS experiment, corresponding to an integrated luminosity of 2.3 inverse femtobarns. The measurement is performed in events with two leptons (electrons or muons) in the final state, at least two jets identified as jets originating from b quarks, and the presence of an imbalance in transverse momentum. The measured value of sigma[t t-bar] for a top-quark mass of 172.5 GeV is 161.9 +/- 2.5 (stat.) +5.1/-5.0 (syst.) +/- 3.6(lumi.) pb, consistent with the prediction of the standard model.Comment: Replaced with published version. Included journal reference and DO

    Search for anomalous t t-bar production in the highly-boosted all-hadronic final state

    Get PDF
    A search is presented for a massive particle, generically referred to as a Z', decaying into a t t-bar pair. The search focuses on Z' resonances that are sufficiently massive to produce highly Lorentz-boosted top quarks, which yield collimated decay products that are partially or fully merged into single jets. The analysis uses new methods to analyze jet substructure, providing suppression of the non-top multijet backgrounds. The analysis is based on a data sample of proton-proton collisions at a center-of-mass energy of 7 TeV, corresponding to an integrated luminosity of 5 inverse femtobarns. Upper limits in the range of 1 pb are set on the product of the production cross section and branching fraction for a topcolor Z' modeled for several widths, as well as for a Randall--Sundrum Kaluza--Klein gluon. In addition, the results constrain any enhancement in t t-bar production beyond expectations of the standard model for t t-bar invariant masses larger than 1 TeV.Comment: Submitted to the Journal of High Energy Physics; this version includes a minor typo correction that will be submitted as an erratu

    A new strategy for enhancing imputation quality of rare variants from next-generation sequencing data via combining SNP and exome chip data

    Get PDF
    Background: Rare variants have gathered increasing attention as a possible alternative source of missing heritability. Since next generation sequencing technology is not yet cost-effective for large-scale genomic studies, a widely used alternative approach is imputation. However, the imputation approach may be limited by the low accuracy of the imputed rare variants. To improve imputation accuracy of rare variants, various approaches have been suggested, including increasing the sample size of the reference panel, using sequencing data from study-specific samples (i.e., specific populations), and using local reference panels by genotyping or sequencing a subset of study samples. While these approaches mainly utilize reference panels, imputation accuracy of rare variants can also be increased by using exome chips containing rare variants. The exome chip contains 250 K rare variants selected from the discovered variants of about 12,000 sequenced samples. If exome chip data are available for previously genotyped samples, the combined approach using a genotype panel of merged data, including exome chips and SNP chips, should increase the imputation accuracy of rare variants. Results: In this study, we describe a combined imputation which uses both exome chip and SNP chip data simultaneously as a genotype panel. The effectiveness and performance of the combined approach was demonstrated using a reference panel of 848 samples constructed using exome sequencing data from the T2D-GENES consortium and 5,349 sample genotype panels consisting of an exome chip and SNP chip. As a result, the combined approach increased imputation quality up to 11 %, and genomic coverage for rare variants up to 117.7 % (MAF < 1 %), compared to imputation using the SNP chip alone. Also, we investigated the systematic effect of reference panels on imputation quality using five reference panels and three genotype panels. The best performing approach was the combination of the study specific reference panel and the genotype panel of combined data. Conclusions: Our study demonstrates that combined datasets, including SNP chips and exome chips, enhances both the imputation quality and genomic coverage of rare variants

    Multiple novel prostate cancer susceptibility signals identified by fine-mapping of known risk loci among Europeans

    Get PDF
    Genome-wide association studies (GWAS) have identified numerous common prostate cancer (PrCa) susceptibility loci. We have fine-mapped 64 GWAS regions known at the conclusion of the iCOGS study using large-scale genotyping and imputation in 25 723 PrCa cases and 26 274 controls of European ancestry. We detected evidence for multiple independent signals at 16 regions, 12 of which contained additional newly identified significant associations. A single signal comprising a spectrum of correlated variation was observed at 39 regions; 35 of which are now described by a novel more significantly associated lead SNP, while the originally reported variant remained as the lead SNP only in 4 regions. We also confirmed two association signals in Europeans that had been previously reported only in East-Asian GWAS. Based on statistical evidence and linkage disequilibrium (LD) structure, we have curated and narrowed down the list of the most likely candidate causal variants for each region. Functional annotation using data from ENCODE filtered for PrCa cell lines and eQTL analysis demonstrated significant enrichment for overlap with bio-features within this set. By incorporating the novel risk variants identified here alongside the refined data for existing association signals, we estimate that these loci now explain ∼38.9% of the familial relative risk of PrCa, an 8.9% improvement over the previously reported GWAS tag SNPs. This suggests that a significant fraction of the heritability of PrCa may have been hidden during the discovery phase of GWAS, in particular due to the presence of multiple independent signals within the same regio

    New genetic loci link adipose and insulin biology to body fat distribution.

    Get PDF
    Body fat distribution is a heritable trait and a well-established predictor of adverse metabolic outcomes, independent of overall adiposity. To increase our understanding of the genetic basis of body fat distribution and its molecular links to cardiometabolic traits, here we conduct genome-wide association meta-analyses of traits related to waist and hip circumferences in up to 224,459 individuals. We identify 49 loci (33 new) associated with waist-to-hip ratio adjusted for body mass index (BMI), and an additional 19 loci newly associated with related waist and hip circumference measures (P < 5 × 10(-8)). In total, 20 of the 49 waist-to-hip ratio adjusted for BMI loci show significant sexual dimorphism, 19 of which display a stronger effect in women. The identified loci were enriched for genes expressed in adipose tissue and for putative regulatory elements in adipocytes. Pathway analyses implicated adipogenesis, angiogenesis, transcriptional regulation and insulin resistance as processes affecting fat distribution, providing insight into potential pathophysiological mechanisms

    Optimasi Portofolio Resiko Menggunakan Model Markowitz MVO Dikaitkan dengan Keterbatasan Manusia dalam Memprediksi Masa Depan dalam Perspektif Al-Qur`an

    Full text link
    Risk portfolio on modern finance has become increasingly technical, requiring the use of sophisticated mathematical tools in both research and practice. Since companies cannot insure themselves completely against risk, as human incompetence in predicting the future precisely that written in Al-Quran surah Luqman verse 34, they have to manage it to yield an optimal portfolio. The objective here is to minimize the variance among all portfolios, or alternatively, to maximize expected return among all portfolios that has at least a certain expected return. Furthermore, this study focuses on optimizing risk portfolio so called Markowitz MVO (Mean-Variance Optimization). Some theoretical frameworks for analysis are arithmetic mean, geometric mean, variance, covariance, linear programming, and quadratic programming. Moreover, finding a minimum variance portfolio produces a convex quadratic programming, that is minimizing the objective function ðð¥with constraintsð ð 𥠥 ðandð´ð¥ = ð. The outcome of this research is the solution of optimal risk portofolio in some investments that could be finished smoothly using MATLAB R2007b software together with its graphic analysis
    corecore