190 research outputs found

    Efficient HTTP based I/O on very large datasets for high performance computing with the libdavix library

    Full text link
    Remote data access for data analysis in high performance computing is commonly done with specialized data access protocols and storage systems. These protocols are highly optimized for high throughput on very large datasets, multi-streams, high availability, low latency and efficient parallel I/O. The purpose of this paper is to describe how we have adapted a generic protocol, the Hyper Text Transport Protocol (HTTP) to make it a competitive alternative for high performance I/O and data analysis applications in a global computing grid: the Worldwide LHC Computing Grid. In this work, we first analyze the design differences between the HTTP protocol and the most common high performance I/O protocols, pointing out the main performance weaknesses of HTTP. Then, we describe in detail how we solved these issues. Our solutions have been implemented in a toolkit called davix, available through several recent Linux distributions. Finally, we describe the results of our benchmarks where we compare the performance of davix against a HPC specific protocol for a data analysis use case.Comment: Presented at: Very large Data Bases (VLDB) 2014, Hangzho

    The Impact of CpG Island on Defining Transcriptional Activation of the Mouse L1 Retrotransposable Elements

    Get PDF
    BACKGROUND: L1 retrotransposable elements are potent insertional mutagens responsible for the generation of genomic variation and diversification of mammalian genomes, but reliable estimates of the numbers of actively transposing L1 elements are mostly nonexistent. While the human and mouse genomes contain comparable numbers of L1 elements, several phylogenetic and L1Xplore analyses in the mouse genome suggest that 1,500-3,000 active L1 elements currently exist and that they are still expanding in the genome. Conversely, the human genome contains only 150 active L1 elements. In addition, there is a discrepancy among the nature and number of mouse L1 elements in L1Xplore and the mouse genome browser at the UCSC and in the literature. To date, the reason why a high copy number of active L1 elements exist in the mouse genome but not in the human genome is unknown, as are the potential mechanisms that are responsible for transcriptional activation of mouse L1 elements. METHODOLOGY/PRINCIPAL FINDINGS: We analyzed the promoter sequences of the 1,501 potentially active mouse L1 elements retrieved from the GenBank and L1Xplore databases and evaluated their transcription factors binding sites and CpG content. To this end, we found that a substantial number of mouse L1 elements contain altered transcription factor YY1 binding sites on their promoter sequences that are required for transcriptional initiation, suggesting that only a half of L1 elements are capable of being transcriptionally active. Furthermore, we present experimental evidence that previously unreported CpG islands exist in the promoters of the most active T(F) family of mouse L1 elements. The presence of sequence variations and polymorphisms in CpG islands of L1 promoters that arise from transition mutations indicates that CpG methylation could play a significant role in determining the activity of L1 elements in the mouse genome. CONCLUSIONS: A comprehensive analysis of mouse L1 promoters suggests that the number of transcriptionally active elements is significantly lower than the total number of full-length copies from the three active mouse L1 families. Like human L1 elements, the CpG islands and potentially the transcription factor YY1 binding sites are likely to be required for transcriptional initiation of mouse L1 elements

    Two-pion Bose-Einstein correlations in central Pb-Pb collisions at sNN\sqrt{s_{\rm NN}} = 2.76 TeV

    Get PDF
    The first measurement of two-pion Bose-Einstein correlations in central Pb-Pb collisions at sNN=2.76\sqrt{s_{\rm NN}} = 2.76 TeV at the Large Hadron Collider is presented. We observe a growing trend with energy now not only for the longitudinal and the outward but also for the sideward pion source radius. The pion homogeneity volume and the decoupling time are significantly larger than those measured at RHIC.Comment: 17 pages, 5 captioned figures, 1 table, authors from page 12, published version, figures at http://aliceinfo.cern.ch/ArtSubmission/node/388

    An Atlas of the Speed of Copy Number Changes in Animal Gene Families and Its Implications

    Get PDF
    The notion that gene duplications generating new genes and functions is commonly accepted in evolutionary biology. However, this assumption is more speculative from theory rather than well proven in genome-wide studies. Here, we generated an atlas of the rate of copy number changes (CNCs) in all the gene families of ten animal genomes. We grouped the gene families with similar CNC dynamics into rate pattern groups (RPGs) and annotated their function using a novel bottom-up approach. By comparing CNC rate patterns, we showed that most of the species-specific CNC rates groups are formed by gene duplication rather than gene loss, and most of the changes in rates of CNCs may be the result of adaptive evolution. We also found that the functions of many RPGs match their biological significance well. Our work confirmed the role of gene duplication in generating novel phenotypes, and the results can serve as a guide for researchers to connect the phenotypic features to certain gene duplications

    The role of LINEs and CpG islands in dosage compensation on the chicken Z chromosome

    Get PDF
    Most avian Z genes are expressed more highly in ZZ males than ZW females, suggesting that chromosome-wide mechanisms of dosage compensation have not evolved. Nevertheless, a small percentage of Z genes are expressed at similar levels in males and females, an indication that a yet unidentified mechanism compensates for the sex difference in copy number. Primary DNA sequences are thought to have a role in determining chromosome gene inactivation status on the mammalian X chromosome. However, it is currently unknown whether primary DNA sequences also mediate chicken Z gene compensation status. Using a combination of chicken DNA sequences and Z gene compensation profiles of 310 genes, we explored the relationship between Z gene compensation status and primary DNA sequence features. Statistical analysis of different Z chromosomal features revealed that long interspersed nuclear elements (LINEs) and CpG islands are enriched on the Z chromosome compared with 329 other DNA features. Linear support vector machine (SVM) classifiers, using primary DNA sequences, correctly predict the Z compensation status for >60% of all Z-linked genes. CpG islands appear to be the most accurate classifier and alone can correctly predict compensation of 63% of Z genes. We also show that LINE CR1 elements are enriched 2.7-fold on the chicken Z chromosome compared with autosomes and that chicken chromosomal length is highly correlated with percentage LINE content. However, the position of LINE elements is not significantly associated with dosage compensation status of Z genes. We also find a trend for a higher proportion of CpG islands in the region of the Z chromosome with the fewest dosage-compensated genes compared with the region containing the greatest concentration of compensated genes. Comparison between chicken and platypus genomes shows that LINE elements are not enriched on sex chromosomes in platypus, indicating that LINE accumulation is not a feature of all sex chromosomes. Our results suggest that CpG islands are not randomly distributed on the Z chromosome and may influence Z gene dosage compensation status

    Characterization of LINE-1 Ribonucleoprotein Particles

    Get PDF
    The average human genome contains a small cohort of active L1 retrotransposons that encode two proteins (ORF1p and ORF2p) required for their mobility (i.e., retrotransposition). Prior studies demonstrated that human ORF1p, L1 RNA, and an ORF2p-encoded reverse transcriptase activity are present in ribonucleoprotein (RNP) complexes. However, the inability to physically detect ORF2p from engineered human L1 constructs has remained a technical challenge in the field. Here, we have employed an epitope/RNA tagging strategy with engineered human L1 retrotransposons to identify ORF1p, ORF2p, and L1 RNA in a RNP complex. We next used this system to assess how mutations in ORF1p and/or ORF2p impact RNP formation. Importantly, we demonstrate that mutations in the coiled-coil domain and RNA recognition motif of ORF1p, as well as the cysteine-rich domain of ORF2p, reduce the levels of ORF1p and/or ORF2p in L1 RNPs. Finally, we used this tagging strategy to localize the L1–encoded proteins and L1 RNA to cytoplasmic foci that often were associated with stress granules. Thus, we conclude that a precise interplay among ORF1p, ORF2p, and L1 RNA is critical for L1 RNP assembly, function, and L1 retrotransposition

    Bark anatomy, chemical composition and ethanol-water extract composition of Anadenanthera peregrina and Anadenanthera colubrina

    Get PDF
    The bark of Anadenanthera peregrina (L.) Speg and Anadenanthera colubrina (Vell.) Brenan were characterized in relation to anatomical and chemical features. The barks were similar and included a thin conducting phloem, a largely dilated and sclerified non-conducting phloem, and a rhyridome with periderms with thin phellem interspersed by cortical tissues. Only small differences between species were observed that cannot be used alone for taxonomic purposes. The summative chemical composition of A. peregrina and A. colubrina was respectively: 8.2% and 7.7% ash; 28.8% and 29.3% extractives; 2.4% and 2.6% suberin; and 18.9% lignin. The monosaccharide composition showed the predominance of glucose (on average 82% of total neutral sugars) and of xylose (9%). The ethanol-water extracts of A. peregrina and A. colubrina barks included a high content of phenolics, respectively: total phenolics 583 and 682 mg GAE/g extract; 148 and 445 mg CE/g extract; tannins 587 and 98 mg CE/g extract. The antioxidant activity was 238 and 269 mg Trolox/g extract. The barks of the Anadenanthera species are a potential source of polar extractives that will represent an important valorization and therefore contribute to improve the overall economic potential and sustainability of A. peregrina and A. colubrinainfo:eu-repo/semantics/publishedVersio

    In-Orbit Performance of the Space Telescope NINA and GCR Flux Measurements

    Full text link
    The NINA apparatus, on board the Russian satellite Resurs-01 n.4, has been in polar orbit since 1998 July 10, at an altitude of 840 km. Its main scientific task is to study the galactic, solar and anomalous components of cosmic rays in the energy interval 10--200 MeV/n. In this paper we present a description of the instrument and its basic operating modes. Measurements of Galactic Cosmic Ray spectra will also be shown.Comment: 38 pages, 10 figures, accepted for publication in the ApJ

    The Physics of the B Factories

    Get PDF
    This work is on the Physics of the B Factories. Part A of this book contains a brief description of the SLAC and KEK B Factories as well as their detectors, BaBar and Belle, and data taking related issues. Part B discusses tools and methods used by the experiments in order to obtain results. The results themselves can be found in Part C

    Alignment of the ALICE Inner Tracking System with cosmic-ray tracks

    Get PDF
    37 pages, 15 figures, revised version, accepted by JINSTALICE (A Large Ion Collider Experiment) is the LHC (Large Hadron Collider) experiment devoted to investigating the strongly interacting matter created in nucleus-nucleus collisions at the LHC energies. The ALICE ITS, Inner Tracking System, consists of six cylindrical layers of silicon detectors with three different technologies; in the outward direction: two layers of pixel detectors, two layers each of drift, and strip detectors. The number of parameters to be determined in the spatial alignment of the 2198 sensor modules of the ITS is about 13,000. The target alignment precision is well below 10 micron in some cases (pixels). The sources of alignment information include survey measurements, and the reconstructed tracks from cosmic rays and from proton-proton collisions. The main track-based alignment method uses the Millepede global approach. An iterative local method was developed and used as well. We present the results obtained for the ITS alignment using about 10^5 charged tracks from cosmic rays that have been collected during summer 2008, with the ALICE solenoidal magnet switched off.Peer reviewe
    corecore