14,088 research outputs found

Recommended from our members

Computational Strategies for Scalable Genomics Analysis.

Author: Shi Lizhen
Wang Zhong
Publication venue: eScholarship, University of California
Publication date: 01/01/2019

The revolution in next-generation DNA sequencing technologies is leading to explosive data growth in genomics, posing a significant challenge to the computing infrastructure and software algorithms used for genomics analysis. Various big data technologies have been explored to scale up or scale out current bioinformatics solutions for mining large genomics datasets. In this review, we survey some of these developments in the application of parallel distributed computing and specialized hardware to genomics. We comment on the pros and cons of each strategy in terms of ease of development, robustness, scalability, and efficiency. Although this review is written for an audience from the genomics and bioinformatics fields, it may also be informative for computer scientists interested in genomics applications.

eScholarship - University of California

Deconvolute individual genomes from metagenome sequences through short read clustering.

Author: Deng Li
Li Kexue
Lu Yakang
Shi Lizhen
Wang Lili
Wang Zhong
Publication venue: eScholarship, University of California
Publication date: 01/01/2020

Metagenome assembly from short next-generation sequencing data is a challenging process due to its large scale and computational complexity. Clustering short reads by species before assembly offers a unique opportunity for parallel downstream assembly of genomes with individualized optimization. However, current read clustering methods suffer from either false-negative (under-clustering) or false-positive (over-clustering) problems. Here we extended our previous read clustering software, SpaRC, by exploiting statistics derived from multiple samples in a dataset to reduce the under-clustering problem. Using synthetic and real-world datasets, we demonstrated that this method has the potential to cluster almost all of the short reads from genomes with sufficient sequencing coverage. The improved read clustering in turn leads to improved downstream genome assembly quality.

Bis(1,10-phenanthroline-κ²N,N′)(sulfato-κ²O,O′)cobalt(II) butane-2,3-diol monosolvate

Author: Batten
Kai-Long Zhong
Lu
Sheldrick
Shi-Juan Wang
Zhang
Zhong
Zhong
Zhong
Publication venue: International Union of Crystallography
Publication date: 01/04/2011

In the title compound, [Co(SO₄)(C₁₂H₈N₂)₂]·C₄H₁₀O₂, the Co²⁺ ion has a distorted octahedral coordination environment composed of four N atoms from two chelating 1,10-phenanthroline ligands and two O atoms from an O,O′-bidentate sulfate anion. The dihedral angle between the two chelating N₂C₂ groups is 83.48 (1)°. The Co²⁺ ion, the S atom and the mid-point of the central C—C bond of the butane-2,3-diol solvent molecule are situated on twofold rotation axes. The molecules of the complex and the solvent molecules are held together by pairs of symmetry-related O—H⋯O hydrogen bonds with the uncoordinated O atoms of the sulfate ions as acceptors. The solvent molecule is disordered over two sets of sites with site occupancies of 0.40 and 0.60.

Crossref

Directory of Open Access Journals

PubMed Central