216 research outputs found
Recommended from our members
PSSP-RFE: Accurate Prediction of Protein Structural Class by Recursive Feature Extraction from PSI-BLAST Profile, Physical-Chemical Property and Functional Annotations
Protein structure prediction is critical to functional annotation of the massively accumulated biological sequences, which prompts an imperative need for the development of high-throughput technologies. As a first and key step in protein structure prediction, protein structural class prediction becomes an increasingly challenging task. Amongst most homological-based approaches, the accuracies of protein structural class prediction are sufficiently high for high similarity datasets, but still far from being satisfactory for low similarity datasets, i.e., below 40% in pairwise sequence similarity. Therefore, we present a novel method for accurate and reliable protein structural class prediction for both high and low similarity datasets. This method is based on Support Vector Machine (SVM) in conjunction with integrated features from position-specific score matrix (PSSM), PROFEAT and Gene Ontology (GO). A feature selection approach, SVM-RFE, is also used to rank the integrated feature vectors through recursively removing the feature with the lowest ranking score. The definitive top features selected by SVM-RFE are input into the SVM engines to predict the structural class of a query protein. To validate our method, jackknife tests were applied to seven widely used benchmark datasets, reaching overall accuracies between 84.61% and 99.79%, which are significantly higher than those achieved by state-of-the-art tools. These results suggest that our method could serve as an accurate and cost-effective alternative to existing methods in protein structural classification, especially for low similarity datasets
Controlling the balance between remote, pinhole, and van der Waals epitaxy of Heusler films on graphene/sapphire
Remote epitaxy on monolayer graphene is promising for synthesis of highly
lattice mismatched materials, exfoliation of free-standing membranes, and
re-use of expensive substrates. However, clear experimental evidence of a
remote mechanism remains elusive. In many cases, due to contaminants at the
transferred graphene/substrate interface, alternative mechanisms such as
pinhole-seeded lateral epitaxy or van der Waals epitaxy can explain the
resulting exfoliatable single-crystalline films. Here, we find that growth of
the Heusler compound GdPtSb on clean graphene on sapphire substrates produces a
30 degree rotated epitaxial superstructure that cannot be explained by pinhole
or van der Waals epitaxy. With decreasing growth temperature the volume
fraction of this 30 degree domain increases compared to the direct epitaxial 0
degree domain, which we attribute to slower surface diffusion at low
temperature that favors remote epitaxy, compared to faster surface diffusion at
high temperature that favors pinhole epitaxy. We further show that careful
graphene/substrate annealing () and consideration of the
film/substrate vs film/graphene lattice mismatch are required to obtain epitaxy
to the underlying substrate for a variety of other Heusler films, including
LaPtSb and GdAuGe. The 30 degree rotated superstructure provides a possible
experimental fingerprint of remote epitaxy since it is inconsistent with the
leading alternative mechanisms
Recommended from our members
Towards pathway-centric cancer therapies via pharmacogenomic profiling analysis of ERK signalling pathway
Background: Genomic heterogeneity in human cancers complicates gene-centric personalized medicine. Malignant tumors often share a core group of pathways that are perturbed by diverse genetic mutations. Therefore, one possible solution to overcome the heterogeneity challenge is a shift from gene-centric to pathway-centric therapies. Pathway-centric perspectives, which underscore the need to understand key pathways and their critical properties, could address the complexity of cancer heterogeneity better than gene-centric approaches to aid cancer drug discovery and therapy. Methods: We used large-scale pharmacogenomic profiling data provided by the Cancer Genome Project of the Wellcome Trust Sanger Institute and the Cancer Cell Line Encyclopedia. In a systematic in silico investigation of ERK signalling pathway components and topological structures determines their influences on pathway activity and targeted therapies. Mann–Whitney U test was used to identify gene alterations associated with drug sensitivity with p values and Benjamini–Hochberg correction for multiple hypotheses testing. Results: The analysis demonstrated that genetic alterations were crucial to activation of effector pathway and subsequent tumorigenesis, however drug sensitivity suffered from both drug effector and non-effector pathways, which were determined by not only underlying genomic alterations, but also interplay and topological relationship of components in pathway, suggesting that the combinatorial targets of key nodes in perturbed pathways may yield better treatment outcome. Furthermore, we proposed a model to provide a more comprehensive insight and understanding of pathway-centric cancer therapies. Conclusions: Our study provides a holistic view of factors influencing drug sensitivity and sheds light on pathway-centric cancer therapies. Electronic supplementary material The online version of this article (doi:10.1186/s40169-015-0066-1) contains supplementary material, which is available to authorized users
Sequence-based identification of recombination spots using pseudo nucleic acid representation and recursive feature extraction by linear kernel SVM
BackgroundIdentification of the recombination hot/cold spots is critical for understanding the mechanism of recombination as well as the genome evolution process. However, experimental identification of recombination spots is both time-consuming and costly. Developing an accurate and automated method for reliably and quickly identifying recombination spots is thus urgently needed.ResultsHere we proposed a novel approach by fusing features from pseudo nucleic acid composition (PseNAC), including NAC, n-tier NAC and pseudo dinucleotide composition (PseDNC). A recursive feature extraction by linear kernel support vector machine (SVM) was then used to rank the integrated feature vectors and extract optimal features. SVM was adopted for identifying recombination spots based on these optimal features. To evaluate the performance of the proposed method, jackknife cross-validation test was employed on a benchmark dataset. The overall accuracy of this approach was 84.09%, which was higher (from 0.37% to 3.79%) than those of state-of-the-art tools.ConclusionsComparison results suggested that linear kernel SVM is a useful vehicle for identifying recombination hot/cold spots
Enhancement of Red Emission and Site Analysis in Eu2+ Doped New-Type Structure Ba3CaK(PO4)3 for Plant Growth White LEDs
MethylPurify: tumor purity deconvolution and differential methylation detection from single tumor DNA methylomes
We propose a statistical algorithm MethylPurify that uses regions with bisulfite reads showing discordant methylation levels to infer tumor purity from tumor samples alone. MethylPurify can identify differentially methylated regions (DMRs) from individual tumor methylome samples, without genomic variation information or prior knowledge from other datasets. In simulations with mixed bisulfite reads from cancer and normal cell lines, MethylPurify correctly inferred tumor purity and identified over 96% of the DMRs. From patient data, MethylPurify gave satisfactory DMR calls from tumor methylome samples alone, and revealed potential missed DMRs by tumor to normal comparison due to tumor heterogeneity. Electronic supplementary material The online version of this article (doi:10.1186/s13059-014-0419-x) contains supplementary material, which is available to authorized users
Exome Sequencing Identifies ZNF644 Mutations in High Myopia
Myopia is the most common ocular disorder worldwide, and high myopia in particular is one of the leading causes of blindness. Genetic factors play a critical role in the development of myopia, especially high myopia. Recently, the exome sequencing approach has been successfully used for the disease gene identification of Mendelian disorders. Here we show a successful application of exome sequencing to identify a gene for an autosomal dominant disorder, and we have identified a gene potentially responsible for high myopia in a monogenic form. We captured exomes of two affected individuals from a Han Chinese family with high myopia and performed sequencing analysis by a second-generation sequencer with a mean coverage of 30× and sufficient depth to call variants at ∼97% of each targeted exome. The shared genetic variants of these two affected individuals in the family being studied were filtered against the 1000 Genomes Project and the dbSNP131 database. A mutation A672G in zinc finger protein 644 isoform 1 (ZNF644) was identified as being related to the phenotype of this family. After we performed sequencing analysis of the exons in the ZNF644 gene in 300 sporadic cases of high myopia, we identified an additional five mutations (I587V, R680G, C699Y, 3′UTR+12 C>G, and 3′UTR+592 G>A) in 11 different patients. All these mutations were absent in 600 normal controls. The ZNF644 gene was expressed in human retinal and retinal pigment epithelium (RPE). Given that ZNF644 is predicted to be a transcription factor that may regulate genes involved in eye development, mutation may cause the axial elongation of eyeball found in high myopia patients. Our results suggest that ZNF644 might be a causal gene for high myopia in a monogenic form
- …