Search CORE

248 research outputs found

Coastal offshore of Novaya Zemlya Island, relief and sediments : extended abstract

Author: Nikiforov S.
Pavlidis Yu.
Rachold Volker
Publication venue: AWI, Alfred-Wegener-Institut für Polar- und Meeresforschung
Publication date: 01/01/2005
Field of study

OceanRep

TSV-Based Hairpin Bandpass Filter for 6G Mobile Communication Applications

Author: Ke Lei
Pavlidis Vasilis
Wang Fengjuan
Yang Yuan
Yin Xiangkun
Yu Ningmei
Publication venue: 'Institute of Electronics, Information and Communications Engineers (IEICE)'
Publication date: 10/08/2021
Field of study

The University of Manchester - Institutional Repository

Proteome-Wide Prediction of Novel DNA/RNA-Binding Proteins Using Amino Acid Composition and Periodicity in the Hyperthermophilic Archaeon Pyrococcus furiosus

Author: Ahmad
Akio Kanai
Altschul
Amiri
Anantharaman
Apweiler
Bairoch
Bateman
Bhardwaj
Cai
Cai
Camon
Cotton
Date
Garcia-Garcia
Gatherer
Goldbaum
Han
Han
Haruo Suzuki
Jordan
Kanai
Kanai
Kim
Kosuke Fujishima
Laskin
Manival
Masaru Tomita
Matthews
McLaughlin
Mizuki Komasa
Ofran
Pavlidis
Pavlidis
Pazos
Pruitt
Robb
Sarda
Sato
Sayaka Kitamura
Shanahan
Siew
Turnage
Xie
Yu
Yu
Publication venue: Oxford University Press
Publication date
Field of study

Proteins play a critical role in complex biological systems, yet about half of the proteins in publicly available databases are annotated as functionally unknown. Proteome-wide functional classification using bioinformatics approaches thus is becoming an important method for revealing unknown protein functions. Using the hyperthermophilic archaeon Pyrococcus furiosus as a model species, we used the support vector machine (SVM) method to discriminate DNA/RNA-binding proteins from proteins with other functions, using amino acid composition and periodicities as feature vectors. We defined this value as the composition score (CO) and periodicity score (PD). The P. furiosus proteins were classified into three classes (I–III) on the basis of the two-dimensional correlation analysis of CO score and PD score. As a result, approximately 87% of the functionally known proteins categorized as class I proteins (CO score + PD score > 0.6) were found to be DNA/RNA-binding proteins. Applying the two-dimensional correlation analysis to the 994 hypothetical proteins in P. furiosus, a total of 151 proteins were predicted to be novel DNA/RNA-binding protein candidates. DNA/RNA-binding activities of randomly chosen hypothetical proteins were experimentally verified. Six out of seven candidate proteins in class I possessed DNA/RNA-binding activities, supporting the efficacy of our method

Crossref

PubMed Central

A Case of Gastric Adenocarcinoma Presenting as Meningeal Carcinomatosis

Author: Bora Lee
Braeuninger
Byoung Jo Suh
Chang
Glantz
Hang Jong Yu
Hong Gi Lee
Kim
Kim
Kim
Kim
Larson
Lassman
Lee
Lisenko
Min
Park
Pavlidis
Sang Min Kim
van Oostenbrugge
Wagemakers
Publication venue: The Korean Association of Internal Medicine
Publication date: 01/01/2007
Field of study

Leptomeningeal carcinomatosis occurs in approximately 5% of patients with cancer. The most common cancers involving the leptomeninges are breast, lung cancer and melanoma. However, gastric adenocarcinoma has been rarely reported with leptomeningeal carcinomatosis. The presenting manifestations are usually headache, visual disturbances and seizures. We report a case of leptomeningeal metastasis that presented as a gastric cancer. A 49-year-old woman was admitted to our hospital with the symptoms of headache and melena for 10 days. The endoscopy showed a thickening of the folds of the stomach compatible with the diagnosis of a Borrman type IV gastric cancer. The biopsy revealed a signet ring cell carcinoma. The MRI of brain showed no abnormal findings; however, the patient complained of an intractable persistent headache, nausea and vomiting on admission day 6. The cytology examination of the cerebrospinal fluid supported the diagnosis of metastatic signet ring cell carcinoma

Crossref

PubMed Central

Identification of disease-causing genes using microarray data mining and gene ontology

Author: A Mohammadi
A Zhang
AA Alizadeh
Azadeh Mohammadi
B Duval
BF Souza
C Ambroise
C Ding
C Tago
D Lin
D Singh
E Martinez
FM Couto
I Guyon
I Inza
J Jaeger
JJ Jiang
L Li
L Yu
L Ziaei
Mansoor Salehi
Mohammad H Saraee
N Cristianini
P Pavlidis
P Resnik
PA Mundra
PA Mundra
PJ Park
R Genuer
RF Weaver
S Li
S Li
TM Huang
TR Golub
TS Furey
U Alon
W Xu
Y Ding
Y Saeys
Y Wang
YL Chin
Z Xie
Z Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011
Field of study

Background: One of the best and most accurate methods for identifying disease-causing genes is monitoring gene expression values in different samples using microarray technology. One of the shortcomings of microarray data is that they provide a small quantity of samples with respect to the number of genes. This problem reduces the classification accuracy of the methods, so gene selection is essential to improve the predictive accuracy and to identify potential marker genes for a disease. Among numerous existing methods for gene selection, support vector machine-based recursive feature elimination (SVMRFE) has become one of the leading methods, but its performance can be reduced because of the small sample size, noisy data and the fact that the method does not remove redundant genes. Methods: We propose a novel framework for gene selection which uses the advantageous features of conventional methods and addresses their weaknesses. In fact, we have combined the Fisher method and SVMRFE to utilize the advantages of a filtering method as well as an embedded method. Furthermore, we have added a redundancy reduction stage to address the weakness of the Fisher method and SVMRFE. In addition to gene expression values, the proposed method uses Gene Ontology which is a reliable source of information on genes. The use of Gene Ontology can compensate, in part, for the limitations of microarrays, such as having a small number of samples and erroneous measurement results. Results: The proposed method has been applied to colon, Diffuse Large B-Cell Lymphoma (DLBCL) and prostate cancer datasets. The empirical results show that our method has improved classification performance in terms of accuracy, sensitivity and specificity. In addition, the study of the molecular function of selected genes strengthened the hypothesis that these genes are involved in the process of cancer growth. Conclusions: The proposed method addresses the weakness of conventional methods by adding a redundancy reduction stage and utilizing Gene Ontology information. It predicts marker genes for colon, DLBCL and prostate cancer with a high accuracy. The predictions made in this study can serve as a list of candidates for subsequent wet-lab verification and might help in the search for a cure for cancers

University of Salford Institutional Repository

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Subcellular location prediction of proteins using support vector machines with alignment of block sequences utilizing amino acid composition

Author: A Ben–Hur
A Reinhardt
BW Matthews
C Guda
C Leslie
CS Yu
CS Yu
CS Yu
H Nielsen
J Cedano
J Guo
K Nakai
KC Chou
KC Chou
KC Chou
KC Chou
KJ Park
M Bhasin
M Bhasin
M Bhasin
M Kumar
M Reczko
O Emanuelsson
O Emanuelsson
P Horton
P Horton
P Horton
P Pavlidis
R Nair
S Hua
S Matsuda
Takeyuki Tamura
Tatsuya Akutsu
YD Cai
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Background: Subcellular location prediction of proteins is an important and well-studied problem in bioinformatics. This is a problem of predicting which part in a cell a given protein is transported to, where an amino acid sequence of the protein is given as an input. This problem is becoming more important since information on subcellular location is helpful for annotation of proteins and genes and the number of complete genomes is rapidly increasing. Since existing predictors are based on various heuristics, it is important to develop a simple method with high prediction accuracies. Results: In this paper, we propose a novel and general predicting method by combining techniques for sequence alignment and feature vectors based on amino acid composition. We implemented this method with support vector machines on plant data sets extracted from the TargetP database. Through fivefold cross validation tests, the obtained overall accuracies and average MCC were 0.9096 and 0.8655 respectively. We also applied our method to other datasets including that of WoLF PSORT. Conclusion: Although there is a predictor which uses the information of gene ontology and yields higher accuracy than ours, our accuracies are higher than existing predictors which use only sequence information. Since such information as gene ontology can be obtained only for known proteins, our predictor is considered to be useful for subcellular location prediction of newly-discovered proteins. Furthermore, the idea of combination of alignment and amino acid frequency is novel and general so that it may be applied to other problems in bioinformatics. Our method for plant is also implemented as a web-system and available on http://sunflower.kuicr.kyoto-u.ac.jp/~tamura/slpfa.html webcite

Crossref

Springer - Publisher Connector

PubMed Central

Kyoto University Research Information Repository

Inferring Pathway Activity toward Precise Disease Classification

Author: A Agresti
A Bhattacharjee
A Subramanian
AA Alizadeh
AH Bild
B Tian
CL Banka
DG Beer
Doheon Lee
E Segal
EJ Yeoh
Eunjung Lee
Greg Tucker-Kellogg
GV Glinsky
Han-Yu Chuang
HY Chuang
J Chen
J Lapointe
JA Swets
Jong-Won Kim
JP Svensson
JP Vert
KM Mani
L Ein-Dor
L Tian
LJ van 't Veer
MJ van de Vijver
P Pavlidis
P Pavlidis
R Sharan
RA Fisher
RA Gatenby
RA Gatenby
S Draghici
S Efroni
S Ramaswamy
SA Tomlins
SS Gambhir
SW Doniger
T Breslin
T Ideker
TR Golub
Trey Ideker
VK Mootha
WF Symmans
Y Saeys
Y Wang
Z Guo
Publication venue: Public Library of Science
Publication date: 01/01/2008
Field of study

The advent of microarray technology has made it possible to classify disease states based on gene expression profiles of patients. Typically, marker genes are selected by measuring the power of their expression profiles to discriminate among patients of different disease states. However, expression-based classification can be challenging in complex diseases due to factors such as cellular heterogeneity within a tissue sample and genetic heterogeneity across patients. A promising technique for coping with these challenges is to incorporate pathway information into the disease classification procedure in order to classify disease based on the activity of entire signaling pathways or protein complexes rather than on the expression levels of individual genes or proteins. We propose a new classification method based on pathway activities inferred for each patient. For each pathway, an activity level is summarized from the gene expression levels of its condition-responsive genes (CORGs), defined as the subset of genes in the pathway whose combined expression delivers optimal discriminative power for the disease phenotype. We show that classifiers using pathway activity achieve better performance than classifiers based on individual gene expression, for both simple and complex case-control studies including differentiation of perturbed from non-perturbed cells and subtyping of several different kinds of cancer. Moreover, the new method outperforms several previous approaches that use a static (i.e., non-conditional) definition of pathways. Within a pathway, the identified CORGs may facilitate the development of better diagnostic markers and the discovery of core alterations in human disease

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

LipocalinPred: a SVM-based method for prediction of lipocalins

Author: A Ben-Hur
A Garg
A Garg
A Sali
AS Martin Vogt
B Adam
C Leslie
D Holloway
D Plewczynski
Dinesh Gupta
DR Flower
DR Flower
G Wang
H Saiga
H Saigo
J Ahnstrom
J Duan
J Hull-Thompson
J Thorsten
JA Swets
Jayashree Ramana
LJ McGuffin
M Sieber
M Zervakis
NV Vapnik
P Pavlidis
R Rajakariar
S Ahmad
S Arne
SF Altschul
SR Eddy
W Deng
X Yu
YR Chan
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background Functional annotation of rapidly amassing nucleotide and protein sequences presents a challenging task for modern bioinformatics. This is particularly true for protein families sharing extremely low sequence identity, as for lipocalins, a family of proteins with varied functions and great diversity at the sequence level, yet conserved structures. Results In the present study we propose a SVM based method for identification of lipocalin protein sequences. The SVM models were trained with the input features generated using amino acid, dipeptide and secondary structure compositions as well as PSSM profiles. The model derived using both PSSM and secondary structure emerged as the best model in the study. Apart from achieving a high prediction accuracy (>90% in leave-one-out), lipocalinpred correctly differentiates closely related fatty acid-binding proteins and triabins as non-lipocalins. Conclusion The method offers a promising approach as a lipocalin prediction tool, complementing PROSITE, Pfam and homology modelling methods.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Testing for Associations between Loci and Environmental Gradients Using Latent Factor Mixed Models

Author: Akey
Alexander
Barrett
Beaumont
Beaumont
Berry
Carvalho
Chen
Coop
Darwin
Durand
Eckart
Eckert
Eden
Endler
Engelhardt
Eric Frichot
Frichot
Fumagalli
Guillaume Bouchard
Haldane
Hancock
Hancock
Harmon
Hijmans
Hindorff
Hornbeck
Hudson
Jay
Jolliffe
Joost
Kelley
Koren
Lawson
Legendre
Lenormand
Li
Manel
Meirmans
Nei
Nielsen
Novembre
Olivier François
Patterson
Pavlidis
Poncet
Price
Price
Pritchard
Pritchard
Prugnolle
R Development Core Team
Saccone
Salakhutdinov
Schoville
Sean D. Schoville
Smouse
Storz
Storz
Thibert-Plante
Tipping
West
Williams
Young
Yu
Zhou
Publication venue: 'Oxford University Press (OUP)'
Publication date: 29/03/2013
Field of study

Adaptation to local environments often occurs through natural selection acting on a large number of loci, each having a weak phenotypic effect. One way to detect these loci is to identify genetic polymorphisms that exhibit high correlation with environmental variables used as proxies for ecological pressures. Here, we propose new algorithms based on population genetics, ecological modeling, and statistical learning techniques to screen genomes for signatures of local adaptation. Implemented in the computer program "latent factor mixed model" (LFMM), these algorithms employ an approach in which population structure is introduced using unobserved variables. These fast and computationally efficient algorithms detect correlations between environmental and genetic variation while simultaneously inferring background levels of population structure. Comparing these new algorithms with related methods provides evidence that LFMM can efficiently estimate random effects due to population history and isolation-by-distance patterns when computing gene-environment correlations, and decrease the number of false-positive associations in genome scans. We then apply these models to plant and human genetic data, identifying several genes with functions related to development that exhibit strong correlations with climatic gradients.Comment: 29 pages with 8 pages of Supplementary Material (V2 revised presentation and results part

arXiv.org e-Print Archive

Crossref

Hal - Université Grenoble Alpes

PubMed Central

ProDiGe: Prioritization Of Disease Genes with multitask machine learning from positive and unlabeled examples

Author: A Su
B Brancotte
B Calvo
B Linghu
B Liu
B Schölkopf
B Schölkopf
B Schölkopf
C Giallourakis
C Perez-Iratxeta
C Son
CC Chang
EA Adie
F Denis
F Mordelet
Fantine Mordelet
FS Turner
G Lanckriet
GRG Lanckriet
J Freudenberg
Jean-Philippe Vert
K Bleakley
K Lage
L Jacob
L Jacob
LC Tranchevent
M van Driel
N López-Bigas
N Tiffin
O Vanunu
P Pavlidis
RI Kondor
S Aerts
S Köhler
S Yu
T De Bie
T Evgeniou
T Hwang
U Ala
V McKusick
X Wu
Y Yamanishi
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Elucidating the genetic basis of human diseases is a central goal of genetics and molecular biology. While traditional linkage analysis and modern high-throughput techniques often provide long lists of tens or hundreds of disease gene candidates, the identification of disease genes among the candidates remains time-consuming and expensive. Efficient computational methods are therefore needed to prioritize genes within the list of candidates, by exploiting the wealth of information available about the genes in various databases. Results We propose ProDiGe, a novel algorithm for Prioritization of Disease Genes. ProDiGe implements a novel machine learning strategy based on learning from positive and unlabeled examples, which allows to integrate various sources of information about the genes, to share information about known disease genes across diseases, and to perform genome-wide searches for new disease genes. Experiments on real data show that ProDiGe outperforms state-of-the-art methods for the prioritization of genes in human diseases. Conclusions ProDiGe implements a new machine learning paradigm for gene prioritization, which could help the identification of new disease genes. It is freely available at <url>http://cbio.ensmp.fr/prodige</url>.</p

arXiv.org e-Print Archive

Crossref

Springer - Publisher Connector

Directory of Open Access Journals