85 research outputs found

Consumer credit in comparative perspective

Author: Aalbers MB
Akos Rona-Tas
Alya Guseva
Andreeva G
Bar-Gill O
Carruthers BG
Casolaro L
Chung RK
Deleuze G
Doneda D
Durst J
Dwyer RE
Evans DS
Fourcade M
Frank R
Guseva A
Guérin I
Harvey D
Hrustič T
Hurley M
Ibragimova D
Jappelli T
Krenn K
Krippner GR
Kuzina O
Laferté G
Lazarus J
Lazzarato M
Lissowska M
Mandell L
Matuszyk A
Miller MJ
Minty S
Niemi J
Olcoń-Kubicka M
Pitluck AZ
Rajan RG
Ramsay I
Rona-Tas A
Rona-Tas A
Rona-Tas A
Schor J
Skiba PM
Spindler G
Stoesz D
Sullivan TA
Veblen T
Wherry FF
World Bank.
Publication venue: 'Annual Reviews'
Publication date: 01/07/2018

We review the literature in sociology and related fields on the fast global growth of consumer credit and debt and the possible explanations for this expansion. We describe the ways people interact with the strongly segmented consumer credit system around the world, specifically the way they access credit and the way they are held accountable for their debt. We then report on research on two areas in which consumer credit is consequential: its effects on social relations and on physical and mental health. Throughout the article, we point out national variations and discuss explanations for these differences. We conclude with a brief discussion of the future tasks and challenges of comparative research on consumer credit. Accepted manuscript.

Crossref

Boston University Institutional Repository (OpenBU)

Improving the performance of DomainDiscovery of protein domain boundary assignment using inter-domain linker index

Author: A Andreeva
Abdur R Sikder
Albert Y Zomaya
AR Sikder
FMG Pearl
G Pollastri
G Pollastri
HM Berman
J Cheng
J Liu
J Sim
JE Gewehr
L Kong
M Dumontier
M Suyama
N Nagarajan
OV Galzitskaya
RA George
RL Marsden
S Veretnik
SF Altschul
SJ Wheelan
T Joachims
TA Holland
V Vapnik
Publication venue: BioMed Central
Publication date: 01/01/2006

BACKGROUND: Knowledge of protein domain boundaries is critical for the characterisation and understanding of protein function. The ability to identify domains without the knowledge of the structure – by using sequence information only – is an essential step in many types of protein analyses. In the present study, we demonstrate that the performance of DomainDiscovery is improved significantly by including the inter-domain linker index value for domain identification from sequence-based information. Improved DomainDiscovery uses a Support Vector Machine (SVM) approach and a unique training dataset built on the principle of consensus among experts in defining domains in protein structure. The SVM was trained using a PSSM (Position Specific Scoring Matrix), secondary structure, solvent accessibility information and inter-domain linker index to detect possible domain boundaries for a target sequence. RESULTS: Improved DomainDiscovery is compared with other methods by benchmarking against a structurally non-redundant dataset and also CASP5 targets. Improved DomainDiscovery achieves 70% accuracy for domain boundary identification in multi-domain proteins. CONCLUSION: Improved DomainDiscovery compares favourably to the performance of other methods and excels in the identification of domain boundaries for multi-domain proteins as a result of introducing a support vector machine with the benchmark_2 dataset.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

A new measure for functional similarity of gene products based on Gene Ontology

Author: A Andreeva
AK Bjorklund
Andreas Schlicker
C Ganem
CH Wu
D Devos
D Devos
D Lin
DI Liao
DL Wheeler
E Camon
E Morgunova
F Spaltmann
Francisco S Domingues
FS Domingues
HM Berman
HW Mewes
I Friedberg
I Letunic
IG Choi
IUBMB
J Hou
J Ruiz-Herrera
JD Watson
JL Sevilla
Jörg Rahnenführer
L Stein
LJ Jensen
M Ashburner
M Fischer
M Park
M Remm
N Kaplan
N Speer
NJ Mulder
P Khatri
P Resnik
P Resnik
PH Lee
PW Lord
RD Finn
S Echt
S McGinnis
SG Tringe
SL Cao
T Gabaldon
T Hubbard
TA Tatusova
TA White
Thomas Lengauer
Publication venue: BioMed Central
Publication date: 01/01/2006

BACKGROUND: Gene Ontology (GO) is a standard vocabulary of functional terms and allows for coherent annotation of gene products. These annotations provide a basis for new methods that compare gene products regarding their molecular function and biological role. RESULTS: We present a new method for comparing sets of GO terms and for assessing the functional similarity of gene products. The method relies on two semantic similarity measures: sim(Rel) and funSim. One measure (sim(Rel)) is applied in the comparison of the biological processes found in different groups of organisms. The other measure (funSim) is used to find functionally related gene products within the same or between different genomes. Results indicate that the method, in addition to being in good agreement with established sequence similarity approaches, also provides a means for the identification of functionally related proteins independent of evolutionary relationships. The method is also applied to estimating functional similarity between all proteins in Saccharomyces cerevisiae and to visualizing the molecular function space of yeast in a map of the functional space. A similar approach is used to visualize the functional relationships between protein families. CONCLUSION: The approach enables the comparison of the underlying molecular biology of different taxonomic groups and provides a new comparative genomics tool identifying functionally related gene products independent of homology. The proposed map of the functional space provides a new global view on the functional relationships between gene products or protein families.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

MPG.PuRe

Prediction of catalytic residues using Support Vector Machine with selected protein sequence and structural properties

Author: A Andreeva
A Gutteridge
AH Elcock
AR Panchenko
B Lee
B Rost
BW Mathews
CA Innis
Cathy H Wu
CH Wu
DK Smith
GJ Bartlett
H Yao
HM Berman
IH Witten
JC Platt
JD Thompson
JS Milton
K Kinoshita
K Sjolander
M Ota
MA Hearst
MJ Ondrechen
Natalia V Petrova
O Lichtarge
P Aloy
PP Wangikar
R Kohavi
R Koradi
R Landgraf
RL Tatusov
S Chakravarty
S Jones
S Parthasarathy
S Zhu
SF Altschul
SJ Campbell
SJ Hubbard
TA Binkowski
W Kabsch
W Tian
WSJ Valdar
Publication venue: BioMed Central
Publication date: 01/01/2006

BACKGROUND: The number of protein sequences deriving from genome sequencing projects is outpacing our knowledge about the function of these proteins. With the gap between experimentally characterized and uncharacterized proteins continuing to widen, it is necessary to develop new computational methods and tools for functional prediction. Knowledge of catalytic sites provides a valuable insight into protein function. Although many computational methods have been developed to predict catalytic residues and active sites, their accuracy remains low, with a significant number of false positives. In this paper, we present a novel method for the prediction of catalytic sites, using a carefully selected, supervised machine learning algorithm coupled with an optimal discriminative set of protein sequence conservation and structural properties. RESULTS: To determine the best machine learning algorithm, 26 classifiers in the WEKA software package were compared using a benchmarking dataset of 79 enzymes with 254 catalytic residues in a 10-fold cross-validation analysis. Each residue of the dataset was represented by a set of 24 residue properties previously shown to be of functional relevance, as well as a label {+1/-1} to indicate catalytic/non-catalytic residue. The best-performing algorithm was the Sequential Minimal Optimization (SMO) algorithm, which is a Support Vector Machine (SVM). The Wrapper Subset Selection algorithm further selected seven of the 24 attributes as an optimal subset of residue properties, with sequence conservation, catalytic propensities of amino acids, and relative position on protein surface being the most important features. CONCLUSION: The SMO algorithm with 7 selected attributes correctly predicted 228 of the 254 catalytic residues, with an overall predictive accuracy of more than 86%. Missing only 10.2% of the catalytic residues, the method captures the fundamental features of catalytic residues and can be used as a "catalytic residue filter" to facilitate experimental identification of catalytic residues for proteins with known structure but unknown function.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Circular Permutation in Proteins

Author: A Andreeva
A Jeltsch
A Prlic
Andreas Prlić
AR Viguera
AV Cheltsov
BA Cunningham
CP Ponting
CP Ponting
DJ Bowles
DM Carrington
DP Goldenberg
DT Capraro
E Hazkani-Covo
GS Baird
H Bruhn
H Einspahr
J Jung
J Lee
J Weiner
J Weiner
JM Bujnicki
JM Thornton
K Guruprasad
K Luger
L Wang
M Shatsky
M Zuker
NJ Turner
O Bachar
P Zhang
PT Beernink
Q Kaas
S Topell
S Uliel
Shoshana Wodak
Spencer Bliven
T Schmidt-Goenner
TA Whitehead
WC Lo
WC Lo
Y Hatefi
Y Yu
YM Huang
Z Qian
Publication venue: Public Library of Science
Publication date: 29/03/2012

This is a “Topic Page” article for PLoS Computational Biology. Circular permutation describes a type of relationship between proteins, whereby the proteins have a changed order of amino acids in their protein sequence, such that the sequence of the first portion of one protein (adjacent to the N-terminus) is related to that of the second portion of the other protein (near its C-terminus), and vice versa (see Figure 1). This is directly analogous to the mathematical notion of a cyclic permutation over the set of residues in a protein. Circular permutation can be the result of evolutionary events, post-translational modifications, or artificially engineered mutations. The result is a protein structure with different connectivity, but overall similar three-dimensional (3D) shape. The homology between portions of the proteins can be established by observing similar sequences between N- and C-terminal portions of the tw

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

BSSF: a fingerprint based ultrafast binding site similarity search and function analysis server

Author: Bing Xiong
Jie Wu
David L Burk
Mengzhu Xue
Hualiang Jiang
Jingkang Shen
WA Warr
A Kouranov
A Godzik
OC Redfern
SG Buchanan
K Lundstrom
DF Veber
D Lee
SF Altschul
A Bateman
BE Engelhardt
J Soding
C Chothia
L Holm
AG Murzin
CA Orengo
A Andreeva
TA Binkowski
GJ Kleywegt
RA Laskowski
RB Russell
S Schmitt
A Shulman-Peleg
AC Wallace
T Hamelryck
M Ashburner
P Willett
HM Berman
GP Brady
WR Pearson
A Gutteridge
T Fawcett
ND Gold
J Blaszczyk
K Yeturu
RA Laskowski
L Xie
MP Liang
M Brylinski
XY Jiang
D Pal
Publication venue: BioMed Central
Publication date: 01/01/2010

BACKGROUND: Genome sequencing and post-genomics projects such as structural genomics are extending the frontier of the study of the sequence-structure-function relationship of genes and their products. Although many sequence/structure-based methods have been devised with the aim of deciphering this delicate relationship, large gaps remain in this fundamental problem, which continuously drives researchers to develop novel methods to extract relevant information from sequences and structures and to infer the functions of newly identified genes by genomics technology. RESULTS: Here we present an ultrafast method, named BSSF (Binding Site Similarity & Function), which enables researchers to conduct similarity searches in a comprehensive three-dimensional binding site database extracted from PDB structures. This method utilizes a fingerprint representation of the binding site and a validated statistical Z-score function scheme to judge the similarity between the query and database items, even if their similarities are confined to a sub-pocket. This fingerprint-based similarity measurement was also validated on a known binding site dataset by comparing with geometric hashing, which is a standard 3D similarity method. The comparison clearly demonstrated the utility of this ultrafast method. After the database search, the hit list is further analyzed to provide basic statistical information about the occurrences of Gene Ontology terms and Enzyme Commission numbers, which may benefit researchers by helping them to design further experiments to study the query proteins. CONCLUSIONS: This ultrafast web-based system will not only help researchers interested in drug design and structural genomics to identify similar binding sites, but also assist them by providing further analysis of the hit list from database searching.

Queen's University Belfast Research Portal

Crossref

Southampton (e-Prints Soton)

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Online Research Database In Technology

BSSF: a fingerprint based ultrafast binding site similarity search and function analysis server

Author: A Andreeva
A Bateman
A Godzik
A Gutteridge
A Kouranov
A Shulman-Peleg
AC Wallace
AG Murzin
BE Engelhardt
Bing Xiong
C Chothia
CA Orengo
D Lee
D Pal
David L Burk
DF Veber
GJ Kleywegt
GP Brady
HM Berman
Hualiang Jiang
J Blaszczyk
J Soding
Jie Wu
Jingkang Shen
K Lundstrom
K Yeturu
L Holm
L Xie
M Ashburner
M Brylinski
Mengzhu Xue
MP Liang
ND Gold
OC Redfern
P Willett
RA Laskowski
RA Laskowski
RB Russell
S Schmitt
SF Altschul
SG Buchanan
T Fawcett
T Hamelryck
TA Binkowski
WA Warr
WR Pearson
XY Jiang
Publication venue: BioMed Central
Publication date: 01/01/2010

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Structural genomics is the largest contributor of novel structural leverage

Author: A Andreeva
A Bhattacharya
A Grant
A Harrison
Adam Godzik
AG Murzin
Andras Fiser
Andrei Kouranov
Burkhard Rost
C Chothia
C Sander
C Yeats
CA Orengo
Christine Orengo
CM Fraser-Liggett
Gaetano T. Montelione
GW Tyson
H Berman
HM Berman
IYY Koh
J Kopp
J Liu
J Liu
J Liu
J Liu
J Moult
J Moult
JC Norvell
JD Watson
Jinfeng Liu
JM Chandonia
John K. Everett
L Chen
Lukasz Jaroszewski
M Gerstein
M Levitt
MA Marti-Renom
MA Marti-Renom
N Fernandez-Fuentes
OC Redfern
PE Bourne
R Apweiler
R Nair
Rajesh Nair
RL Marsden
S Yooseph
Ta-Tsen Soong
Thomas B. Acton
U Pieper
U Pieper
Publication venue: Springer Netherlands
Publication date: 01/01/2009

The Protein Structure Initiative (PSI) at the US National Institutes of Health (NIH) is funding four large-scale centers for structural genomics (SG). These centers systematically target many large families without structural coverage, as well as very large families with inadequate structural coverage. Here, we report a few simple metrics that demonstrate how successfully these efforts optimize structural coverage: while the PSI-2 (2005-now) contributed more than 8% of all structures deposited into the PDB, it contributed over 20% of all novel structures (i.e. structures for protein sequences with no structural representative in the PDB on the date of deposition). The structural coverage of the protein universe represented by today’s UniProt (v12.8) has increased linearly from 1992 to 2008; structural genomics has contributed significantly to the maintenance of this growth rate. Success in increasing novel leverage (defined in Liu et al. in Nat Biotechnol 25:849–851, 2007) has resulted from systematic targeting of large families. PSI’s per-structure contribution to novel leverage was over 4-fold higher than that for non-PSI structural biology efforts during the past 8 years. If the success of the PSI continues, it may take just another ~15 years to cover most sequences in the current UniProt database.

Crossref

Springer - Publisher Connector

PubMed Central

UCL Discovery

eScholarship - University of California

A Global Characterization and Identification of Multifunctional Enzymes

Author: A Aharoni
A Andreeva
A Bairoch
A Gomez
AE Todd
B Moore
CH Ding
CJ Jeffery
CJ Jeffery
CJ Jeffery
CJ Jeffery
CZ Cai
DH Huberts
E Zientz
H Tochio
Hai-Lei Zhang
Hao Wang
HC Lee
Hong-Huang Lin
Jing-Xian Zhang
JM Elkins
JM Peregrin-Alvarez
K Hult
KA Canada
KF Aoki-Kinoshita
M Magrane
NA Timofeyeva
Olivier Lespinet
P Carbonell
P Jiang
P Marchot
P Prasannan
P Prasannan
Quan Zou
R Breitling
RA Jensen
RD Finn
S Schmidt
SD Copley
SF Altschul
Shi-Chang Hu
TA Mohammad
TR Whitehead
U Genschel
V Cherkassky
V Sakanyan
W Zheng
Wei-Juan Huang
Xian-Ying Cheng
YH Chen
Yu-Zong Chen
Zhi-Liang Ji
Publication venue: Public Library of Science
Publication date: 18/06/2012

Multi-functional enzymes are enzymes that perform multiple physiological functions. Characterization and identification of multi-functional enzymes are critical for communication and cooperation between different functions and pathways within a complex cellular system or between cells. In the present study, we collected 6,799 literature-reported multi-functional enzymes and systematically characterized them in structural, functional, and evolutionary aspects. It was found that four physicochemical properties, that is, charge, polarizability, hydrophobicity, and solvent accessibility, are important for characterization of multi-functional enzymes. Accordingly, a combinational model of support vector machine and random forest was constructed, based on which 6,956 potential novel multi-functional enzymes were identified from the ENZYME database. Moreover, it was observed that multi-functional enzymes are unevenly distributed across species, and that Bacteria have relatively more multi-functional enzymes than Archaebacteria and Eukaryota. Comparative analysis indicated that the multi-functional enzymes experienced a fluctuation of gene gain and loss during the evolution from S. cerevisiae to H. sapiens. Further pathway analyses indicated that a majority of multi-functional enzymes were well preserved in catalyzing several essential cellular processes, for example, metabolism of carbohydrates, nucleotides, and amino acids. In addition, a database of known multi-functional enzymes and a server for novel multi-functional enzyme prediction were constructed for free access at http://bioinf.xmu.edu.cn/databases/MFEs/index.htm

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

ScholarBank@NUS

Xiamen University Institutional Repository

FigShare

The Phosphatomes of the Multicellular Myxobacteria Myxococcus xanthus and Sorangium cellulosum in Comparison with Other Prokaryotic Genomes

Author: A Clemente-Blanco
A Iwanicki
A Treuner-Lange
Anke Treuner-Lange
AV Andreeva
BS Goldman
CC Zhang
CJ Leonard
D Bordo
D Missiakas
DL Johnson
E Madec
EV Koonin
G Mittenhuber
Geraldine Butler
H Nariya
H Nariya
H Reichenbach
I Mijakovic
J Pane-Farre
J Perez
JD Mougous
JD Thompson
JM Wilkes
K Gerth
KD Singh
L Aravind
L Aravind
L Shi
L Shi
LJ Shimkets
LY Geer
M Bollen
M Ventura
MS Bennett
N Saitou
P Bork
P Cohen
PJ Kennelly
PJ Kennelly
PT Cohen
PT Cohen
R Boutros
R Garcia-Hernandez
R Li
R Li
RD Finn
RD Finn
RD Page
RL Tatusov
S Inouye
S Schneiker
SF Altschul
SK Hanks
SK Hanks
T Searls
TA Gaidenko
V Buttani
W Ludwig
W Zhang
Y Kimura
Y Kimura
Y Mechulam
Y Shi
Publication venue: Public Library of Science
Publication date: 01/01/2009

BACKGROUND: Analysis of the complete genomes from the multicellular myxobacteria Myxococcus xanthus and Sorangium cellulosum identified the highest number of eukaryotic-like protein kinases (ELKs) compared to all other genomes analyzed. High numbers of protein phosphatases (PPs) could therefore be anticipated, as reversible protein phosphorylation is a major regulation mechanism of fundamental biological processes. METHODOLOGY: Here we report an intensive analysis of the phosphatomes of M. xanthus and S. cellulosum in which we constructed phylogenetic trees to position these sequences relative to PPs from other prokaryotic organisms. PRINCIPAL FINDINGS: Predominant observations were: (i) M. xanthus and S. cellulosum possess predominantly Ser/Thr PPs; (ii) S. cellulosum encodes the highest number of PP2c-type phosphatases so far reported for a prokaryotic organism; (iii) in contrast to M. xanthus, only S. cellulosum encodes high numbers of SpoIIE-like PPs; (iv) there is a significant lack of synteny among M. xanthus and S. cellulosum; and (v) the degree of co-organization between kinase and phosphatase genes is extremely low in these myxobacterial genomes. CONCLUSIONS: We conclude that there has been a greater expansion of ELKs than PPs in multicellular myxobacteria.

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central