73 research outputs found

Establishing the precise evolutionary history of a gene improves prediction of disease-causing missense mutations

Author: Alexander O. Reznik
CA Wassif
Daniel S. Ory
DE Sleat
DM Jordan
F Chang
FD Porter
GR Oliver
H Börnig
H Jahnova
HJ Kwon
HR Davis Jr
IA Adzhubei
Igor B. Zhulin
JA Tennessen
JD Retief
JE Dickerson
JM White
K Katoh
KA King
M Lynch
M Stampfer
MC Patterson
MT Vanier
NO Stitziel
O Adebali
Ogun Adebali
PC Ng
RD Finn
S Castellana
S Guindon
S Nusca
SB Ng
SF Altschul
SH Katsanis
SR Sunyaev
U Omasits
X Jiang
X Yan
Y Choi
Z Wang
Publication venue: Digital Commons@Becker
Publication date: 01/01/2016

PURPOSE: Predicting the phenotypic effects of mutations has become an important application in clinical genetic diagnostics. Computational tools evaluate the behavior of the variant over evolutionary time and assume that variations seen during the course of evolution are probably benign in humans. However, current tools do not take into account orthologous/paralogous relationships. Paralogs have dramatically different roles in Mendelian diseases. For example, whereas inactivating mutations in the NPC1 gene cause the neurodegenerative disorder Niemann-Pick C, inactivating mutations in its paralog NPC1L1 are not disease-causing and, moreover, are implicated in protection from coronary heart disease. METHODS: We identified major events in NPC1 evolution and revealed and compared orthologs and paralogs of the human NPC1 gene through phylogenetic and protein sequence analyses. We predicted whether an amino acid substitution affects protein function by reducing the organism’s fitness. RESULTS: Removing the paralogs and distant homologs improved the overall performance of categorizing disease-causing and benign amino acid substitutions. CONCLUSION: The results show that a thorough evolutionary analysis followed by identification of orthologs improves the accuracy in predicting disease-causing missense mutations. We anticipate that this approach will be used as a reference in the interpretation of variants in other genetic diseases as well. Genet Med 18(10), 1029–1036.

Crossref

Digital Commons@Becker

PubMed Central

Three-Dimensional Phylogeny Explorer: Distinguishing paralogs, lateral transfer, and violation of "molecular clock" assumption with 3D visualization

Author: AJ Saldanha
Christopher Lee
CM Zmasek
CS Parr
DL Swofford
DL Wheeler
EV Koonin
G Trooskens
JD Retief
M Stallmann
MJ Sanderson
Namshin Kim
PL Lott
R Chenna
RD Page
RL Tatusov
S Kumar
SW Graham
Y Zhai
Z Du
Publication venue: BioMed Central
Publication date: 01/06/2007

BACKGROUND: Construction and interpretation of phylogenetic trees has been a major research topic for understanding the evolution of genes. Increases in sequence data and complexity are creating a need for more powerful and insightful tree visualization tools. RESULTS: We have developed 3D Phylogeny Explorer (3DPE), a novel phylogeny tree viewer that maps trees onto three spatial axes (species on the X-axis; paralogs on Z; evolutionary distance on Y), enabling one to distinguish at a glance evolutionary features such as speciation; gene duplication and paralog evolution; lateral gene transfer; and violation of the "molecular clock" assumption. Users can input any tree on the online 3DPE, then rotate, scroll, rescale, and explore it interactively as "live" 3D views. All objects in 3DPE are clickable to display subtrees, connectivity path highlighting, sequence alignments, gene summary views, and so on. To illustrate the value of this visualization approach for microbial genomes, we also generated 3D phylogeny analyses for all clusters from the public COG database. We constructed tree views using well-established methods and graph algorithms. We used Scientific Python to generate VRML2 3D views viewable in any web browser. CONCLUSION: 3DPE provides a novel method for projecting phylogenetic trees into 3D space, together with a web-based implementation with live 3D features for reconstructing phylogenetic trees of the COG database.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Validation of an NSP-based (negative selection pattern) gene family identification strategy

Author: A Force
A Kalyanaraman
A Tatiana
B Korber
B Papp
C Tian
Cyriac Kandoth
EA Gaucher
Fikret Ercal
G Zhang
JA Tate
JD Retief
JS Taylor
M Nei
M Suyama
MD Adams
Q Liu
RL Frank
Ronald L Frank
RS Schwarz
RT Nelson
S Lockton
SB Cannon
SH Nagaraj
SH Shiu
T Bie
T Fuchs
T Nakano
T Ota
TR Gregory
VA Albert
X Huang
Y Van de Peer
Publication venue: BioMed Central
Publication date: 01/01/2008

BACKGROUND: Gene family identification from ESTs can be a valuable resource for analysis of genome evolution but presents unique challenges in organisms for which the entire genome is not yet sequenced. We have developed a novel gene family identification method based on negative selection patterns (NSP) between family members to screen EST-generated contigs. This strategy was tested on five known gene families in Arabidopsis to see if individual paralogs could be identified with accuracy from EST data alone when compared to the actual gene sequences in this fully sequenced genome. RESULTS: The NSP method uniquely identified family members in all the gene families tested. Two members of the FtsH gene family, three members each of the PAL, RF1, and ribosomal L6 gene families, and four members of the CAD gene family were correctly identified. Additionally, all ESTs from the representative contigs, when checked against MapViewer data, successfully identified the predicted gene locus. CONCLUSION: We demonstrate the effectiveness of the NSP strategy in identifying specific gene family members in Arabidopsis using only EST data, and we describe how this strategy can be used to identify many gene families in agronomically important crop species where they are as yet undiscovered.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Missouri University of Science and Technology (Missouri S&T): Scholars' Mine

Nephele: genotyping via complete composition vectors and MapReduce

Author: A Drummond
A McKenna
A Rambaut
AL Ghindilis
AS De Groot
AS Fauci
B Budowle
BJ Frey
C Macken
C Notredame
C Ranger
CA Cummings
CB Do
D Janies
D Wang
E Gabriel
EC Holmes
G Giribet
G Lin
G Lu
HL Yang
IM Wallace
J Bullard
J Dean
JC Wilgenbusch
JD Retief
JD Thompson
KH Chu
KS Li
L Campitelli
L Gao
L Stuyver
Lynette Hirschman
M Colosimo
M Li
M Lindh
Marc E Colosimo
Matthew W Peterson
MC Schatz
MW Peterson
N Saitou
RC Edgar
SA McEwen
Scott Mardis
SJ Matthews
T Hughes
TB Reddy
TZ DeSantis Jr
U Rost
V Brendel
X Wu
XF Wan
Publication venue: BioMed Central
Publication date: 01/01/2011

BACKGROUND: Current sequencing technology makes it practical to sequence many samples of a given organism, raising new challenges for the processing and interpretation of large genomics data sets with associated metadata. Traditional computational phylogenetic methods are ideal for studying the evolution of gene/protein families and using those to infer the evolution of an organism, but are less than ideal for the study of the whole organism, mainly due to the presence of insertions/deletions/rearrangements. These methods provide the researcher with the ability to group a set of samples into distinct genotypic groups based on sequence similarity, which can then be associated with metadata, such as host information, pathogenicity, and time or location of occurrence. Genotyping is critical to understanding, at a genomic level, the origin and spread of infectious diseases. Increasingly, genotyping is coming into use for disease surveillance activities, as well as for microbial forensics. The classic genotyping approach has been based on phylogenetic analysis, starting with a multiple sequence alignment. Genotypes are then established by expert examination of phylogenetic trees. However, these traditional single-processor methods are suboptimal for the rapidly growing sequence datasets being generated by next-generation DNA sequencing machines, because they increase in computational complexity quickly with the number of sequences. RESULTS: Nephele is a suite of tools that uses the complete composition vector algorithm to represent each sequence in the dataset as a vector derived from its constituent k-mers, bypassing the need for multiple sequence alignment, and affinity propagation clustering to group the sequences into genotypes based on a distance measure over the vectors. Our methods produce results that correlate well with expert-defined clades or genotypes, at a fraction of the computational cost of traditional phylogenetic methods run on traditional hardware.
Nephele can use the open-source Hadoop implementation of MapReduce to parallelize execution using multiple compute nodes. We were able to generate a neighbour-joined tree of over 10,000 16S samples in less than 2 hours. CONCLUSIONS: We conclude that using Nephele can substantially decrease the processing time required for generating genotype trees of tens to hundreds of organisms at genome-scale sequence coverage.
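
The alignment-free idea in this abstract can be illustrated with a minimal sketch. It assumes plain overlapping k-mer counts and cosine distance as a simplified stand-in; it is not the complete composition vector algorithm or the affinity propagation step that Nephele uses, neither of which is specified here. The function names and toy sequences are hypothetical.

```python
from collections import Counter
from math import sqrt


def kmer_vector(seq, k=3):
    """Count overlapping k-mers (a simplified stand-in for a composition vector)."""
    seq = seq.upper()
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def cosine_distance(a, b):
    """Return 1 - cosine similarity between two sparse k-mer count vectors."""
    dot = sum(count * b[kmer] for kmer, count in a.items())
    norm_a = sqrt(sum(c * c for c in a.values()))
    norm_b = sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 1.0  # treat an empty vector as maximally distant
    return 1.0 - dot / (norm_a * norm_b)


# Hypothetical toy sequences, not data from the paper.
samples = {
    "s1": "ACGTACGTACGT",
    "s2": "ACGTACGTACGA",
    "s3": "TTTTGGGGCCCC",
}
vectors = {name: kmer_vector(seq) for name, seq in samples.items()}
d12 = cosine_distance(vectors["s1"], vectors["s2"])  # near-identical sequences
d13 = cosine_distance(vectors["s1"], vectors["s3"])  # no shared 3-mers
```

Because no alignment is computed, each pairwise distance depends only on the two vectors involved, which is the property that makes this kind of computation straightforward to spread across compute nodes.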

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Cloning of a gene (SR-A1), encoding for a new member of the human Ser/Arg-rich family of pre-mRNA splicing factors: overexpression in aggressive ovarian cancer

Author: Aiyar A
Altschul SF
Amrein H
Ashworth LK
Au WC
Bairoch A
Blencowe BJ
Brendel V
Cannistra SA
Cavaloc Y
Day TG
Fu XD
Ge H
Gobert C
Goralski TJ
Hansen JE
Hoffmann K
Iida Y
Kim H
Kim YJ
Kohtz JD
Kozak M
Lennon G
Lowther WJ
McKeown M
Meier UT
Murakami K
Nielsen H
Retief JD
Ruegsegger U
Scorilas A
Screaton GR
Spritz RA
Stickeler E
Surowy CS
Takagaki Y
Takezaki N
Tanner S
Vellard M
Wang J
Webb CP
Yousef GM
Yuryev A
Zahler AM
Publication venue: Nature Publishing Group

Using the positional cloning approach, we identified a novel gene encoding a serine/arginine-rich protein, which appears to be the human homologue of the rat A1 gene. We named this new gene SR-A1. Members of the SR family of proteins have been shown to interact with the C-terminal domain (CTD) of the large subunit of RNA polymerase II and participate in pre-mRNA splicing. We have localized the SR-A1 gene between the known genes IRF3 and RRAS on chromosome 19q13.3. The novel gene spans 16.7 kb of genomic sequence and consists of 11 exons and 10 intervening introns. The SR-A1 protein is composed of 1312 amino acids, with a molecular mass of 139.3 kDa and a theoretical isoelectric point of 9.31. The SR-A1 protein contains an SR-rich domain as well as a CTD-binding domain present only in a subset of SR proteins. Through interactions with the pre-mRNA and the CTD of polymerase II, SR proteins have been shown to regulate alternative splicing. The SR-A1 gene is expressed in all tissues tested, with highest levels found in fetal brain and fetal liver. Our data suggest that this gene is overexpressed in a subset of ovarian cancers which are clinically more aggressive. Studies with the steroid hormone receptor-positive breast and prostate carcinoma cell lines ZR-75-1, BT-474 and LNCaP, respectively, suggest that SR-A1 is constitutively expressed. Furthermore, the mRNA of the SR-A1 gene in these cell lines appears to be increased by estrogens, androgens and glucocorticoids, and to a lesser extent by progestins. © 2001 Cancer Research Campaign http://www.bjcancer.co

Crossref

PubMed Central

Evaluation of the bacterial diversity of Pressure ulcers using bTEFAP pyrosequencing

BACKGROUND: Decubitus ulcers, also known as bedsores or pressure ulcers, affect millions of hospitalized patients each year. The microflora of chronic wounds such as ulcers most commonly exist in the biofilm phenotype and have been known to significantly impair normal healing trajectories. METHODS: Bacterial tag-encoded FLX amplicon pyrosequencing (bTEFAP), a universal bacterial identification method, was used to identify bacterial populations in 49 decubitus ulcers. Diversity estimators were utilized and wound community compositions analyzed in relation to metadata such as age, race, gender, and comorbidities. RESULTS: Decubitus ulcers are shown to be polymicrobial in nature, with no single bacterium exclusively colonizing the wounds. The microbial community among such ulcers is highly variable. While there are between 3 and 10 primary populations in each wound, there can be hundreds of different species present, many of which are in trace amounts. There are no clearly significant differences in the microbial ecology of decubitus ulcers in relation to metadata except when considering diabetes. The microbial populations and composition in the decubitus ulcers of diabetics may be significantly different from the communities in non-diabetics. CONCLUSIONS: Based upon the continued elucidation of chronic wound bioburdens as polymicrobial infections, it is recommended that, in addition to traditional biofilm-based wound care strategies, an antimicrobial/antibiofilm treatment program be tailored to each patient's respective wound microflora.

Crossref

Directory of Open Access Journals

PubMed Central

Regulation of pH During Amelogenesis

Author: A Boyde
A Nanci
A Pushkin
AA Dogterom
Antonio Nanci
B Illek
BH Koller
C Supanchart
CE Kurschat
CE Smith
CK Arquitt
CL Kiefer
CT Supuran
D Deutsch
D Dinour
DA Parry
DF Travis
DH Retief
DM Lyaruu
ED Eanes
EJ Reith
FS Collins
FT Cua
FY Demirci
GE Lyman
H Warshawsky
HM Lin
HS Koppang
ID Mandel
Ira Kurtz
J Elizabeth
J Inatomi
J Lecanda
J Sok
J. Timothy Wright
JC Hu
JD Bartlett
JF Medina
JN Snouwaert
JP Simmer
JR Riordan
JT Wright
JW Bawden
K Iwasaki
K Josephsen
K Kondo
LR Gawenis
M Goldberg
M Kakei
MD McKee
Michael L. Paine
ML Drumm
ML Paine
MO Bevensee
MP Whyte
MU Nylen
N Abuladze
O Ryu
P Moffatt
P Pan
PJ Crawford
RA Frizzell
RA Young
RE Primosch
Rodrigo S. Lacruz
RS Lacruz
S Pastorekova
S Sasaki
S Toyosawa
SD McAlear
T Aoba
T Igarashi
T Odajima
T Sasaki
T Sugimoto
T Takagi
TD Azevedo
TJ Sferra
W Sui
WS Sly
Z Katzir
Publication venue: Springer-Verlag
Publication date: 01/01/2009

During amelogenesis, extracellular matrix proteins interact with growing hydroxyapatite crystals to create one of the most architecturally complex biological tissues. The process of enamel formation is a unique biomineralizing system characterized first by an increase in crystallite length during the secretory phase of amelogenesis, followed by a vast increase in crystallite width and thickness in the later maturation phase when organic complexes are enzymatically removed. Crystal growth is modulated by changes in the pH of the enamel microenvironment, which is critical for proper enamel biomineralization. Whereas the genetic bases for most abnormal enamel phenotypes (amelogenesis imperfecta) are generally associated with mutations to enamel matrix-specific genes, mutations to genes involved in pH regulation may result in severely affected enamel structure, highlighting the importance of pH regulation for normal enamel development. This review summarizes the intra- and extracellular mechanisms employed by the enamel-forming cells, ameloblasts, to maintain pH homeostasis and discusses the enamel phenotypes associated with disruptions to genes involved in pH regulation.

Crossref

Springer - Publisher Connector

PubMed Central

eScholarship - University of California

Carolina Digital Repository

Immunoglobulin Genomics in the Guinea Pig (Cavia porcellus)

Author: A Janke
A Reyes
A Shimizu
A Tutter
AM D’Erchia
AS Greenberg
B Wagner
BA Osborne
C Chevillard
C Hamers-Casterman
CM Johnston
D Graur
D Kajiwara
DG Higgins
DJ Padilla-Carlin
DK Lanning
DS Horner
E Bengten
E Vargas-Madrazo
EM Sturm
EP Andrianova
F Gambon-Deza
G Churakov
GM Air
GW Warr
HW Schroeder Jr
I Achour
IM Tomlinson
J Felsenstein
J Hendricks
J Johansson
J Sambrook
J Sun
JC Almagro
JD Retief
JD Thompson
JE Berman
JE Butler
JG Flanagan
JP Clewley
JW Ellison
K Helling
K Kullander
K Kuma
KA Charlton
KE Andersson
KL Knight
KW van Dijk
Liming Ren
M Bensmana
M Bruggemann
M Haino
M Nei
M Robinson-Rechavi
MC Sinclair
MG Morgado
MH Freedman
MJ Benton
ML Baker
MR Lucier
N Mizutani
Ning Li
PH Brodeur
PW Tucker
Qingwen Meng
Qingyong Meng
R Ikeda
RD Page
RK Thomas
RP Phizackerley
S Das
S Kawamura
Sebastian D. Fugmann
SJ Berens
SJ Currier
T Hall
T Honjo
T Qin
T Sitnikova
V Dufour
VK Nguyen
WH Li
X Wang
Xiaoxiang Hu
Y Bao
Y Cao
Y Guo
Y Sun
YA Zhang
Yaofeng Zhao
YH Lin
Yongchen Guo
Yonghua Bao
Publication venue: Public Library of Science

In science, the guinea pig is known as one of the gold standards for modeling human disease. It is especially important as a molecular and cellular biology model for studying the human immune system, as its immunological genes are more similar to human genes than are those of mice. The utility of the guinea pig as a model organism can be enhanced by further characterization of the genes encoding components of the immune system. Here, we report the genomic organization of the guinea pig immunoglobulin (Ig) heavy and light chain genes. The guinea pig IgH locus is located in genomic scaffolds 54 and 75, and spans approximately 6,480 kb. A total of 507 VH segments (94 potentially functional genes and 413 pseudogenes), 41 DH segments, six JH segments, four constant region genes (μ, γ, ε, and α), and one reverse δ remnant fragment were identified within the two scaffolds. Many VH pseudogenes were found within the guinea pig, and likely constituted a potential donor pool for gene conversion during evolution. The Igκ locus mapped to a 4,029 kb region of scaffolds 37 and 24 and is composed of 349 Vκ (111 potentially functional genes and 238 pseudogenes), three Jκ and one Cκ genes. The Igλ locus spans 1,642 kb in scaffold 4 and consists of 142 Vλ (58 potentially functional genes and 84 pseudogenes) and 11 Jλ-Cλ clusters. Phylogenetic analysis suggested that the guinea pig’s large germline VH gene segments appear to form limited gene families. Therefore, this species may generate antibody diversity via a gene conversion-like mechanism associated with its pseudogene reserves.
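
As a quick arithmetic check on the segment counts quoted in this abstract, the short sketch below confirms that the potentially functional and pseudogene counts add up to each stated total, then derives the pseudogene share per locus. The dictionary keys and the helper name are hypothetical. The counts are copied from the abstract, and the derived fractions are computed here rather than reported by the authors.

```python
# V segment counts as stated in the abstract:
# (total, potentially functional, pseudogenes).
v_segments = {
    "VH": (507, 94, 413),
    "Vkappa": (349, 111, 238),
    "Vlambda": (142, 58, 84),
}


def pseudogene_fraction(total, functional, pseudo):
    """Return the pseudogene share after checking that the tallies add up."""
    if functional + pseudo != total:
        raise ValueError("functional + pseudogene counts do not match the total")
    return pseudo / total


# Derived values (not reported in the abstract itself).
fractions = {name: pseudogene_fraction(*counts) for name, counts in v_segments.items()}

for name, share in fractions.items():
    print(f"{name}: {share:.1%} pseudogenes")
```

All three tallies are internally consistent, and the heavy-chain locus carries the largest pseudogene share, which fits the abstract's suggestion of a pseudogene donor pool for gene conversion.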

Crossref

Directory of Open Access Journals

PubMed Central