Search CORE

55 research outputs found

Recommended from our members

Are Clinical Genomes Already Becoming Semi-Routine For Patient Care?

Author: Boguski Mark S.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 15/04/2013
Field of study

Harvard University - DASH

Customized care 2020: how medical sequencing and network biology will enable personalized medicine

Author: Arnaout Ramy
Boguski Mark S
Hill Colin
Publication venue: Biology Reports Ltd
Publication date: 23/04/2011
Field of study

Applications of next-generation nucleic acid sequencing technologies will lead to the development of precision diagnostics that will, in turn, be a major technology enabler of precision medicine. Terabyte-scale, multidimensional data sets derived using these technologies will be used to reverse engineer the specific disease networks that underlie individual patients’ conditions. Modeling and simulation of these networks in the presence of virtual drugs, and combinations of drugs, will identify the most efficacious therapy for precision medicine and customized care. In coming years the practice of medicine will routinely employ network biology analytics supported by high-performance supercomputing

Crossref

Harvard University - DASH

PubMed Central

Primary structure and comparative sequence-analysis of an insect apolipoprotein: apolipophorin-Iii from Manduca-sexta

Author: Boguski Mark S.
Clark Wallace A.
Cole Kenneth D.
Fernando-Warnakulasuriya Germain J. P.
Freeman Mark
Gordon Jeffrey I.
Law John H.
Wells Michael A.
Publication venue: 'American Society for Biochemistry & Molecular Biology (ASBMB)'
Publication date: 25/08/1987
Field of study

The amino acid sequence of an insect apolipoprotein, apolipophorin-III from Manduca sexta, was determined by a combination of cDNA and protein sequencing. The mature hemolymph protein consists of 166 amino acids. The cDNA also encodes for an amino-terminal extension of 23 amino acids which is not represented in the mature hemolymph protein. The existence of a precursor protein was confirmed by in vitro translation of fat body mRNA. Computer-assisted comparative sequence analysis revealed the following points: 1) the protein is composed of tandemly repeating tetradecapeptide units with a high potential for forming amphiphilic helical structures. Compared to mammalian apolipoproteins the repeat units in the insect apolipoprotein show considerable length variability; 2) the sequence has a striking resemblance to several human apolipoproteins including apoE, AIV, AI, and CI. However, the homology seems to be entirely functional since, although the insect and mammalian apoproteins contain very similar types of amino acid residues, the actual degree of sequence identity is quite low. Whether the mammalian and insect apoproteins are derived from a common ancestral amphiphilic helix forming, lipid-binding protein, or arose by convergent evolution can not be determined at present. This represents the first complete amino acid sequence for an insect apolipoprotein

University of Queensland eSpace

annot8r: GO, EC and KEGG annotation of EST datasets

Author: A Bairoch
A Conesa
A Papanicolaou
DM Martin
E Camon
EM Zdobnov
J Bai
J Parkinson
J Parkinson
JD Wasmuth
JE Stajich
LB Koski
M Ashburner
M Kanehisa
Mark L Blaxter
MS Boguski
Ralf Schmid
SF Altschul
SR Stürzenbaum
The UniProt Consortium
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Abstract Background The expressed sequence tag (EST) methodology is an attractive option for the generation of sequence data for species for which no completely sequenced genome is available. The annotation and comparative analysis of such datasets poses a formidable challenge for research groups that do not have the bioinformatics infrastructure of major genome sequencing centres. Therefore, there is a need for user-friendly tools to facilitate the annotation of non-model species EST datasets with well-defined ontologies that enable meaningful cross-species comparisons. To address this, we have developed annot8r, a platform for the rapid annotation of EST datasets with GO-terms, EC-numbers and KEGG-pathways. Results annot8r automatically downloads all files relevant for the annotation process and generates a reference database that stores UniProt entries, their associated Gene Ontology (GO), Enzyme Commission (EC) and Kyoto Encyclopaedia of Genes and Genomes (KEGG) annotation and additional relevant data. For each of GO, EC and KEGG, annot8r extracts a specific sequence subset from the UniProt dataset based on the information stored in the reference database. These three subsets are then formatted for BLAST searches. The user provides the protein or nucleotide sequences to be annotated and annot8r runs BLAST searches against these three subsets. The BLAST results are parsed and the corresponding annotations retrieved from the reference database. The annotations are saved both as flat files and also in a relational postgreSQL results database to facilitate more advanced searches within the results. annot8r is integrated with the PartiGene suite of EST analysis tools. Conclusion annot8r is a tool that assigns GO, EC and KEGG annotations for data sets resulting from EST sequencing projects both rapidly and efficiently. The benefits of an underlying relational database, flexibility and the ease of use of the program make it ideally suited for non-model species EST-sequencing projects.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Edinburgh Research Explorer

Leicester Research Archive

Analysis of multiplex gene expression maps obtained by voxelation

Author: AK Jain
B Albert
D Lin
D Liu
Desmond J Smith
G Kaiser
Hongbo Xie
JA Hartigan
JB MacQueen
Li An
Mark H Chin
MB Eisen
MH Chin
MS Boguski
PO Brown
RJ Lipshutz
RP Singh
Vasileios Megalooikonomou
VE Velculescu
VM Brown
VM Brown
Zoran Obradovic
Publication venue: BioMed Central
Publication date: 01/04/2009
Field of study

BackgroundGene expression signatures in the mammalian brain hold the key to understanding neural development and neurological disease. Researchers have previously used voxelation in combination with microarrays for acquisition of genome-wide atlases of expression patterns in the mouse brain. On the other hand, some work has been performed on studying gene functions, without taking into account the location information of a gene's expression in a mouse brain. In this paper, we present an approach for identifying the relation between gene expression maps obtained by voxelation and gene functions.ResultsTo analyze the dataset, we chose typical genes as queries and aimed at discovering similar gene groups. Gene similarity was determined by using the wavelet features extracted from the left and right hemispheres averaged gene expression maps, and by the Euclidean distance between each pair of feature vectors. We also performed a multiple clustering approach on the gene expression maps, combined with hierarchical clustering. Among each group of similar genes and clusters, the gene function similarity was measured by calculating the average gene function distances in the gene ontology structure. By applying our methodology to find similar genes to certain target genes we were able to improve our understanding of gene expression patterns and gene functions. By applying the clustering analysis method, we obtained significant clusters, which have both very similar gene expression maps and very similar gene functions respectively to their corresponding gene ontologies. The cellular component ontology resulted in prominent clusters expressed in cortex and corpus callosum. The molecular function ontology gave prominent clusters in cortex, corpus callosum and hypothalamus. The biological process ontology resulted in clusters in cortex, hypothalamus and choroid plexus. Clusters from all three ontologies combined were most prominently expressed in cortex and corpus callosum.ConclusionThe experimental results confirm the hypothesis that genes with similar gene expression maps might have similar gene functions. The voxelation data takes into account the location information of gene expression level in mouse brain, which is novel in related research. The proposed approach can potentially be used to predict gene functions and provide helpful suggestions to biologists

Crossref

Springer - Publisher Connector

PubMed Central

eScholarship - University of California

Identification and Validation of Putative Nesprin Variants

Author: Brian A Sosa
Brian A. Sosa
Dipen Rajgor
Dipen Rajgor
Flavia Autore
Jack E Tabaska
Jennifer G. Simpson
Joshua T. Morgan
K. J. Roux
K. Natalie Randles
Kristina Djinovic-Carugo
Le Thanh Lam
Mark S. Boguski
Megan J. Puckelwartz
Melissa Crisp
Nguyen Thuy Duong
Peter Meinke
Q Zhang
Q. Zhang
Qiuping Zhang
Qiuping Zhang
Samantha J. King
Wenjia Wang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

Crossref

King's Research Portal

Comparative Analysis of Serine/Arginine-Rich Proteins across 27 Eukaryotes: Insights into Sub-Family Classification and Extent of Alternative Splicing

Author: A Barta
A Barta
A Labadorf
A Rambaut
A Stamatakis
Adam Labadorf
AE House
AJ Roger
AM Zahler
Andrew H. Paterson
Anireddy S. N. Reddy
AR Krainer
AR Subramanian
AR Subramanian
AS Reddy
AS Reddy
Asa Ben-Hur
B-B Wang
BR Graveley
BR Graveley
C Aurrecoechea
C Haynes
C Hertz-Fowler
CG Simpson
CJ Lawrence
D Brett
D Cazalla
D Longman
D Philipps
D Portal
Dale N. Richardson
DF Stojdl
DJ Zwickl
DMG Joni Fazo
E Kim
ED Harrington
EM Zdobnov
ET Wang
F Abascal
G Ast
G Blanc
GE Crooks
H Gao
H Ge
H Nozaki
H Shen
H Tang
Hui Guo
I Letunic
J Deckert
J Ding
J Felsenstein
J Schmutz
JC Long
JD Thompson
JF Caceres
JL Manley
JW Stiller
JY Wu
K Iida
KJ Hertel
L Eichinger
LF Lareau
LT Chow
M Chen
M Isshiki
M Kalyna
M Kalyna
M Zola
MA Campbell
Mark F. Rogers
MB Roth
MS Boguski
N Behzadnia
N Mulder
O Jaillon
P Zimmermann
PJ Shepard
Q Pan
R Tacke
RK Bradley
RL Poole
RV Spriggs
S Costanzo
S Eddy
S Lin
S Ouyang
S Tweedie
SA Filichkin
SA Rensing
SF Altschul
SG Palusa
SG Palusa
Shin-Han Shiu
SM Mount
TJ Hubbard
TW Harris
TW Nilsen
V Wood
W Gilbert
W van Der Houven Van Oordt
Y Murakami
YL Xiao
Z Gu
Z Tang
ZJ Lorkovic
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

Alternative splicing (AS) of pre-mRNA is a fundamental molecular process that generates diversity in the transcriptome and proteome of eukaryotic organisms. SR proteins, a family of splicing regulators with one or two RNA recognition motifs (RRMs) at the N-terminus and an arg/ser-rich domain at the C-terminus, function in both constitutive and alternative splicing. We identified SR proteins in 27 eukaryotic species, which include plants, animals, fungi and “basal” eukaryotes that lie outside of these lineages. Using RNA recognition motifs (RRMs) as a phylogenetic marker, we classified 272 SR genes into robust sub-families. The SR gene family can be split into five major groupings, which can be further separated into 11 distinct sub-families. Most flowering plants have double or nearly double the number of SR genes found in vertebrates. The majority of plant SR genes are under purifying selection. Moreover, in all paralogous SR genes in Arabidopsis, rice, soybean and maize, one of the two paralogs is preferentially expressed throughout plant development. We also assessed the extent of AS in SR genes based on a splice graph approach (http://combi.cs.colostate.edu/as/gmap_SRgenes). AS of SR genes is a widespread phenomenon throughout multiple lineages, with alternative 3′ or 5′ splicing events being the most prominent type of event. However, plant-enriched sub-families have 57%–88% of their SR genes experiencing some type of AS compared to the 40%–54% seen in other sub-families. The SR gene family is pervasive throughout multiple eukaryotic lineages, conserved in sequence and domain organization, but differs in gene number across lineages with an abundance of SR genes in flowering plants. The higher number of alternatively spliced SR genes in plants emphasizes the importance of AS in generating splice variants in these organisms

Public Library of Science (PLOS)

Crossref

Kölner UniversitätsPublikationsServer

Directory of Open Access Journals

PubMed Central

A transcriptomic analysis of Echinococcus granulosus larval stages:implications for parasite biology and host adaptation

Author: A Adoutte
A Agorio
A Aziz
A Diaz
A Diaz
A Diaz
A Garcia-Espana
A Pauli
A Siracusano
AA Aravin
AA Aravin
AC Arend
AJ Levine
AM Khalil
Arnaldo Zaha
AV Protasio
C Adamidi
C Fernandez
C Fernandez
C Lorenzo
Cecilia Fernández
Chris Sanford
CM Budke
CM Budke
CR Almeida
CR Raetz
Cristiano V. Bizarro
CV Bizarro
CV Dang
CW Dunn
D Palakodeti
D Sheehan
D Yang
DA Benson
DJ Jenkins
DP McManus
DTJ Littlewood
E Brunetti
E Fernandez-Taboada
E Ortona
F Berditchevski
F Gasparini
G Cheng
G Gonzalez
G Obal
G Salinas
G Salinas
Gustavo Salinas
GV Kryukov
H Aguilar-Diaz
H Philippe
Henrique B. Ferreira
HH Garcia
I Ulitsky
J Knapp
J Lundstrom
J Matsumoto
J Matsumoto
J Parkinson
J Parkinson
J Parkinson
J Parkinson
J Wasmuth
J Wasmuth
J Watanabe
James D. Wasmuth
JC Shepherd
JD Beggs
JD Bendtsen
JD Hayes
JD Wasmuth
JE Wilusz
JF Abril
JM Peregrin-Alvarez
JM Peregrin-Alvarez
John Parkinson
JW Nam
K Brehm
K Brehm
K Brehm
K Brehm
K Hancock
K Tamura
KM Monteiro
KM Monteiro
L Kamenetzky
M Agosin
M Cancela
M Galindo
M Ghildiyal
M Guttman
M Guttman
M Marin
M Nakao
M Reuter
M Seigneuret
M Spiliotis
M Yanez-Mo
MA Larkin
Malcolm K. Jones
Mark L. Blaxter
Matthew Berriman
ME Hemler
ME Hemler
MG Vander Heiden
MH Tran
MR Friedlander
MS Boguski
N Shibata
ND Young
ND Young
ND Young
OV Kovalenko
P Carninci
P Moro
PD Olson
PM Muzulin
PS Craig
R DeSalle
R Oriol
RD Finn
RE Davis
Rick M. Maizels
RJ DeBerardinis
RL Tatusov
RM Zayas
RM Zayas
RS Young
S Gonzalez
S Huang
S Huang
S Prast-Nielsen
S Stanojcic
S Verjovski-Almeida
SE Moss
SJ Santivanez
SM King
T Freire
T Laha
TA Holton
TR Mercer
U Koziol
V Fernandez
V Fernandez
W Hu
W Mamuti
W Wu
W Zhang
W Zhang
WG Yoo
WJ Zhao
Y Satou
Y Suzuki
YF Qin
YM Harcus
Z Dang
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2012
Field of study

The cestode Echinococcus granulosus--the agent of cystic echinococcosis, a zoonosis affecting humans and domestic animals worldwide--is an excellent model for the study of host-parasite cross-talk that interfaces with two mammalian hosts. To develop the molecular analysis of these interactions, we carried out an EST survey of E. granulosus larval stages. We report the salient features of this study with a focus on genes reflecting physiological adaptations of different parasite stages.We generated ~10,000 ESTs from two sets of full-length enriched libraries (derived from oligo-capped and trans-spliced cDNAs) prepared with three parasite materials: hydatid cyst wall, larval worms (protoscoleces), and pepsin/H(+)-activated protoscoleces. The ESTs were clustered into 2700 distinct gene products. In the context of the biology of E. granulosus, our analyses reveal: (i) a diverse group of abundant long non-protein coding transcripts showing homology to a middle repetitive element (EgBRep) that could either be active molecular species or represent precursors of small RNAs (like piRNAs); (ii) an up-regulation of fermentative pathways in the tissue of the cyst wall; (iii) highly expressed thiol- and selenol-dependent antioxidant enzyme targets of thioredoxin glutathione reductase, the functional hub of redox metabolism in parasitic flatworms; (iv) candidate apomucins for the external layer of the tissue-dwelling hydatid cyst, a mucin-rich structure that is critical for survival in the intermediate host; (v) a set of tetraspanins, a protein family that appears to have expanded in the cestode lineage; and (vi) a set of platyhelminth-specific gene products that may offer targets for novel pan-platyhelminth drug development.This survey has greatly increased the quality and the quantity of the molecular information on E. granulosus and constitutes a valuable resource for gene prediction on the parasite genome and for further genomic and proteomic analyses focused on cestodes and platyhelminths

Public Library of Science (PLOS)

Crossref

Lume 5.8

Directory of Open Access Journals

PubMed Central

Edinburgh Research Explorer

FigShare