14 research outputs found

Assessment of methods for amino acid matrix selection and their use on empirical data shows that ad hoc assumptions for choice of matrix are not justified

Author: Creevey Christopher J
Keane Thomas M
McInerney James O
Naughton Thomas J
Pentony Melissa M
Publication date: 01/01/2006

BACKGROUND: In recent years, model-based approaches such as maximum likelihood have become the methods of choice for constructing phylogenies. A number of authors have shown the importance of using adequate substitution models in order to produce accurate phylogenies. In the past, many empirical models of amino acid substitution have been derived using a variety of different methods and protein datasets. These matrices are normally used as surrogates, rather than deriving the maximum likelihood model from the dataset being examined. With few exceptions, selection between alternative matrices has been carried out in an ad hoc manner. RESULTS: We start by highlighting the potential dangers of arbitrarily choosing protein models by demonstrating an empirical example where a single alignment can produce two topologically different and strongly supported phylogenies using two different arbitrarily-chosen amino acid substitution models. We demonstrate that in simple simulations, statistical methods of model selection are indeed robust and likely to be useful for protein model selection. We have investigated patterns of amino acid substitution among homologous sequences from the three Domains of life and our results show that no single amino acid matrix is optimal for any of the datasets. Perhaps most interestingly, we demonstrate that for two large datasets derived from the proteobacteria and archaea, one of the most favored models in both datasets is a model that was originally derived from retroviral Pol proteins. CONCLUSION: This demonstrates that choosing protein models based on their source or method of construction may not be appropriate.

MURAL - Maynooth University Research Archive Library

Aberystwyth Research Portal

Springer - Publisher Connector

PubMed Central

NUI Maynooth Eprint Archive

Maynooth University ePrints and eTheses Archive

The University of Manchester - Institutional Repository

Recommended from our members

The complete costs of genome sequencing: a microcosting study in cancer and rare diseases from a single center in the United Kingdom

Author: Antoniou Pavlos
Buchanan James
Camps Carme
Dreau Helene
Fermont Jilles M.
Harris Steve
Knight Samantha J. L.
Kvikstad Erika M.
Pagnamenta Alistair T.
Pentony Melissa M.
Popitsch Niko
Schuh Anna
Schwarze Katharina
Taylor Jenny C.
Taylor John M.
Tilley Mark W.
Wordsworth Sarah
Publication venue: Genetics in Medicine
Publication date: 01/01/2020

Abstract: Purpose: The translation of genome sequencing into routine health care has been slow, partly because of concerns about affordability. The aspirational cost of sequencing a genome is $1000, but there is little evidence to support this estimate. We estimate the cost of using genome sequencing in routine clinical care in patients with cancer or rare diseases. Methods: We performed a microcosting study of Illumina-based genome sequencing in a UK National Health Service laboratory processing 399 samples/year. Cost data were collected for all steps in the sequencing pathway, including bioinformatics analysis and reporting of results. Sensitivity analysis identified key cost drivers. Results: Genome sequencing costs £6841 per cancer case (comprising matched tumor and germline samples) and £7050 per rare disease case (three samples). The consumables used during sequencing are the most expensive component of testing (68–72% of the total cost). Equipment costs are higher for rare disease cases, whereas consumable and staff costs are slightly higher for cancer cases. Conclusion: The cost of genome sequencing is underestimated if only sequencing costs are considered, and likely surpasses $1000/genome in a single laboratory. This aspirational sequencing cost will likely only be achieved if consumable costs are considerably reduced and sequencing is performed at scale.

Apollo (Cambridge)

Possible Loss of the Chloroplast Genome in the Parasitic Flowering Plant Rafflesia lagascae (Rafflesiaceae)

Author: Barcelona Julie
Concepcion Gisela P.
Flowers Jonathan M.
Geisler Matt
Hazzouri Khaled M
Inovejas Samuel Alan
Locklear Selina
Meyer Rachel S
Michel Claire-Iphanise
Molina Jeanmaire
Nickrent Daniel
Pelser Pieter
Pentony Melissa M.
Purugganan Michael D
Uy Iris
Wilkins Olivia
Yuan Wei
Publication venue: OpenSIUC
Publication date: 01/01/2014

Rafflesia is a genus of holoparasitic plants endemic to Southeast Asia that has lost the ability to undertake photosynthesis. With short-read sequencing technology, we assembled a draft sequence of the mitochondrial genome of Rafflesia lagascae Blanco, a species endemic to the Philippine island of Luzon, with ∼350× sequencing depth coverage. Using multiple approaches, however, we were only able to identify small fragments of plastid sequences at low coverage depth

PubMed Central

OpenSIUC

Clinically actionable mutation profiles in patients with cancer identified by whole-genome sequencing

Author: Ahmed Ahmed
Antoniou Pavlos
Athanasou Nick
Church David
Colling Richard
Dreau Helene
Flanagan Adrienne M.
Hamblin Angela
Harris Adrian
Hassan Bass
Knight Samantha J. L.
Kvikstad Erika M.
Mizani Tuba
Orosz Zsolt
Parton Marina
Pentony Melissa M.
Popitsch Niko
Protheroe Andrew
Ridout Kate
Schuh Anna
Shah Ketan A.
Taylor Jenny C.
Tomlinson Ian
Vavoulis Dimitris
Winter Stuart
Publication venue: Cold Spring Harbor Laboratory
Publication date: 01/01/2018

Next-generation sequencing (NGS) efforts have established catalogs of mutations relevant to cancer development. However, the clinical utility of this information remains largely unexplored. Here, we present the results of the first eight patients recruited into a clinical whole-genome sequencing (WGS) program in the United Kingdom. We performed PCR-free WGS of fresh frozen tumors and germline DNA at 75× and 30×, respectively, using the HiSeq2500 HTv4. Subtracted tumor VCFs and paired germlines were subjected to comprehensive analysis of coding and noncoding regions, integration of germline with somatically acquired variants, and global mutation signatures and pathway analyses. Results were classified into tiers and presented to a multidisciplinary tumor board. WGS results helped to clarify an uncertain histopathological diagnosis in one case, led to informed or supported prognosis in two cases, leading to de-escalation of therapy in one, and indicated potential treatments in all eight. Overall 26 different tier 1 potentially clinically actionable findings were identified using WGS compared with six SNVs/indels using routine targeted NGS. These initial results demonstrate the potential of WGS to inform future diagnosis, prognosis, and treatment choice in cancer and justify the systematic evaluation of the clinical utility of WGS in larger cohorts of patients with cancer

Crossref

UCL Discovery

Edinburgh Research Explorer

Oxford University Research Archive

Mutation burden and other molecular markers of prognosis in colorectal cancer treated with curative intent: results from the QUASAR 2 clinical trial and an Australian community-based series

Author: Askautrud Hanne
Bark Yasmine
Camps Carme
Church David N
Danielsen Havard E
Domingo Enric
Gibbs Peter
Hawkins Nicholas J
Johnstone Elaine C
Kaisaki Pamela J
Kaur Kulvinder
Kerr David
Kerr Rachel
Makino Seiko
Mouradov Dmitri
Novelli Marco
Oukrif Dahmane
Palles Claire
Palmieri Michelle
Parsons Marie J
Pentony Melissa M
Sherlock Jon
Sieber Oliver
Taylor Jenny C
Tomlinson Evie
Tomlinson Ian
Wang Haitao
Ward Robyn L
Wood Joe
Publication venue: Elsevier BV
Publication date: 01/01/2018

Background: Several relatively large studies have assessed molecular indicators of colorectal cancer (CRC) prognosis, but most analyses have been restricted to a handful of markers. Methods: In stage II/III CRCs from the QUASAR2 clinical trial and from an Australian community-based series, we assessed gene panels for somatic driver mutations and overall mutation burden. We determined molecular pathways of tumorigenesis, and analysed associations with treatment response and prognosis. Findings: In QUASAR2 (N=511), TP53, KRAS, BRAF and GNAS mutations were independently associated with shorter relapse-free survival, whereas total somatic mutation burden was associated with longer survival, even after excluding mismatch repair-deficient (MSI+) and POLE-mutant tumours. We successfully validated these associations in the Australian sample set (N=296). In an extended analysis of 1,752 QUASAR2 and Australian CRCs for which KRAS, BRAF and MSI status was available, we found that KRAS and BRAF mutations were specifically associated with poor prognosis in MSI- cancers. This association was not present in MSI+ cancers, and MSI+ tumours with KRAS or BRAF mutation actually had better prognosis than MSI- cancers that were wildtype for KRAS or BRAF. New rare molecular pathways were also uncovered: mutations in the genes NF1 and NRAS from the MAP kinase pathway co-occurred, and mutations in TP53 and ATM appeared to be alternative ways of inactivating the DNA damage response pathway. Interpretation: A multi-gene panel has identified two previously unreported prognostic associations in CRC involving both TP53 mutation and total mutation burden, and confirmed associations with KRAS and BRAF. We conclude that even a modest-sized gene panel can provide important information for use in clinical practice and out-perform MSI-based models.

Crossref

University of Birmingham Research Portal

UCL Discovery

Edinburgh Research Explorer

Oxford University Research Archive

NORA - Norwegian Open Research Archives

University of Queensland eSpace

Structural and non-coding variants increase the diagnostic yield of clinical whole genome sequencing for rare diseases

Author: Allroggen Holger
Ansorge Olaf
Babbs Christian
Banka Siddharth
Baños-Piñero Benito
Beeson David
Ben-Ami Tal
Bennett David L.
Bento Celeste
Blair Edward
Brasch-Andersen Charlotte
Bull Katherine R.
Calpena Eduardo
Camps Carme
Cario Holger
Cilliers Deirdre
Conti Valerio
Dacal Beatriz Diez
Davies E. Graham
Dhalla Fatima
Dong Yin
Dreau Helene
Dunford James E.
Ferla Matteo
Giacopuzzi Edoardo
Guerrini Renzo
Harris Adrian L.
Hartley Jane
Hashim Mona
Hashimoto Akiko
Hollander Georg
Hughes Jim R.
Javaid Kassim
Kaisaki Pamela J.
Kane Maureen
Kelly Deirdre
Kelly Dominic
Kesim Yesim
Kini Usha
Knight Samantha J. L.
Kreins Alexandra Y.
Kvikstad Erika M.
Lange Lukas
Langman Craig B.
Lester Tracy
Lines Kate E.
Lord Simon R.
Lu Xin
Lunter Gerton
Mansour Sahar
Manzur Adnan
Maroofian Reza
Marsden Brian
Mason Joanne
McGowan Simon J.
Mei Davide
Mlcochova Hana
Murakami Yoshiko
Németh Andrea H.
Okoli Steven
Ormondroyd Elizabeth
Ousager Lilian Bomme
Pagnamenta Alistair T.
Palace Jacqueline
Patel Smita Y.
Pentony Melissa M.
Popitsch Niko
Pugh Chris
Rad Aboulfazl
Ragoussis Vassilis
Ramesh Archana
Riva Simone G.
Roberts Irene
Roy Noémi
Salminen Outi
Sanders Edward
Schilling Kyleen D.
Schuh Anna H.
Schwessinger Ron
Scott Caroline
Sen Arjune
Smith Conrad
Stevenson Mark
Taylor Jenny C.
Taylor John M.
Thakker Rajesh V.
Twigg Stephen R. F.
Uhlig Holm H.
van Wijk Richard
Vavoulis Dimitrios V.
Vona Barbara
Wall Steven
Wang Jing
Watkins Hugh
Wilkie Andrew O. M.
Yu Jing
Zak Jaroslav
Publication date: 09/11/2023

BACKGROUND: Whole genome sequencing is increasingly being used for the diagnosis of patients with rare diseases. However, the diagnostic yields of many studies, particularly those conducted in a healthcare setting, are often disappointingly low, at 25-30%. This is in part because although entire genomes are sequenced, analysis is often confined to in silico gene panels or coding regions of the genome. METHODS: We undertook WGS on a cohort of 122 unrelated rare disease patients and their relatives (300 genomes) who had been pre-screened by gene panels or arrays. Patients were recruited from a broad spectrum of clinical specialties. We applied a bioinformatics pipeline that would allow comprehensive analysis of all variant types. We combined established bioinformatics tools for phenotypic and genomic analysis with our novel algorithms (SVRare, ALTSPLICE and GREEN-DB) to detect and annotate structural, splice site and non-coding variants. RESULTS: Our diagnostic yield was 43/122 cases (35%), although 47/122 cases (39%) were considered solved when taking novel candidate genes with supporting functional data into account. Structural, splice site and deep intronic variants contributed to 20/47 (43%) of our solved cases. Five genes that are novel, or were novel at the time of discovery, were identified, whilst a further three genes are putative novel disease genes with evidence of causality. We identified variants of uncertain significance in a further fourteen candidate genes. The phenotypic spectrum associated with RMND1 was expanded to include polymicrogyria. Two patients with secondary findings in FBN1 and KCNQ1 were confirmed to have previously unidentified Marfan and long QT syndromes, respectively, and were referred for further clinical interventions. Clinical diagnoses were changed in six patients and treatment adjustments made for eight individuals, which for five patients was considered life-saving. CONCLUSIONS: Genome sequencing is increasingly being considered as a first-line genetic test in routine clinical settings and can make a substantial contribution to rapidly identifying a causal aetiology for many patients, shortening their diagnostic odyssey. We have demonstrated that structural, splice site and intronic variants make a significant contribution to diagnostic yield and that comprehensive analysis of the entire genome is essential to maximise the value of clinical genome sequencing.

University of Birmingham Research Portal

The University of Manchester - Institutional Repository

Does a tree-like phylogeny only exist at the tips in the prokaryotes?

Author: Creevey Christopher J.
Fitzpatrick David A.
Kinsella Rhoda J.
McInerney James O.
O'Connell Mary J.
Pentony Melissa M.
Philip Gayle K.
Travers Simon A.
Wilkinson Mark
Publication venue: The Royal Society of London
Publication date: 01/01/2004

The extent to which prokaryotic evolution has been influenced by horizontal gene transfer (HGT) and therefore might be more of a network than a tree is unclear. Here we use supertree methods to ask whether a definitive prokaryotic phylogenetic tree exists and whether it can be confidently inferred using orthologous genes. We analysed an 11-taxon dataset spanning the deepest divisions of prokaryotic relationships, a 10-taxon dataset spanning the relatively recent gamma-proteobacteria and a 61-taxon dataset spanning both, using species for which complete genomes are available. Congruence among gene trees spanning deep relationships is not better than random. By contrast, a strong, almost perfect phylogenetic signal exists in gamma-proteobacterial genes. Deep-level prokaryotic relationships are difficult to infer because of signal erosion, systematic bias, hidden paralogy and/or HGT. Our results do not preclude levels of HGT that would be inconsistent with the notion of a prokaryotic phylogeny. This approach will help decide the extent to which we can say that there is a prokaryotic phylogeny and where in the phylogeny a cohesive genomic signal exists.

Maynooth University ePrints and eTheses Archive

Does a tree-like phylogeny only exist at the tips in the prokaryotes?

Author: Creevey Christopher J
Fitzpatrick David A
Kinsella Rhoda J
McInerney James O
O'Connell Mary J
Pentony Melissa M
Philip Gayle K
Travers Simon A
Wilkinson Mark
Publication date: 01/01/2004

The extent to which prokaryotic evolution has been influenced by horizontal gene transfer (HGT) and therefore might be more of a network than a tree is unclear. Here we use supertree methods to ask whether a definitive prokaryotic phylogenetic tree exists and whether it can be confidently inferred using orthologous genes. We analysed an 11-taxon dataset spanning the deepest divisions of prokaryotic relationships, a 10-taxon dataset spanning the relatively recent gamma-proteobacteria and a 61-taxon dataset spanning both, using species for which complete genomes are available. Congruence among gene trees spanning deep relationships is not better than random. By contrast, a strong, almost perfect phylogenetic signal exists in gamma-proteobacterial genes. Deep-level prokaryotic relationships are difficult to infer because of signal erosion, systematic bias, hidden paralogy and/or HGT. Our results do not preclude levels of HGT that would be inconsistent with the notion of a prokaryotic phylogeny. This approach will help decide the extent to which we can say that there is a prokaryotic phylogeny and where in the phylogeny a cohesive genomic signal exists

MURAL - Maynooth University Research Archive Library

Aberystwyth Research Portal

PubMed Central

NUI Maynooth Eprint Archive

Maynooth University ePrints and eTheses Archive

The University of Manchester - Institutional Repository

Identification of Circulating Genomic and Metabolic Biomarkers in Intrahepatic Cholangiocarcinoma

Author: Chu
Giacopuzzi Edoardo
Winter Helen
McCullagh James S.O.
Taylor Jenny C.
Harvey Joe
Laurier
Ferla Matteo P.
Pentony Melissa M.
Kaisaki Pamela J.
Reyes
Sharma Ricky A.
Knight Samantha J.L.
Sidransky
Xie
Publication venue: MDPI AG

Crossref