Search CORE

65 research outputs found

SoftSearch: Integration of Multiple Sequence Features to Identify Breakpoints of Structural Variations

Author: Fergus J. Couch (146607)
Jaysheel D. Bhavsar (497995)
Jean-Pierre A. Kocher (184636)
Raymond Moore (497994)
Saurabh Baheti (479040)
Steven N. Hart (497992)
Vivekananda Sarangi (497993)
Publication venue
Publication date: 16/12/2013
Field of study

<div>BackgroundStructural variation (SV) represents a significant, yet poorly understood contribution to an individual’s genetic makeup. Advanced next-generation sequencing technologies are widely used to discover such variations, but there is no single detection tool that is considered a community standard. In an attempt to fulfil this need, we developed an algorithm, SoftSearch, for discovering structural variant breakpoints in Illumina paired-end next-generation sequencing data. SoftSearch combines multiple strategies for detecting SV including split-read, discordant read-pair, and unmated pairs. Co-localized split-reads and discordant read pairs are used to refine the breakpoints. ResultsWe developed and validated SoftSearch using real and synthetic datasets. SoftSearch’s key features are 1) not requiring secondary (or exhaustive primary) alignment, 2) portability into established sequencing workflows, and 3) is applicable to any DNA-sequencing experiment (e.g. whole genome, exome, custom capture, etc.). SoftSearch identifies breakpoints from a small number of soft-clipped bases from split reads and a few discordant read-pairs which on their own would not be sufficient to make an SV call. ConclusionsWe show that SoftSearch can identify more true SVs by combining multiple sequence features. SoftSearch was able to call clinically relevant SVs in the BRCA2 gene not reported by other tools while offering significantly improved overall performance. </div

Directory of Open Access Journals

PubMed Central

FigShare

Example IGV screenshot of a 71bp tandem duplication in the BRCA2 gene identified by SoftSearch.

Author: Fergus J. Couch (146607)
Jaysheel D. Bhavsar (497995)
Jean-Pierre A. Kocher (184636)
Raymond Moore (497994)
Saurabh Baheti (479040)
Steven N. Hart (497992)
Vivekananda Sarangi (497993)
Publication venue
Publication date
Field of study

Discordant reads are blue (plus strand) or red (minus strand). Soft clipped bases appear as multicolour “rainbows”.</p

FigShare

Overlap of true positive calls for the NA12878 and NA18507 datasets.

Author: Fergus J. Couch (146607)
Jaysheel D. Bhavsar (497995)
Jean-Pierre A. Kocher (184636)
Raymond Moore (497994)
Saurabh Baheti (479040)
Steven N. Hart (497992)
Vivekananda Sarangi (497993)
Publication venue
Publication date
Field of study

Overlap of true positive calls for the NA12878 and NA18507 datasets.</p

FigShare

The general strategy for SoftSearch.

Author: Fergus J. Couch (146607)
Jaysheel D. Bhavsar (497995)
Jean-Pierre A. Kocher (184636)
Raymond Moore (497994)
Saurabh Baheti (479040)
Steven N. Hart (497992)
Vivekananda Sarangi (497993)
Publication venue
Publication date
Field of study

A) Left clipped reads are defined as where the clipped portion of the read is at a smaller genome coordinate than the opposite end (opposite for right clipping). For a left clipped read located on the “+” strand, SoftSearch looks upstream for a discordant read pair where the read is oriented in the “-” direction. The orientation and location of the mate is where SoftSearch links the first region to. To increase the likelihood of exactly detecting the breakpoint, it then looks upstream for a right clipped read cluster. If none is found, then the default breakpoint location is the discordant read mate location; otherwise it is the position of soft clipping at the right clipped read. B) SoftSearch determines discordant read pairs by their insert size and orientation and places them in a temporary BAM file. It also reads the input BAM file for soft clipped reads and converts them to a BED file. Overlapping soft clip locations are counted to identify putative breakpoints, and then queried against the discordant read pair bam file for properly oriented supporting reads, which are then output in VCF format.</p

FigShare

Manhattan plot of results from genome wide meta-analysis of POSH stage-1 and HEBCS hazard ratios and 95% confidence intervals.

Author: Andrew Collins (63814)
Carl Blomqvist (83761)
Diana Eccles (154169)
Fergus J. Couch (146607)
Heli Nevanlinna (83762)
Jianjun Liu (28194)
Kristiina Aittomäki (89889)
Rosanna Upstill-Goddard (435141)
Sajjad Rafiq (102223)
Sofia Khan (79890)
Susan Gerty (677141)
William Tapper (435143)
Publication venue
Publication date
Field of study

The 25 most associated SNPs are highlighted in green.</p

FigShare

Associations of SNPs with nominal replication signals with clinical characteristics associated with breast cancer in a pooled set of discovery and replication cohorts.

Author: Andrew Collins (63814)
Carl Blomqvist (83761)
Diana Eccles (154169)
Fergus J. Couch (146607)
Heli Nevanlinna (83762)
Jianjun Liu (28194)
Kristiina Aittomäki (89889)
Rosanna Upstill-Goddard (435141)
Sajjad Rafiq (102223)
Sofia Khan (79890)
Susan Gerty (677141)
William Tapper (435143)
Publication venue
Publication date
Field of study

N-stage = metastasis to lymph node, M-stage = metastasis stage and T-stage = Tumour stage.Associations of SNPs with nominal replication signals with clinical characteristics associated with breast cancer in a pooled set of discovery and replication cohorts.</p

FigShare

Replication of most significant associations from the discovery set meta-analysis in the replication samples.

Author: Andrew Collins (63814)
Carl Blomqvist (83761)
Diana Eccles (154169)
Fergus J. Couch (146607)
Heli Nevanlinna (83762)
Jianjun Liu (28194)
Kristiina Aittomäki (89889)
Rosanna Upstill-Goddard (435141)
Sajjad Rafiq (102223)
Sofia Khan (79890)
Susan Gerty (677141)
William Tapper (435143)
Publication venue
Publication date
Field of study

Results are presented for those SNPs which remained associated in the same direction in the validation set as in the discovery set (adjusted for ER-status).Replication of most significant associations from the discovery set meta-analysis in the replication samples.</p

FigShare

Kaplan-Meier plots depicting breast cancer related survival in response to rs421379 genotypes in pooled POSH stage-1, HEBCS and POSH stage-2 samples.

Author: Andrew Collins (63814)
Carl Blomqvist (83761)
Diana Eccles (154169)
Fergus J. Couch (146607)
Heli Nevanlinna (83762)
Jianjun Liu (28194)
Kristiina Aittomäki (89889)
Rosanna Upstill-Goddard (435141)
Sajjad Rafiq (102223)
Sofia Khan (79890)
Susan Gerty (677141)
William Tapper (435143)
Publication venue
Publication date
Field of study

Kaplan-Meier plots depicting breast cancer related survival in response to rs421379 genotypes in pooled POSH stage-1, HEBCS and POSH stage-2 samples.</p

FigShare

List of participating studies and number of Caucasian subjects included in at least one GxE analysis.

Author: Alexander Miron (28179)
Alice J. Sigurdson (148099)
Alina Vrieling (205159)
Alison M. Dunning (146796)
Amanda B. Spurdle (47489)
Amy Trentham-Dietz (146782)
Angela Cox (76913)
Anja Rudolph (180618)
Anna Marie Mulligan (146672)
AOCS Management Group (395478)
Argyrios Ziogas (4194)
Arif B. Ekici (146432)
Arto Mannermaa (89891)
Børge G. Nordestgaard (89871)
Celine M. Vachon (219498)
Charlotte Lanng (395474)
Christa Stegmaier (146503)
Christina A. Clarke (146496)
Dan Connley (395482)
Daniel F. Schmidt (395473)
Dieter Flesch-Janys (89907)
Diether Lambrechts (101772)
Dominiek Smeets (395479)
Doug F. Easton (395486)
Elizabeth K. Cahoon (395484)
Emilie Cordina-Duverger (146484)
Enes Makalic (395472)
Esther M. John (107266)
Fergus J. Couch (146607)
Florence Menegaux (146487)
Gianluca Severi (89918)
Gord Glendon (146666)
Graham G. Giles (89916)
Heiko Müller (7901)
Helen Cramp (395481)
Hermann Brenner (63649)
Hiltrud Brauch (89877)
Hoda Anton-Culver (3473)
Irene L. Andrulis (107262)
Isabel dos Santos Silva (146449)
Jaana M. Hartikainen (395477)
Janet E. Olson (89913)
Jean Wang (43694)
Jenny Chang-Claude (23562)
Jianjun Liu (28194)
John L. Hopper (89919)
Jolanta Lissowska (89931)
Jonine Figueroa (273680)
Julia A. Knight (146661)
Julian Peto (146454)
Katharina Buck (102605)
Kathleen Egan (395483)
kConFab (2614363)
Kenneth Offit (63812)
Kristen Stevens (121411)
Laura Baglietto (89917)
Leslie Bernstein (3467)
Linda Titus (146784)
Lorna Gibson (153960)
Lothar Häberle (395470)
Manjeet K. Humphreys (146790)
Marjanka K. Schmidt (146376)
Martina Schmidt (183309)
Matthias W. Beckmann (146438)
Melissa C. Southey (89921)
Mia Gaudet (395471)
Michele M. Doody (395485)
Mitul Shah (51821)
Montserrat Garcia-Closas (89866)
Nadia Obi (395480)
Nils Schoof (190380)
Olivia Fletcher (146442)
Pascal Guénel (146482)
Patrick Neven (146593)
Paul D. P. Pharoah (146906)
Per Hall (23544)
Peter A. Fasching (146424)
Polly Newcomb (218870)
Preetha Rajaraman (225262)
Rebecca Hein (146351)
Robert Paridaens (44564)
Roger L. Milne (89874)
Sabapathy Balasubramanian (380655)
Sabine Behrens (395469)
Shan Wang-Gohrke (89910)
Stefan Nickels (146598)
Stephen J. Chanock (14311)
Stig E. Bojesen (89870)
The GENICA Network (395476)
Thomas Brüning (135293)
Thérèse Truong (146488)
Ursula Eilber (102621)
Veli-Matti Kosma (89895)
Vesa Kataja (89896)
Volker Arndt (146500)
Volker Harth (395475)
Publication venue
Publication date
Field of study

List of participating studies and number of Caucasian subjects included in at least one GxE analysis.</p

FigShare

Main effects for the epidemiologic variables included in the analyses, derived from population-based studies only1.

Author: Alexander Miron (28179)
Alice J. Sigurdson (148099)
Alina Vrieling (205159)
Alison M. Dunning (146796)
Amanda B. Spurdle (47489)
Amy Trentham-Dietz (146782)
Angela Cox (76913)
Anja Rudolph (180618)
Anna Marie Mulligan (146672)
AOCS Management Group (395478)
Argyrios Ziogas (4194)
Arif B. Ekici (146432)
Arto Mannermaa (89891)
Børge G. Nordestgaard (89871)
Celine M. Vachon (219498)
Charlotte Lanng (395474)
Christa Stegmaier (146503)
Christina A. Clarke (146496)
Dan Connley (395482)
Daniel F. Schmidt (395473)
Dieter Flesch-Janys (89907)
Diether Lambrechts (101772)
Dominiek Smeets (395479)
Doug F. Easton (395486)
Elizabeth K. Cahoon (395484)
Emilie Cordina-Duverger (146484)
Enes Makalic (395472)
Esther M. John (107266)
Fergus J. Couch (146607)
Florence Menegaux (146487)
Gianluca Severi (89918)
Gord Glendon (146666)
Graham G. Giles (89916)
Heiko Müller (7901)
Helen Cramp (395481)
Hermann Brenner (63649)
Hiltrud Brauch (89877)
Hoda Anton-Culver (3473)
Irene L. Andrulis (107262)
Isabel dos Santos Silva (146449)
Jaana M. Hartikainen (395477)
Janet E. Olson (89913)
Jean Wang (43694)
Jenny Chang-Claude (23562)
Jianjun Liu (28194)
John L. Hopper (89919)
Jolanta Lissowska (89931)
Jonine Figueroa (273680)
Julia A. Knight (146661)
Julian Peto (146454)
Katharina Buck (102605)
Kathleen Egan (395483)
kConFab (2614363)
Kenneth Offit (63812)
Kristen Stevens (121411)
Laura Baglietto (89917)
Leslie Bernstein (3467)
Linda Titus (146784)
Lorna Gibson (153960)
Lothar Häberle (395470)
Manjeet K. Humphreys (146790)
Marjanka K. Schmidt (146376)
Martina Schmidt (183309)
Matthias W. Beckmann (146438)
Melissa C. Southey (89921)
Mia Gaudet (395471)
Michele M. Doody (395485)
Mitul Shah (51821)
Montserrat Garcia-Closas (89866)
Nadia Obi (395480)
Nils Schoof (190380)
Olivia Fletcher (146442)
Pascal Guénel (146482)
Patrick Neven (146593)
Paul D. P. Pharoah (146906)
Per Hall (23544)
Peter A. Fasching (146424)
Polly Newcomb (218870)
Preetha Rajaraman (225262)
Rebecca Hein (146351)
Robert Paridaens (44564)
Roger L. Milne (89874)
Sabapathy Balasubramanian (380655)
Sabine Behrens (395469)
Shan Wang-Gohrke (89910)
Stefan Nickels (146598)
Stephen J. Chanock (14311)
Stig E. Bojesen (89870)
The GENICA Network (395476)
Thomas Brüning (135293)
Thérèse Truong (146488)
Ursula Eilber (102621)
Veli-Matti Kosma (89895)
Vesa Kataja (89896)
Volker Arndt (146500)
Volker Harth (395475)
Publication venue
Publication date
Field of study

Main effects for the epidemiologic variables included in the analyses, derived from population-based studies only<a href="http://www.plosgenetics.org/article/info:doi/10.1371/journal.pgen.1003284#nt104" target="_blank">1</a>.</p

FigShare