Search CORE

22 research outputs found

Editorial Board

Author: Chen Wu (49684)
Richard Newcomb (3561242)
Ross Crowhurst (532959)
Thomas Buckley (344503)
Victoria Twort (4599166)
Publication venue: Published by Elsevier B.V.
Publication date: 31/05/2008
Field of study

Scaffolds match archea, bacteria and virus sequences. (XLSX 140Â kb

Elsevier - Publisher Connector

Lund University Publications

Directory of Open Access Journals

FigShare

Summary of BLASTx comparisons to the Solgenomics N. benthamiana v0.4.4 predicted proteins database.

Author: Julia Bally (204199)
Kenlee Nakasugi (109354)
Peter Waterhouse (532960)
Ross Crowhurst (532959)
Publication venue
Publication date
Field of study

The orange tracks show the percentage of database hits from each assembly that were also found in other assemblies, represented by 18 data points in each track, including self-on-self comparison (see legend). The green tracks show the percentage of the database that were found in each assembly. The grey histograms are comparisons to the top 1000 longest proteins in the database, and the blue histograms are comparisons to all sequences in the database. Each set of histograms are sub-divided into 4 bars, representing from darkest to lightest colour, the total database (always 100%), the percentage of the database that is actually expressed in our assembly, the percentage of these expressed database sequences that were found in the assembly, and of these the percentage that were aligned to more than 80% of the database sequence length. The coloured links show the proportion of TaMraw, Trraw, Sotgi and OaMraw de novo assembled transcripts that were present in the SasmMraw and SasmMevi assemblies.</p

FigShare

Number of protein sequences after clustering of top 1000 longest proteins in each assembly, using CD-HIT with an identity of 95%.

Author: Julia Bally (204199)
Kenlee Nakasugi (109354)
Peter Waterhouse (532960)
Ross Crowhurst (532959)
Publication venue
Publication date
Field of study

Number of protein sequences after clustering of top 1000 longest proteins in each assembly, using CD-HIT with an identity of 95%.</p

FigShare

Homeologous or paralogous RNA silencing gene transcript sequences, their nucleotide and protein identities, and whether they could be assembled separately in TaMraw, Trraw, Soraw and OaMraw assemblies.

Author: Julia Bally (204199)
Kenlee Nakasugi (109354)
Peter Waterhouse (532960)
Ross Crowhurst (532959)
Publication venue
Publication date
Field of study

Homeologous or paralogous RNA silencing gene transcript sequences, their nucleotide and protein identities, and whether they could be assembled separately in TaMraw, Trraw, Soraw and OaMraw assemblies.</p

FigShare

Alignment coverage of 35 RNA silencing gene transcript sequences.

Author: Julia Bally (204199)
Kenlee Nakasugi (109354)
Peter Waterhouse (532960)
Ross Crowhurst (532959)
Publication venue
Publication date
Field of study

Sequences were queried against the 18 transcriptome assemblies in this study. The CDS of these de novo assembled transcripts were screened against the assemblies using BLASTn, and the query alignment coverages were calculated from the best match to the database.</p

FigShare

Statistics of raw unprocessed k-mer assemblies (Tr and So) or merged k-mer assemblies (TaM and OaM), and their Tgi processed and Evi processed assemblies.

Author: Julia Bally (204199)
Kenlee Nakasugi (109354)
Peter Waterhouse (532960)
Ross Crowhurst (532959)
Publication venue
Publication date
Field of study

Statistics of raw unprocessed k-mer assemblies (Tr and So) or merged k-mer assemblies (TaM and OaM), and their Tgi processed and Evi processed assemblies.</p

FigShare

Feature response curves of assemblies using the ‘High_spanning_PE’ feature.

Author: Julia Bally (204199)
Kenlee Nakasugi (109354)
Peter Waterhouse (532960)
Ross Crowhurst (532959)
Publication venue
Publication date
Field of study

This feature measures the number of PE reads where the pairs are mapped onto different contigs (de novo assembled transcripts). The feature threshold is used to filter out contigs that fall above a threshold. That is, only contigs that contain less than a threshold number of features are used to calculate the coverage at that threshold. Except for the TaMevi, the Evi processed assemblies appear to perform best or at least on par on all assemblies (raw vs Tgi vs Evi) as higher coverage is achieved at a lower feature threshold.</p

FigShare

Overview of assemblies generated from two datasets, showing k-mer size ranges and respective assemblies used to generate the two combined assembly types (SasmM and SasmK).

Author: Julia Bally (204199)
Kenlee Nakasugi (109354)
Peter Waterhouse (532960)
Ross Crowhurst (532959)
Publication venue
Publication date
Field of study

* k-mer assemblies merged by: Ta – Ta merge utility; So – TGI clustering software; Oa – Oa merge utility.</p

FigShare

Dcl1 transcript assembly status by the Trinity assembler.

Author: Julia Bally (204199)
Kenlee Nakasugi (109354)
Peter Waterhouse (532960)
Ross Crowhurst (532959)
Publication venue
Publication date
Field of study

Red bars indicate the known Dcl1 CDS used as query. Black bars indicate the transcripts assembled by the assembler. A: Alignment of de novo assembled transcripts generated with ds1 and ds2 reads to the query Dcl1 sequence. A full length Dcl1 sequence could be assembled with ds1 reads, but only two partial sequences were assembled with ds2 reads. B: Read depth profile from all RNA-seq reads mapped to the query Dcl1 CDS. Changes in read depth could indicate the presence of various isoforms. C: Alignment of the full length and partial Dcl1 de novo assembled transcripts at the region where there is a sharp change in read depth, to respective genome scaffolds in our v0.3 draft assembly. This shows that the partially assembled Dcl1 sequence contains unspliced intron, and that there may be two loci for Dcl1 as implied by the different intron sequences in the scaffolds (see text).</p

FigShare

Predicted transmembrane topology of OR7.

Author: Amali H. Thrimawithana (745987)
Bernd Steinwender (158911)
Richard D. Newcomb (158917)
Ross Crowhurst (532959)
Publication venue
Publication date
Field of study

Variable sites between P. octo and P. excessana are highlighted. Red dots indicate the position of amino acid substitutions in P. octo, and black dots amino acid substitutions in P. excessana compared to a predicted common ancestor. The double line indicates the transmembrane region, with extracellular and cytoplasmic sides labelled.</p

FigShare