Search CORE

20,057 research outputs found

Cancer immunogenomics: Computational neoantigen identification and vaccine design

Author: Coffman Adam
Graubert Aaron
Griffith Malachi
Griffith Obi L
Hundal Jasreet
Kiwala Susanna
Mardis Elaine R
McMichael Joshua
Miller Christopher J
Walker Jason
Publication venue: Digital Commons@Becker
Publication date: 01/01/2016
Field of study

Structural Prediction of Protein–Protein Interactions by Docking: Application to Biomedical Problems

Author: Barradas-Bautista Didier
Fernández-Recio Juan
Pallara Chiara
Rosell Mireia
Publication venue: 'Elsevier BV'
Publication date: 01/01/2018
Field of study

A huge amount of genetic information is available thanks to the recent advances in sequencing technologies and the larger computational capabilities, but the interpretation of such genetic data at phenotypic level remains elusive. One of the reasons is that proteins are not acting alone, but are specifically interacting with other proteins and biomolecules, forming intricate interaction networks that are essential for the majority of cell processes and pathological conditions. Thus, characterizing such interaction networks is an important step in understanding how information flows from gene to phenotype. Indeed, structural characterization of protein–protein interactions at atomic resolution has many applications in biomedicine, from diagnosis and vaccine design, to drug discovery. However, despite the advances of experimental structural determination, the number of interactions for which there is available structural data is still very small. In this context, a complementary approach is computational modeling of protein interactions by docking, which is usually composed of two major phases: (i) sampling of the possible binding modes between the interacting molecules and (ii) scoring for the identification of the correct orientations. In addition, prediction of interface and hot-spot residues is very useful in order to guide and interpret mutagenesis experiments, as well as to understand functional and mechanistic aspects of the interaction. Computational docking is already being applied to specific biomedical problems within the context of personalized medicine, for instance, helping to interpret pathological mutations involved in protein–protein interactions, or providing modeled structural data for drug discovery targeting protein–protein interactions.Spanish Ministry of Economy grant number BIO2016-79960-R; D.B.B. is supported by a predoctoral fellowship from CONACyT; M.R. is supported by an FPI fellowship from the Severo Ochoa program. We are grateful to the Joint BSC-CRG-IRB Programme in Computational Biology.Peer ReviewedPostprint (author's final draft

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

UPCommons. Portal del coneixement obert de la UPC

Rosetta Brains: A Strategy for Molecularly-Annotated Connectomics

Author: Boyden Edward S
Church George M
Daugharthy Evan R
Kalhor Reza
Kebschull Justus M
Kording Konrad P
Lee Je Hyuk
Marblestone Adam H
Mishchenko Yuriy
Peikon Ian D
Shipman Seth L
Zador Anthony M
Publication venue
Publication date: 21/04/2014
Field of study

We propose a neural connectomics strategy called Fluorescent In-Situ Sequencing of Barcoded Individual Neuronal Connections (FISSEQ-BOINC), leveraging fluorescent in situ nucleic acid sequencing in fixed tissue (FISSEQ). FISSEQ-BOINC exhibits different properties from BOINC, which relies on bulk nucleic acid sequencing. FISSEQ-BOINC could become a scalable approach for mapping whole-mammalian-brain connectomes with rich molecular annotations

arXiv.org e-Print Archive

CiteSeerX

Directed evolution of Vibrio fischeri LuxR for increased sensitivity to a broad spectrum of acyl-homoserine lactones

Author: Arnold Frances H.
Collins Cynthia H.
Leadbetter Jared R.
Publication venue: 'Foundation for Cellular and Molecular Medicine'
Publication date: 01/02/2005
Field of study

LuxR-type transcriptional regulators play key roles in quorum-sensing systems that employ acyl-homoserine lactones (acyl-HSLs) as signal molecules. These proteins mediate quorum control by changing their interactions with RNA polymerase and DNA in response to binding their cognate acyl-HSL. The evolutionarily related LuxR-type proteins exhibit considerable diversity in primary sequence and in their response to acyl-HSLs having acyl groups of differing length and composition. Little is known about which residues determine acyl-HSL specificity, and less about the evolutionary time scales required to forge new ones. To begin to examine such issues, we have focused on the LuxR protein from Vibrio fischeri, which activates gene transcription in response to binding its cognate quorum signal, 3-oxohexanoyl-homoserine lactone (3OC6HSL). Libraries of luxR mutants were screened for variants exhibiting increased gene activation in response to octanoyl-HSL (C8HSL), with which wild-type LuxR interacts only weakly. Eight LuxR variants were identified that showed a 100-fold increase in sensitivity to C8HSL; these variants also displayed increased sensitivities to pentanoyl-HSL and tetradecanoyl-HSL, while maintaining a wild-type or greater response to 3OC6HSL. The most sensitive variants activated gene transcription as strongly with C8HSL as the wild type did with 3OC6HSL. With one exception, the amino acid residues involved were restricted to the N-terminal, 'signal-binding' domain of LuxR. These residue positions differed from critical positions previously identified via 'loss-of-function' mutagenesis. We have demonstrated that acyl-HSL-dependent quorum-sensing systems can evolve rapidly to respond to new acyl-HSLs, suggesting that there may be an evolutionary advantage to maintaining such plasticity

Caltech Authors

Discovery and population genomics of structural variation in a songbird genus

Author: A Catalán
A Kapusta
A Suh
AE van’t Hof
B Charlesworth
B Weckselblatt
BS Weir
C Alkan
C Küpper
C Randler
C-C Wu
C-S Chin
CF Mugal
D Metzler
DC Jeffares
DT Parkin
E Lieberman-Aiden
EB Chuong
FA Simão
FC Grandi
FJ Sedlazeck
FJ Sedlazeck
H Ellegren
H Li
H Li
J Huddleston
J Jurka
JBW Wolf
JT Sutton
JW Poelstra
JW Poelstra
K Katoh
KA Jønsson
L Feuk
LE Flagel
M Chakraborty
M Gymrek
M Nattestad
M Wellenreuther
MH Weissensteiner
MJP Chaisson
N Vijay
NH Putnam
P Danecek
P Danecek
RB Corbett-Detig
RM Layer
S Goodwin
S Kurtz
S Tusso
SF Altschul
SS Ho
T Kawakami
T Londei
T Rausch
T Wicker
TE Kijima
U Knief
V Peona
W Meise
X Chen
X Zheng
Y Zhou
ZN Kronenberg
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2020
Field of study

Structural variation (SV) constitutes an important type of genetic mutations providing the raw material for evolution. Here, we uncover the genome-wide spectrum of intra- and interspecific SV segregating in natural populations of seven songbird species in the genus Corvus. Combining short-read (N = 127) and long-read re-sequencing (N = 31), as well as optical mapping (N = 16), we apply both assembly- and read mapping approaches to detect SV and characterize a total of 220,452 insertions, deletions and inversions. We exploit sampling across wide phylogenetic timescales to validate SV genotypes and assess the contribution of SV to evolutionary processes in an avian model of incipient speciation. We reveal an evolutionary young (~530,000 years) cis-acting 2.25-kb LTR retrotransposon insertion reducing expression of the NDP gene with consequences for premating isolation. Our results attest to the wealth and evolutionary significance of SV segregating in natural populations and highlight the need for reliable SV genotyping

Crossref

Publikationer från Uppsala Universitet

Open Access LMU

PubMed Central

Digitala Vetenskapliga Arkivet - Academic Archive On-line

University of East Anglia digital repository

MPG.PuRe

Discovery of large genomic inversions using long range information.

Author: Alkan Can
Amemiya Chris T
Antonacci Francesca
Chiatante Giorgia
Eichler Evan E
Eslami Rasekh Marzieh
Miroballo Mattia
Tang Joyce
Ventura Mario
Publication venue: eScholarship, University of California
Publication date: 01/01/2017
Field of study

BackgroundAlthough many algorithms are now available that aim to characterize different classes of structural variation, discovery of balanced rearrangements such as inversions remains an open problem. This is mainly due to the fact that breakpoints of such events typically lie within segmental duplications or common repeats, which reduces the mappability of short reads. The algorithms developed within the 1000 Genomes Project to identify inversions are limited to relatively short inversions, and there are currently no available algorithms to discover large inversions using high throughput sequencing technologies.ResultsHere we propose a novel algorithm, VALOR, to discover large inversions using new sequencing methods that provide long range information such as 10X Genomics linked-read sequencing, pooled clone sequencing, or other similar technologies that we commonly refer to as long range sequencing. We demonstrate the utility of VALOR using both pooled clone sequencing and 10X Genomics linked-read sequencing generated from the genome of an individual from the HapMap project (NA12878). We also provide a comprehensive comparison of VALOR against several state-of-the-art structural variation discovery algorithms that use whole genome shotgun sequencing data.ConclusionsIn this paper, we show that VALOR is able to accurately discover all previously identified and experimentally validated large inversions in the same genome with a low false discovery rate. Using VALOR, we also predicted a novel inversion, which we validated using fluorescent in situ hybridization. VALOR is available at https://github.com/BilkentCompGen/VALOR

Springer - Publisher Connector

PubMed Central

eScholarship - University of California

Springer OAI

Single-molecule real-time sequencing combined with optical mapping yields completely finished fungal genome

Author: Datema Erwin
Faino Luigi
Janssen Antoine
Seidl Michael F.
Thomma Bart P. H. J.
Van Den Berg Grardy C. M.
Wittenberg Alexander H. J.
Publication venue: 'American Society for Microbiology'
Publication date: 01/01/2015
Field of study

Next-generation sequencing (NGS) technologies have increased the scalability, speed, and resolution of genomic sequencing and, thus, have revolutionized genomic studies. However, eukaryotic genome sequencing initiatives typically yield considerably fragmented genome assemblies. Here, we assessed various state-of-the-art sequencing and assembly strategies in order to produce a contiguous and complete eukaryotic genome assembly, focusing on the filamentous fungus Verticillium dahliae. Compared with Illumina-based assemblies of the V. dahliae genome, hybrid assemblies that also include PacBio- generated long reads establish superior contiguity. Intriguingly, provided that sufficient sequence depth is reached, assemblies solely based on PacBio reads outperform hybrid assemblies and even result in fully assembled chromosomes. Furthermore, the addition of optical map data allowed us to produce a gapless and complete V. dahliae genome assembly of the expected eight chromosomes from telomere to telomere. Consequently, we can now study genomic regions that were previously not assembled or poorly assembled, including regions that are populated by repetitive sequences, such as transposons, allowing us to fully appreciate an organism’s biological complexity. Our data show that a combination of PacBio-generated long reads and optical mapping can be used to generate complete and gapless assemblies of fungal genomes. IMPORTANCE Studying whole-genome sequences has become an important aspect of biological research. The advent of nextgeneration sequencing (NGS) technologies has nowadays brought genomic science within reach of most research laboratories, including those that study nonmodel organisms. However, most genome sequencing initiatives typically yield (highly) fragmented genome assemblies. Nevertheless, considerable relevant information related to genome structure and evolution is likely hidden in those nonassembled regions. Here, we investigated a diverse set of strategies to obtain gapless genome assemblies, using the genome of a typical ascomycete fungus as the template. Eventually, we were able to show that a combination of PacBiogenerated long reads and optical mapping yields a gapless telomere-to-telomere genome assembly, allowing in-depth genome sanalyses to facilitate functional studies into an organism’s biology

Directory of Open Access Journals

PubMed Central

Archivio della ricerca- Università di Roma La Sapienza

Identification of Structural Variation in Chimpanzees Using Optical Mapping and Nanopore Sequencing.

Author: Andrés Aida M
Dennis Megan Y
Kaya Gulhan
Mastoras Mira
Sahasrabudhe Ruta
Schmidt Joshua M
Shew Colin
Soto Daniela C
Publication venue: eScholarship, University of California
Publication date: 01/03/2020
Field of study

Recent efforts to comprehensively characterize great ape genetic diversity using short-read sequencing and single-nucleotide variants have led to important discoveries related to selection within species, demographic history, and lineage-specific traits. Structural variants (SVs), including deletions and inversions, comprise a larger proportion of genetic differences between and within species, making them an important yet understudied source of trait divergence. Here, we used a combination of long-read and -range sequencing approaches to characterize the structural variant landscape of two additional Pan troglodytes verus individuals, one of whom carries 13% admixture from Pan troglodytes troglodytes. We performed optical mapping of both individuals followed by nanopore sequencing of one individual. Filtering for larger variants (>10 kbp) and combined with genotyping of SVs using short-read data from the Great Ape Genome Project, we identified 425 deletions and 59 inversions, of which 88 and 36, respectively, were novel. Compared with gene expression in humans, we found a significant enrichment of chimpanzee genes with differential expression in lymphoblastoid cell lines and induced pluripotent stem cells, both within deletions and near inversion breakpoints. We examined chromatin-conformation maps from human and chimpanzee using these same cell types and observed alterations in genomic interactions at SV breakpoints. Finally, we focused on 56 genes impacted by SVs in >90% of chimpanzees and absent in humans and gorillas, which may contribute to chimpanzee-specific features. Sequencing a greater set of individuals from diverse subspecies will be critical to establish the complete landscape of genetic variation in chimpanzees

UCL Discovery

eScholarship - University of California

Genomic innovation for crop improvement

Author: Bevan Michael W.
Clark Matthew D
Krasileva Ksenia
Uauy Cristobal
Wulff Brande B H
Zhou Ji
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 14/03/2017
Field of study

Crop production needs to increase to secure future food supplies, while reducing its impact on ecosystems. Detailed characterization of plant genomes and genetic diversity is crucial for meeting these challenges. Advances in genome sequencing and assembly are being used to access the large and complex genomes of crops and their wild relatives. These have helped to identify a wide spectrum of genetic variation and permitted the association of genetic diversity with diverse agronomic phenotypes. In combination with improved and automated phenotyping assays and functional genomic studies, genomics is providing new foundations for crop-breeding systems

University of East Anglia digital repository

Assessing the Gene Content of the Megagenome: Sugar Pine (Pinus lambertiana).

Author: Delfino-Mix Annette
Famula Randi A
Gonzalez-Ibeas Daniel
Langley Charles H
Loopstra Carol A
Martinez-Garcia Pedro J
Neale David B
Stevens Kristian A
Wegrzyn Jill L
Publication venue: eScholarship, University of California
Publication date: 31/10/2016
Field of study

Sugar pine (Pinus lambertiana Douglas) is within the subgenus Strobus with an estimated genome size of 31 Gbp. Transcriptomic resources are of particular interest in conifers due to the challenges presented in their megagenomes for gene identification. In this study, we present the first comprehensive survey of the P. lambertiana transcriptome through deep sequencing of a variety of tissue types to generate more than 2.5 billion short reads. Third generation, long reads generated through PacBio Iso-Seq have been included for the first time in conifers to combat the challenges associated with de novo transcriptome assembly. A technology comparison is provided here to contribute to the otherwise scarce comparisons of second and third generation transcriptome sequencing approaches in plant species. In addition, the transcriptome reference was essential for gene model identification and quality assessment in the parallel project responsible for sequencing and assembly of the entire genome. In this study, the transcriptomic data were also used to address questions surrounding lineage-specific Dicer-like proteins in conifers. These proteins play a role in the control of transposable element proliferation and the related genome expansion in conifers

Directory of Open Access Journals

PubMed Central

eScholarship - University of California