Search CORE

34 research outputs found

The Source of the Data Flood: Sequencing Technologies

Author: Magi Alberto
PISANTI NADIA
TATTINI LORENZO
Publication venue
Publication date: 01/01/2016
Field of study

Where does this huge amount of data come from? What are the costs of producing it? The answers to these questions lie in the impressive development of sequencing technologies, which have opened up many research opportunities and challenges, some of which are described in this issue. DNA sequencing is the process of “reading” a DNA fragment (referred to as a “read”) and determining the exact order of DNA bases (the four possible nucleotides, that are Adenine, Guanine, Cytosine, and Thymine) that compose a given DNA strand. Research in biology and medicine has been revolutionised and accelerated by the advances of DNA and even RNA sequencing biotechnologies

INRIA a CCSD electronic archive server

Archivio della Ricerca - Università di Pisa

A graph theoretical analysis of the energy landscape of model polymers

Author: D. J. Wales
Lapo Casetti
Lorenzo Bongini
Lorenzo Tattini
Marco Baiesi
N. G. Van Kampen
P. J. Flory
Publication venue: 'American Physical Society (APS)'
Publication date: 01/01/2009
Field of study

In systems characterized by a rough potential energy landscape, local energetic minima and saddles define a network of metastable states whose topology strongly influences the dynamics. Changes in temperature, causing the merging and splitting of metastable states, have non trivial effects on such networks and must be taken into account. We do this by means of a recently proposed renormalization procedure. This method is applied to analyze the topology of the network of metastable states for different polypeptidic sequences in a minimalistic polymer model. A smaller spectral dimension emerges as a hallmark of stability of the global energy minimum and highlights a non-obvious link between dynamic and thermodynamic properties.Comment: 15 pages, 15 figure

arXiv.org e-Print Archive

Crossref

Archivio istituzionale della ricerca - Università di Padova

Enhanced copy number variants detection from whole-exome sequencing data using EXCAVATOR2

Author: D&apos
Giusti Betti
Magi Alberto
Pellegrini Marco
Pippucci Tommaso
Tattini Lorenzo
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2016
Field of study

Copy Number Variants (CNVs) are structural rear- rangements contributing to phenotypic variation that have been proved to be associated with many dis- ease states. Over the last years, the identification of CNVs from whole-exome sequencing (WES) data has become a common practice for research and clinical purpose and, consequently, the demand for more and more efficient and accurate methods has increased. In this paper, we demonstrate that more than 30% of WES data map outside the targeted re- gions and that these reads, usually discarded, can be exploited to enhance the identification of CNVs from WES experiments. Here, we present EXCAVATOR2, the first read count based tool that exploits all the reads produced by WES experiments to detect CNVs with a genome-wide resolution. To evaluate the per- formance of our novel tool we use it for analysing two WES data sets, a population data set sequenced by the 1000 Genomes Project and a tumor data set made of bladder cancer samples. The results obtained from these analyses demonstrate that EXCAVATOR2 out- performs other four state-of-the-art methods and that our combined approach enlarge the spectrum of detectable CNVs from WES data with an unprece- dented resolution

Florence Research

PubMed Central

PUblication MAnagement

phyBWT: Alignment-Free Phylogeny via eBWT Positional Clustering

Author: Conte Alessio
Grossi Roberto
Guerrini Veronica
Liti Gianni
Rosone Giovanna
Tattini Lorenzo
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 22nd International Workshop on Algorithms in Bioinformatics (WABI 2022)
Publication date: 01/01/2022
Field of study

Molecular phylogenetics is a fundamental branch of biology. It studies the evolutionary relationships among the individuals of a population through their biological sequences, and may provide insights about the origin and the evolution of viral diseases, or highlight complex evolutionary trajectories. In this paper we develop a method called phyBWT, describing how to use the extended Burrows-Wheeler Transform (eBWT) for a collection of DNA sequences to directly reconstruct phylogeny, bypassing the alignment against a reference genome or de novo assembly. Our phyBWT hinges on the combinatorial properties of the eBWT positional clustering framework. We employ eBWT to detect relevant blocks of the longest shared substrings of varying length (unlike the k-mer-based approaches that need to fix the length k a priori), and build a suitable decomposition leading to a phylogenetic tree, step by step. As a result, phyBWT is a new alignment-, assembly-, and reference-free method that builds a partition tree without relying on the pairwise comparison of sequences, thus avoiding to use a distance matrix to infer phylogeny. The preliminary experimental results on sequencing data show that our method can handle datasets of different types (short reads, contigs, or entire genomes), producing trees of quality comparable to that found in the benchmark phylogeny

INRIA a CCSD electronic archive server

Dagstuhl Research Online Publication Server

Coherent periodic activity in excitatory neural networks : the role of network connectivity

Author: Alessandro Torcini
C Allene
C van Vreeswijk
G Buszaki
I Belykh
Lorenzo Tattini
P Bonifazi
S Olmi
Simona Olmi
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Characterization and identification of hidden rare variants in the human genome

Author: Abbate Rosanna
Cifola Ingrid
D\u27Aurizio Romina
Gensini Gian Franco
Giusti Betti
Magi Alberto
Palombo Flavia
Pippucci Tommaso
Romeo Giovanni
Semeraro Roberto
Tattini Lorenzo
Publication venue: BIOMED CENTRAL
Publication date: 01/01/2015
Field of study

BackgroundBy examining the genotype calls generated by the 1000 Genomes Project we discovered that the human reference genome GRCh37 contains almost 20,000 loci in which the reference allele has never been observed in healthy individuals and around 70,000 loci in which it has been observed only in the heterozygous state.ResultsWe show that a large fraction of this rare reference allele (RRA) loci belongs to coding, functional and regulatory elements of the genome and could be linked to rare Mendelian disorders as well as cancer. We also demonstrate that classical germline and somatic variant calling tools are not capable to recognize the rare allele when present in these loci. To overcome such limitations, we developed a novel tool, named RAREVATOR, that is able to identify and call the rare allele in these genomic positions. By using a small cancer dataset we compared our tool with two state-of-the-art callers and we found that RAREVATOR identified more than 1,500 germline and 22 somatic RRA variants missed by the two methods and which belong to significantly mutated pathways.ConclusionsThese results show that, to date, the investigation of around 100,000 loci of the human genome has been missed by re-sequencing experiments based on the GRCh37 assembly and that our tool can fill the gap left by other methods. Moreover, the investigation of the latest version of the human reference genome, GRCh38, showed that although the GRC corrected almost all insertions and a small part of SNVs and deletions, a large number of functionally relevant RRAs still remain unchanged. For this reason, also future resequencing experiments, based on GRCh38, will benefit from RAREVATOR analysis results. RAREVATOR is freely available at http://sourceforge.net/projects/rarevator

Springer - Publisher Connector

Florence Research

PubMed Central

PUblication MAnagement