Search CORE

4 research outputs found

Repeat Detector: versatile sizing of expanded tandem repeats and identification of interrupted alleles from targeted DNA sequencing

Author: Aeschbach Lorene
Barros Dinis
Barszcz Paula
Ciosi Marc
Davidson Alice E.
Dion Vincent
Gobet Nastassia
Hafford-Tear Nathaniel J.
Heuchan Eleanor R.
Jones Lesley
Massey Thomas H.
McAllister Branduff
Monckton Darren G.
Morgan Joanne
network REGISTRY Investigators of the European Huntington’s disease
Randall Emma L.
Schuepbach Thierry
Taylor Alysha S.
Trofimenko Evgeniya
Xenarios Ioannis
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2022
Field of study

Targeted DNA sequencing approaches will improve how the size of short tandem repeats is measured for diagnostic tests and preclinical studies. The expansion of these sequences causes dozens of disorders, with longer tracts generally leading to a more severe disease. Interrupted alleles are sometimes present within repeats and can alter disease manifestation. Determining repeat size mosaicism and identifying interruptions in targeted sequencing datasets remains a major challenge. This is in part because standard alignment tools are ill-suited for repetitive and unstable sequences. To address this, we have developed Repeat Detector (RD), a deterministic profile weighting algorithm for counting repeats in targeted sequencing data. We tested RD using blood-derived DNA samples from Huntington’s disease and Fuchs endothelial corneal dystrophy patients sequenced using either Illumina MiSeq or Pacific Biosciences single-molecule, real-time sequencing platforms. RD was highly accurate in determining repeat sizes of 609 blood-derived samples from Huntington’s disease individuals and did not require prior knowledge of the flanking sequences. Furthermore, RD can be used to identify alleles with interruptions and provide a measure of repeat instability within an individual. RD is therefore highly versatile and may find applications in the diagnosis of expanded repeat disorders and in the development of novel therapies

Online Research @ Cardiff

Serveur académique lausannois

PubMed Central

UCL Discovery

Enlighten

A systems genetics approach for sleep regulation

Author: Gobet Nastassia
Publication venue: Université de Lausanne, Faculté de biologie et médecine
Publication date: 01/01/2023
Field of study

Sleep is a daily behavior important for health. Many people studied sleep with more or less sophisticated technologies over time, and yet it has not revealed all its mysteries. To help uncover the molecular consequences of sleep deprivation, the Franken group have assembled a systems genetics resource interrogating the BXD mouse panel. The genotypes and sleep-wake phenome were characterized, along with intermediate phenotypes: the transcriptome in brain and in liver, and the targeted metabolome in the blood plasma. I have used this rich multi-omics BXD dataset for computational investigation and development of analytical methods for data and knowledge integration to expand the current understanding of sleep regulation. First, in collaboration with Maxime Jan we used this real-world example of data and bioinformatic analysis management to highlight multi-omics challenges and solutions used to help internal or external reusability. This includes more details on the quality check and validations of the methods, the use of Rmarkdown reports for more higher levels parts of the analyses, a metadata workflow document illustrating and referencing the different code and data files, and a web site for exploration of the results. The robustness of the results was also assessed through the change to the newest version of the mouse genome reference assembly used. Then, the classical pipeline to analyse RNA-sequencing reads uses one mouse reference for all samples, irrespective of the strain of the samples, which is potentially creates a reference bias. Therefore, to improve the genetic-specificity of the read mapping, I customized the standard assembly based on one parental strain with variants from the BXD population. An important step was adding a tailored imputation of the population genetic variants using haplotypes blocks/regions to achieve a sufficient resolution for each line-specific reference. This strategy alleviated the reference bias and allowed to detect proportionally more eQTLs with the custom BXD-specific references than with the standard reference. Lastly, I assembled a multi-layer prior knowledge network and integrated the gene expression sleep-specific on it. This integration of data-driven and knowledge driven approach sets the basis for a way to generate hypotheses based on multiple genes to explain the genetic and environmental interactions culminating in the different sleep phenotypes. -- Le sommeil est un comportement quotidien important pour la santé. De nombreuses personnes ont étudié le sommeil avec des technologies plus ou moins sophistiquées au fil du temps, et il n’a cependant pas encore révélé tous ses mystères. Pour aider a` découvrir les conséquences moléculaires de la privation de sommeil, le groupe Franken a assemblé une ressource de génétique des systèmes relative aux lignées de souris BXD. Les génotypes et le phénome de sommeil-éveil ont été charactérisés, ainsi que des phénotypes intermédiaires : d’une part le transcriptome dans le cerveau et le foie, d’autre part le métabolome ciblé dans le plasma sanguin. J’ai utilisé ce riche jeu de données multi-omics sur les BXD pour le développement de méthodes analytiques pour l’intégration de donnees et de connaissances afin d’étendre la compréhension actuelle de la regulation du sommeil. D’abord, en collaboration avec Maxime Jan, nous avons utilisé cet exemple réel de la gestion des données et de l’analyse bioinformatique pour mettre en évidence les défis multi-omics et les solutions utilisées pour que le travail puisse être réutilisé à l’interne ou à l’externe. Cela inclut plus de détails sur le contrˆole de qualité et les validations des méthodes, l’utilisation de rapports Rmarkdown pour les parties de plus haut niveau d’abstraction des analyses, un document concernant les méta-données du flux de travail pour illustrer et référencer les différents scripts et fichiers de données et un site web pour l’exploration des résultats. La stabilité des résultats a également été évaluée au travers du changement de version de l’assemblée de réference utilisée. Puis, la pipeline traditionnelle pour analyser des reads de séquen¸cage d’ARN utilise une référence murine pour tous les échantillons, quelle que soit leur souche. Afin d’améliorer la spécificité génétique du mapping des reads, j’ai utilisé et personnalisé l’assemblée standard basée sur une souche parentale avec les variants de la population BXD. L’imputation des variants génétiques en utilisant les blocs/régions haplotypes était importante pour obtenir une résolution suffisante pour chacune des lignées. Cette stratégie a diminué le biais de référence et a permis de détecter proportionnellement plus d’eQTLs avec les références spécifiques aux BXD qu’avec la référence traditionnelle. Finalement, j’ai assemblé un réseau à plusieurs couches de connaissances préalables et y ait intégré l’expression des gènes contenant la composante spécique au sommeil. L’intégration des approches basées sur les données et les connaissances préalables met en place la base pour un moyen de générer des hypothèses basées sur plusieurs gènes pour expliquer les interactions génétiques et environmentales provoquant les différents phénotypes du sommeil

Serveur académique lausannois

Evaluating mapping parameters.

Author: Ioannis Xenarios (38311)
Maxime Jan (5617592)
Nastassia Gobet (6420107)
Paul Franken (82956)
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 26/09/2022
Field of study

A. The performance on local eQTLs of selected mapping settings on cortex samples (average of the NSD and SD conditions) is measured by the percentage of expressed genes that have a significant local eQTL. The BXD-specific references were used. C. As in A but for liver samples.</p

FigShare

Structural variant calling: the long and the short of it

Author: A Dobin
A Frankish
A Gillet-Markowska
A Sanchis-Juan
AC English
AD Sanders
AD Sanders
AL Delcher
AW Pang
B Langmead
B Schule
C Alkan
C Ma
C Tian
C Trapnell
Christophe Dessimoz
CM Carvalho
CS Chin
D Kim
D Kim
DC Bragg
DC Jeffares
Diana Ivette Cruz-Dávalos
DL Cameron
E Butelli
FJ Sedlazeck
FJ Sedlazeck
Fritz J. Sedlazeck
GH Perry
H Cao
H Li
HY Lam
I Gabur
J Huddleston
J Wang
JA Wala
JL Weirather
JM Belton
JM Zook
JR Dixon
JT Simpson
K Chen
K Chen
K Trappe
K Ye
K Yi
L Tattini
L Yang
LG Maron
LS Friedman
M Benelli
M Carrara
M Cretu Stancu
M Leija-Salazar
M Lek
M Levy-Sakin
M Meyerson
M Mohiyuddin
M Nattestad
M Nattestad
M Smolka
Medhat Mahmoud
MJ Chaisson
MJ Chaisson
MJP Chaisson
N Nagarajan
N Spies
N Stransky
Nastassia Gobet
Ninon Mounier
NM Davidson
O Couronne
P McColgan
PA Audano
PH Sudmant
R Wooster
RE Mills
RM Layer
RP Abo
S Goodwin
S Kumar
S Kumar
S Liu
S Soyk
S Tian
SM Kielbasa
SM Teo
T Becker
T Rausch
T Sutton
TA Gaines
X Chen
X Fan
Z Chong
Z Iqbal
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref