
91 research outputs found

Multiple-breed genomic evaluation by principal component analysis in small size populations

Author: Ajmone-Marsan P.
Cellesi M.
Dimauro C.
Gaspa G.
Jorjani H.
Macciotta N.P.P.
Stella A.
Publication venue: Cambridge University Press (CUP)
Publication date: 01/01/2014

In this study, the effects of breed composition and predictor dimensionality on the accuracy of direct genomic values (DGV) in a multiple breed (MB) cattle population were investigated. A total of 3559 bulls of three breeds were genotyped at 54 001 single nucleotide polymorphisms (SNPs): 2093 Holstein (H), 749 Brown Swiss (B) and 717 Simmental (S). DGV were calculated using a principal component (PC) approach for either single breed (SB) or MB scenarios. For comparison, DGV were also computed using all SNP genotypes simultaneously with an SNPBLUP model. A total of seven data sets were used: three with a single breed each, three with different pairs of breeds (HB, HS and BS), and one with all three breeds together (HBS). Editing was performed separately for each scenario. Reference populations differed in breed composition, whereas the validation bulls were the same for all scenarios. The number of SNPs retained after data editing ranged from 36 521 to 41 360. PCs were extracted from actual genotypes. The total number of retained PCs ranged from 4029 to 7284 in Brown Swiss and HBS respectively, reducing the number of predictors by about 85% (from 82% to 89%). Three traits were considered: milk, fat and protein yield. Correlations between deregressed proofs and DGV were used to assess prediction accuracy in validation animals. In the SB scenarios, average DGV accuracy did not change substantially whether SNPBLUP or PC was used. Improvements in DGV accuracy were observed for some traits in Brown Swiss, but only when MB reference populations and the PC approach were used instead of SB-SNPBLUP (+10% HBS and +16% HB for milk yield; +3% HBS and +7% HB for protein yield). Apart from these cases, similar accuracies were observed with MB reference populations under either the PC or the SNPBLUP model. Random variation owing to sampling effects, or to the size and composition of the reference population, may explain the difficulty in finding a defined pattern in the results.
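
The reduction figures quoted in this abstract can be checked with a few lines of arithmetic. The sketch below is only illustrative: the function name `predictor_reduction` is hypothetical, and the pairing of the smallest SNP count with the Brown Swiss scenario and the largest with HBS is an assumption, since the abstract gives the two ranges but does not say which scenario retained how many SNPs.

```python
def predictor_reduction(n_snps: int, n_pcs: int) -> float:
    """Percentage drop in predictors when n_pcs components replace n_snps markers."""
    if n_snps <= 0 or not 0 <= n_pcs <= n_snps:
        raise ValueError("expected 0 <= n_pcs <= n_snps and n_snps > 0")
    return 100.0 * (1.0 - n_pcs / n_snps)


# ASSUMPTION: the abstract does not pair SNP counts with scenarios;
# the extremes of both ranges are matched here purely for illustration.
scenarios = {
    "Brown Swiss": (36521, 4029),
    "HBS": (41360, 7284),
}

for name, (n_snps, n_pcs) in scenarios.items():
    pct = predictor_reduction(n_snps, n_pcs)
    print(f"{name}: {pct:.1f}% fewer predictors")
```

Under that assumed pairing the two values round to 89% and 82%, which matches the range stated in the abstract.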

PubliCatt

Institutional Research Information System University of Turin

Principal component and factor analytic models in international sire evaluation

Author: A Sigurdsson
AC Rencher
AM Tyrisevä
Anna-Maria Tyrisevä
EA Mäntysaari
Esa A Mäntysaari
H Jorjani
H Leclerc
Jette Jakobsen
JW Dürr
K Meyer
Karin Meyer
LR Schaeffer
M Kirkpatrick
Martin H Lidauer
P Madsen
R Thompson
Vincent Ducrocq
W Freddy Fikse
Publication venue: BioMed Central
Publication date: 01/08/2010

Background: Interbull is a non-profit organization that provides internationally comparable breeding values for globalized dairy cattle breeding programmes. Due to different trait definitions and models for genetic evaluation between countries, each biological trait is treated as a different trait in each of the participating countries. This yields a genetic covariance matrix of dimension equal to the number of countries, which typically involves high genetic correlations between countries. If genetic (co)variance matrices are treated as unstructured, this gives rise to several problems, such as over-parameterized models and increased sampling variances. Methods: Principal component (PC) and factor analytic (FA) models allow highly parsimonious representations of the (co)variance matrix compared to the standard multi-trait model and have, therefore, attracted considerable interest for their potential to ease the burden of the estimation process for multiple-trait across country evaluation (MACE). This study evaluated the utility of PC and FA models to estimate variance components and to predict breeding values for MACE for protein yield. This was tested using a dataset comprising Holstein bull evaluations obtained in 2007 from 25 countries. Results: In total, 19 principal components or nine factors were needed to explain the genetic variation in the test dataset. Estimates of the genetic parameters under the optimal fit were almost identical for the two approaches. Furthermore, the results were in good agreement with those obtained from the full rank model and with those provided by Interbull. The estimation time was shortest for models fitting the optimal number of parameters and was prolonged when under- or over-parameterized models were applied. Correlations between estimated breeding values (EBV) from the PC19 and PC25 models were unity. With few exceptions, correlations between EBV obtained using the FA and PC approaches under the optimal fit were ≥ 0.99. For both approaches, EBV correlations decreased when the optimal model was compared with models fitting too few parameters. Conclusions: Genetic parameters from the PC and FA approaches were very similar when the optimal number of principal components or factors was fitted. Over-fitting increased estimation time and standard errors of the estimates but did not affect the estimates of genetic correlations or the predictions of breeding values, whereas fitting too few parameters affected bull rankings in different countries.
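
To give a sense of what "parsimonious" means here, the sketch below counts the free parameters in a 25-country genetic covariance matrix under three structures. The formulas are the usual textbook counts for an unstructured matrix, a rank-r principal component model and an m-factor model with country-specific variances; they are an assumption on my part, because the abstract does not state the exact parameterization used. The function names are hypothetical.

```python
def full_rank_params(q: int) -> int:
    """Free parameters in an unstructured q x q covariance matrix."""
    return q * (q + 1) // 2


def pc_params(q: int, r: int) -> int:
    """Free parameters when only the r leading principal components are fitted."""
    if not 0 < r <= q:
        raise ValueError("rank must satisfy 0 < r <= q")
    return r * (2 * q - r + 1) // 2


def fa_params(q: int, m: int) -> int:
    """Free parameters with m common factors plus q specific variances."""
    if not 0 < m <= q:
        raise ValueError("number of factors must satisfy 0 < m <= q")
    return q + m * (2 * q - m + 1) // 2


q = 25  # number of countries reported in the abstract
print("unstructured:", full_rank_params(q))  # 325
print("PC, rank 19: ", pc_params(q, 19))      # 304
print("FA, 9 factors:", fa_params(q, 9))      # 214
```

Under these assumed counts, fitting all 25 components reproduces the unstructured case (`pc_params(25, 25)` equals `full_rank_params(25)`), which is consistent with the PC25 model being described as full rank.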

Research UNE

Jukuri

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

ProdInra

Principal component approach in variance component estimation for international sire evaluation

Author: AM Tyrisevä
Anna-Maria Tyrisevä
EA Mäntysaari
Esa A Mäntysaari
H Akaike
H Jorjani
H Leclerc
J Tarres
Jette Jakobsen
JH Jakobsen
JHJ van der Werf
K Meyer
Karin Meyer
L Jairath
LR Schaeffer
M Kirkpatrick
Martin H Lidauer
P Madsen
R Rekaya
S Beek van der
T Mark
Vincent Ducrocq
W Freddy Fikse
WF Fikse
Publication venue: BioMed Central
Publication date: 01/01/2011

Background: The dairy cattle breeding industry is a highly globalized business, which needs internationally comparable and reliable breeding values of sires. The International Bull Evaluation Service, Interbull, was established in 1983 to respond to this need. Currently, Interbull performs multiple-trait across country evaluations (MACE) for several traits and breeds in dairy cattle and provides international breeding values to its member countries. Estimating parameters for MACE is challenging since the structure of datasets and conventional use of multiple-trait models easily result in over-parameterized genetic covariance matrices. The number of parameters to be estimated can be reduced by taking into account only the leading principal components of the traits considered. For MACE, this is readily implemented in a random regression model. Methods: This article compares two principal component approaches to estimate variance components for MACE using real datasets. The methods tested were a REML approach that directly estimates the genetic principal components (direct PC) and the so-called bottom-up REML approach (bottom-up PC), in which traits are sequentially added to the analysis and the statistically significant genetic principal components are retained. Furthermore, this article evaluates the utility of the bottom-up PC approach to determine the appropriate rank of the (co)variance matrix. Results: Our study demonstrates the usefulness of both approaches and shows that they can be applied to large multi-country models considering all concerned countries simultaneously. These strategies can thus replace the current practice of estimating the required covariance components through a series of analyses involving selected subsets of traits. Our results support the importance of using the appropriate rank in the genetic (co)variance matrix. Using too low a rank resulted in biased parameter estimates, whereas too high a rank did not result in bias but increased the standard errors of the estimates and, notably, the computing time. Conclusions: In terms of estimation accuracy, both principal component approaches performed equally well and permitted the use of more parsimonious models through random regression MACE. The advantage of the bottom-up PC approach is that it does not need any prior knowledge of the rank. However, with a predetermined rank, the direct PC approach needs less computing time than the bottom-up PC approach.

Crossref

Jukuri

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

SNPchiMp v.3: integrating and standardizing single nucleotide polymorphism data for livestock species

Author: Brauning R.
Brew F.
Caprera A.
Cozzi P.
Evans G.
Jorjani H.
Lawley C.
Nazzicari N.
Nicolazzi E.
Pirani A.
Simpson B.
Soans C.
Stella A.
Strozzi F.
Tosser-Klopp G.
Williams J.
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2015

Published: 10 April 2015

In recent years, the use of genomic information in livestock species for genetic improvement, association studies and many other fields has become routine. In order to accommodate different market requirements in terms of genotyping cost, manufacturers of single nucleotide polymorphism (SNP) arrays, private companies and international consortia have developed a large number of arrays with different content and different SNP density. The number of currently available SNP arrays differs among species, ranging from one for goats to more than ten for cattle, and the number of arrays available is increasing rapidly. However, there is limited or no effort to standardize and integrate array-specific (e.g. SNP IDs, allele coding) and species-specific (i.e. past and current assemblies) SNP information. Here we present SNPchiMp v.3, a solution to these issues for the six major livestock species (cow, pig, horse, sheep, goat and chicken). Original data was collected directly from SNP array producers and specific international genome consortia, and stored in a MySQL database. The database was then linked to an open-access web tool and to public databases. SNPchiMp v.3 ensures fast access to the database (retrieving within/across SNP array data) and the possibility of annotating SNP array data in a user-friendly fashion. This platform allows easy integration and standardization, and it is aimed at both industry and research. It also enables users to easily link the information available from the array producer with data in public databases, without the need of additional bioinformatics tools or pipelines. In recognition of the open-access use of Ensembl resources, SNPchiMp v.3 was officially credited as an Ensembl E!mpowered tool. Availability at http://bioinformatics.tecnoparco.org/SNPchimp.

Ezequiel L Nicolazzi, Andrea Caprera, Nelson Nazzicari, Paolo Cozzi, Francesco Strozzi, Cindy Lawley, Ali Pirani, Chandrasen Soans, Fiona Brew, Hossein Jorjani, Gary Evans, Barry Simpson, Gwenola Tosser-Klopp, Rudiger Brauning, John L Williams and Alessandra Stell

Crossref

Adelaide Research & Scholarship

Springer - Publisher Connector

PubMed Central

ProdInra

Genomic evaluation for a three-way crossbreeding system considering breed-of-origin of alleles

Author: A Gilmour
A Roos de
A Stuart
A Wolc
B Zumbach
BJ Hayes
BS Weir
CA Sevillano
Claudia A. Sevillano
DA Lourenco
E Lutaaya
H Brandt
H Esfandyari
H Jorjani
IE Grevenhof Van
J Přibyl
J Vandenplas
JCM Dekkers
Jeremie Vandenplas
JL Jannink
JM Hickey
John W. M. Bastiaansen
M Saatchi
M Sargolzaei
M Wei
Mario P. L. Calus
ML Makgahlela
MP Calus
MS Lopes
N Ibáñez-Escriche
N Moghaddar
OF Christensen
P Knap
PM VanRaden
R Veroneze
Rob Bergsma
S Nakavisut
S Newman
T Xiang
TA Schrag
TH Meuwissen
Publication venue: Springer Science and Business Media LLC

Crossref

Cytotoxic evaluation of Melia azedarach in comparison with, Azadirachta indica and its phytochemical investigation

Author: A Ansarishirazi
A Ghosh
A Kelecom
A Zargari
AA Timofeev
AH Brantner
AP Knight
Avicenna
E Barrau
EK Akkol
H Kim
H Nagata
H-B Liu
HS Puri
I Sakane
K Gomathi
K Rishi
M Cintra-Francischinelli
M Dousti
M Tomczyk
MC Alley
MC Carpinella
MH Aghili Alavi Khorasani
MM Tonekaboni
MMO Cabral
ND Yuliana
NM El-Sawia
O Koul
P Ciuffreda
PB Oelrichs
PK Agrawal
R Jayaraj
R Takeyama
R Vijayaraghavan
RA Hiipakka
RE Trudel
RJ Verma
SE Jorjani
SPH Alexander
SV Shetab-Boushehri
TC Wikramanayake
V Lakshmi
WB Mors
Publication venue: Springer Science and Business Media LLC

Crossref

The Non-Coding Transcriptome of Prostate Cancer: Implications for Clinical Practice

Author: A Dallaire
A Haese
A Heidenreich
A Mehdiani
A Salameh
AG Telonis
AJ Vickers
AM Khalil
BD Kelly
BL Maughan
C Chen
C-L Shih
CF Bennett
CM Hindson
Consortium EP
CP Meyer
CR Ritch
D Duijvesz
D Hanahan
D Hessels
D Hong
D Koppers-Lalic
D Robinson
D Taller
DA Sartori
DP Bartel
DT Miyamoto
DW Wegman
E Pol van der
E Wienholds
E Zoni
ED Crawford
EJ Whitman
ES Martens-Uzunova
F Cappello
F Degliangeli
F Dupuis-Sandoval
F Royo
F Wang
F Yaman Agaoglu
G Gundem
G Obernosterer
G Petrovics
G Ploussard
G Poste
G Raposo
GA Calin
GH Leyten
GS Filonov
H Im
H Jorjani
H Nakanishi
H Poppel van
H Schwarzenbach
HC Nguyen
I Popa
IL Deras
J Boele
J Groskopf
J Lagarde
J Liao
J Lu
J Ma
J Shen
J Yang
JB Kok de
JC Brase
JH Lee
JM Lorenzen
JR Prensner
JS Bono de
JS Paige
JT Wei
JW Catto
K Mannoor
K Nishikura
K Stuopelytė
KA Lennox
KP Porkka
L Fabris
L Gao
L Lof
LB Ferreira
LQ Gu
LS Marks
M Hegemann
M Kosanovic
M Moyano
M Oliveira-Rodriguez
M Olvedy
M Ozen
M Pavon-Eternod
M Puhka
M Re Del
M Salagierski
MH Veldman-Jones
MJ Bussemakers
MJ Donovan
MJ Roberts
MK-LD Wachalska
MP Gils van
MY Shah
N Erho
P Krishnan
P Kumar
P Landgraf
PS Mitchell
PT Nelson
R Bottcher
R Crescitelli
R Jiang
R Mahn
R Malik
R Mehra
RJ Bryant
RJ Taft
S Ambs
S Eissa
S Ren
S Shukla
S Volinia
S Wagner
SK Channavajjhala
SL Maas
SLN Maas
SM Aubin
SS Kanwar
T Derrien
T Goda
T Hung
TM Wheeler
U Erdbrugger
V Mouraviev
V Srikantan
VA Malkov
VM Velonas
VN Kim
X Wu
Y Ceder
Y Okugawa
Y Yamamoto
Y Yoshioka
YH Park
YS Lee
Publication venue: Springer Science and Business Media LLC

Crossref

Discovery of widespread transcription initiation at microsatellites predictable by sequence-based deep neural network

Author: Abugessaisa Imad
Aitken Stuart
Aken Bronwen L.
Alam Intikhab
Alam Tanvir
Alasiri Rami
Alhendi Ahmad M. N.
Alinejad-Rokny Hamid
Alvarez Mariano J.
Andersson Robin
Arakawa Takahiro
Araki Marito
Arbel Taly
Archer John
Archibald Alan L.
Arner Erik
Arner Peter
Asai Kiyoshi
Ashoor Haitham
Astrom Gaby
Babina M.
Baillie J.K.
Bajic V.B.
Bajpai A.
Baker S.
Baldarelli R.M.
Balic A.
Bansal M.
Batagov A.O.
Batzoglou S.
Beckhouse A.G.
Beltrami A.P.
Beltrami C.A.
Bertin Nicolas
Bessière Chloé
Bhattacharya S.
Bickel P.J.
Blake J.A.
Blanchette M.
Bodega B.
Bonetti A.
Bono H.
Bornholdt J.
Bougouffa S.
Boyd M.
Breda J.
Brombacher F.
Brown J.B.
Bréhélin L.
Böttcher M.
Bult C.J.
Burroughs A.M.
Burt D.W.
Busch A.
Caglio G.
Califano A.
Cameron C.J.
Cannistraci C.V.
Carbone A.
Carlisle A.J.
Carninci Piero
Carter K.W.
Cesselli D.
Chang J.-C.
Chatelain Clement
Chen J.C.
Chen Y.
Chierici M.
Christodoulou J.
Ciani Y.
Clark E.L.
Coskun M.
Dalby M.
Dalla E.
Daub C.O.
Davis C.A.
de Hoon Michiel J. L.
de Rie D.
Denisenko E.
Deplancke B.
Detmar M.
Deviatiiarov R.
Di Bernardo D.
Diehl A.D.
Dieterich L.C.
Dimont E.
Djebali S.
Dohi T.
Dostie J.
Drablos F.
Edge A.S.B.
Edinger M.
Ehrlund A.
Ekwall K.
Elofsson A.
Endoh M.
Enomoto H.
Enomoto S.
Faghihi M.
Fagiolini M.
FANTOM consortium.
Farach-Carson M.C.
Faulkner G.J.
Favorov A.
Fernandes A.M.
Ferrai C.
Forrest A.R.R.
Forrester L.M.
Forsberg M.
Fort A.
Francescatto M.
Freeman T.C.
Frith Martin C.
Fukuda S.
Funayama M.
Furlanello C.
Furuno M.
Furusawa C.
Gao H.
Gazova I.
Gebhard C.
Geier F.
Geijtenbeek T.B.H.
Ghosh S.
Ghosheh Y.
Gingeras T.R.
Gojobori T.
Goldberg T.
Goldowitz D.
Gough J.
Grapotte Mathys
Greco D.
Gruber A.J.
Guhl S.
Guigo R.
Guler R.
Gusev O.
Gustincich S.
Ha T.J.
Haberle V.
Hale P.
Hallstrom B.M.
Hamada M.
Handoko L.
Hara M.
Harbers M.
Harrow J.
Harshbarger J.
Hase T.
Hasegawa Akira
Hashimoto K.
Hatano T.
Hattori N.
Hayashi R.
Hayashizaki Yoshihide
Herlyn M.
Hettne K.
Heutink P.
Hide W.
Hitchens K.J.
Hon C.C.
Hori F.
Horie M.
Horimoto K.
Horton P.
Hou R.
Huang E.
Huang Y.
Hugues R.
Hume D.
Ienasescu H.
Iida K.
Ikawa T.
Ikemura T.
Ikeo K.
Inoue N.
Ishizu Y.
Ito Y.
Itoh Masayoshi
Ivshina A.V.
Jankovic B.R.
Jenjaroenpun P.
Johnson R.
Jorgensen M.
Jorjani H.
Joshi A.
Jurman G.
Kaczkowski B.
Kai C.
Kaida K.
Kajiyama K.
Kaliyaperumal R.
Kaminuma E.
Kanaya T.
Kaneda H.
Kapranov P.
Kasianov A.S.
Kasukawa Takeya
Katayama T.
Kato S.
Kawaguchi S.
Kawai J.
Kawaji H.
Kawamoto H.
Kawamura Y.I.
Kawasaki S.
Kawashima T.
Kempfle J.S.
Kenna T.J.
Kere J.
Khachigian L.
Kiryu H.
Kishima M.
Kitajima H.
Kitamura T.
Kitano H.
Klaric E.
Klepper K.
Klinken S.P.
Kloppmann E.
Knox A.J.
Kodama Y.
Kogo Y.
Kojima M.
Kojima S.
Kojima-Ishiyama Miki
Komatsu N.
Komiyama H.
Kono T.
Koseki H.
Koyasu S.
Kratz A.
Kukalev A.
Kulakovskiy I.
Kundaje A.
Kunikata H.
Kuo R.
Kuo T.
Kuraku S.
Kuznetsov V.A.
Kwon T.J.
Larouche M.
Lassmann T.
Laurent G.S.
Law A.
Le-Cao K.-A.
Lecellier C.-H.
Lee W.
Lenhard B.
Lennartsson A.
Li K.
Li R.
Lilje B.
Lipovich L.
Lizio M.
Lopez G.
Magi S.
Mak G.K.
Makeev V.
Manabe R.
Mandai M.
Mar J.
Maruyama K.
Maruyama T.
Mason E.
Mathelier A.
Matsuda H.
Medvedeva Y.A.
Meehan T.F.
Mejhert N.
Menichelli Christophe
Meynert A.
Mikami N.
Minoda A.
Miura H.
Miyagi Y.
Miyawaki A.
Mizuno Y.
Morikawa H.
Morimoto M.
Morioka M.
Morishita S.
Moro K.
Motakis E.
Motohashi H.
Mukarram A.K.
Mummery C.L.
Mungall C.J.
Murakawa Y.
Muramatsu M.
Murata Mitsuyoshi
Nagasaka K.
Nagase T.
Nakachi Y.
Nakahara F.
Nakai K.
Nakamura K.
Nakamura Y.
Nakazawa T.
Nason G.P.
Nepal C.
Nguyen Q.H.
Nielsen L.K.
Nishida K.
Nishiguchi K.M.
Nishiyori H.
Nishiyori-Sueki Hiromi
Nitta K.
Noguchi Shuhei
Noma Shohei
Notredame C.
Ogishima S.
Ohkura N.
Ohno H.
Ohshima M.
Ohtsu T.
Okada Y.
Okada-Hatakeyama M.
Okazaki Y.
Oksvold P.
Orlando V.
Ow G.S.
Ozturk M.
Pachkov M.
Paparountas T.
Parihar S.P.
Park S.-J.
Pascarella G.
Passier R.
Persson H.
Philippens I.H.
Piazza S.
Plessy C.
Pombo A.
Ponten F.
Poulain S.
Poulsen T.M.
Pradhan S.
Prezioso C.
Pridans C.
Qin X.-Y.
Quackenbush J.
Rackham O.
Ramilowski Jordan A.
Ravasi T.
Rehli M.
Rennie S.
Rito T.
Rizzu P.
Robert C.
Roos M.
Rost B.
Roudnicky F.
Roy R.
Rye M.B.
Sachenkova O.
Saetrom P.
Sai H.
Saiki S.
Saito A.
Saito M.
Sakaguchi S.
Sakai M.
Sakaue S.
Sakaue-Sawano A.
Sandelin A.
Sano H.
Saraswat Manu
Sasamoto Y.
Sato H.
Saxena A.
Saya H.
Schafferhans A.
Schmeier S.
Schmidl C.
Schmocker D.
Schneider C.
Schueler M.
Schultes E.A.
Schulze-Tanzil G.
Semple C.A.
Seno S.
Seo W.
Sese J.
Severin Jessica
Sheng G.
Shi J.
Shimoni Y.
Shin J.W.
SimonSanchez J.
Sivertsson A.
Sjostedt E.
Soderhall C.
Stoiber M.H.
Sugiyama D.
Sui S.H.
Summers K.M.
Suzuki A.M.
Suzuki Harukazu
Suzuki K.
Suzuki M.
Suzuki N.
Suzuki T.
Swanson D.J.
Swoboda R.K.
Tagami Michihira
Taguchi A.
Takahashi H.
Takahashi M.
Takamochi K.
Takeda S.
Takenaka Y.
Tam K.T.
Tanaka H.
Tanaka R.
Tanaka Y.
Tang D.
Taniuchi I.
Tanzer A.
Tarui H.
Taylor M.S.
Terada A.
Terao Y.
Testa A.C.
Thomas M.
Thongjuea S.
Tomii K.
Toyoda H.
Triglia E.T.
Tsang H.G.
Tsujikawa M.
Uhlén M.
Valen E.
van de Wetering M.
van Nimwegen E.
Velmeshev D.
Verardo R.
Vitezic M.
Vitting-Seerup K.
von Feilitzen K.
Voolstra C.R.
Vorontsov I.E.
Wahlestedt C.
Wasserman Wyeth W.
Watanabe K.
Watanabe S.
Wells C.A.
Winteringham L.N.
Wolvetang E.
Yabukami H.
Yagi K.
Yamada T.
Yamaguchi Y.
Yamamoto M.
Yamamoto Y.
Yamanaka Y.
Yano K.
Yasuzawa K.
Yatsuka Y.
Yo M.
Yokokura S.
Yoneda M.
Yoshida E.
Yoshida Y.
Yoshihara M.
Young R.
Young R.S.
Yu N.Y.
Yumoto N.
Zabierowski S.E.
Zhang P.G.
Zucchelli S.
Zwahlen M.
’t Hoen P.A.C.
Publication venue: Nature Publishing Group
Publication date: 15/12/2020

Using the Cap Analysis of Gene Expression (CAGE) technology, the FANTOM5 consortium provided one of the most comprehensive maps of transcription start sites (TSSs) in several species. Strikingly, ~72% of them could not be assigned to a specific gene and initiate at unconventional regions, outside promoters or enhancers. Here, we probe these unassigned TSSs and show that, in all species studied, a significant fraction of CAGE peaks initiate at microsatellites, also called short tandem repeats (STRs). To confirm this transcription, we develop Cap Trap RNA-seq, a technology which combines cap trapping and long read MinION sequencing. We train sequence-based deep learning models able to predict CAGE signal at STRs with high accuracy. These models unveil the importance of STR surrounding sequences not only to distinguish STR classes, but also to predict the level of transcription initiation. Importantly, genetic variants linked to human diseases are preferentially found at STRs with high transcription initiation level, supporting the biological and clinical relevance of transcription initiation at STRs. Together, our results extend the repertoire of non-coding transcription associated with DNA tandem repeats and complexify STR polymorphism.

Approach for evaluating trade-off between agricultural drainage and wetland conservation.

Author: Duinker P.
Jorjani H.
Publication date: 01/01/1988

Wageningen University & Research Publications

An approach for evaluating trade-offs between agricultural drainage and wetland conservation.

Author: Duinker P.
Jorjani H.
Publication date: 01/01/1989

Wageningen University & Research Publications