Glycoprotein gene truncation in avian metapneumovirus subtype C isolates from the United States
The length of the published glycoprotein (G) gene sequences of avian metapneumovirus subtype-C (aMPV-C) isolated from domestic turkeys and wild birds in the United States (1996–2003) remains controversial. To explore the G gene size variation in aMPV-C by the year of isolation and cell culture passage levels, we examined 21 turkey isolates of aMPV-C at different cell culture passages. The early domestic turkey isolates of aMPV-C (aMPV/CO/1996, aMPV/MN/1a-b, and 2a-b/97) had a G gene of 1,798 nucleotides (nt) that coded for a predicted protein of 585 amino acids (aa) and showed >97% nt similarity with that of aMPV-C isolated from Canada geese. This large G gene was truncated upon serial passage in Vero cell cultures by deletion of 1,015 nt near the end of the open reading frame. The recent domestic turkey isolates of aMPV-C lacked the large G gene but instead had a small G gene of 783 nt, irrespective of cell culture passage levels. In some cultures, both large and small genes were detected, indicating the existence of a mixed population of the virus. Apparently, serial passage of aMPV-C in cell cultures and natural passage in turkeys in the field led to truncation of the G gene, which may be a mechanism of virus evolution for survival in a new host or environment.
Discovery of widespread transcription initiation at microsatellites predictable by sequence-based deep neural network
Using the Cap Analysis of Gene Expression (CAGE) technology, the FANTOM5 consortium provided one of the most comprehensive maps of transcription start sites (TSSs) in several species. Strikingly, ~72% of them could not be assigned to a specific gene and initiate at unconventional regions, outside promoters or enhancers. Here, we probe these unassigned TSSs and show that, in all species studied, a significant fraction of CAGE peaks initiate at microsatellites, also called short tandem repeats (STRs). To confirm this transcription, we develop Cap Trap RNA-seq, a technology that combines cap trapping and long-read MinION sequencing. We train sequence-based deep learning models able to predict CAGE signal at STRs with high accuracy. These models unveil the importance of STR surrounding sequences not only to distinguish STR classes, but also to predict the level of transcription initiation. Importantly, genetic variants linked to human diseases are preferentially found at STRs with high transcription initiation level, supporting the biological and clinical relevance of transcription initiation at STRs. Together, our results extend the repertoire of non-coding transcription associated with DNA tandem repeats and complexify STR polymorphism.
Cointegration and market integration: an application to the Indonesian rice market
This article suggests improvements in the use of regression analysis to measure spatial market integration. The procedure pioneered by Ravallion is still widespread but is valid only under certain conditions of exogeneity. The alternative offered here is an error-correction mechanism, which makes it possible to test for exogeneity and also indicates the direction and strength of causality in price formation between markets. The method is illustrated with data on rice prices in different parts of the Indonesian market. The results confirm, among other things, that supply sources are more important than demand sources in driving prices.
Discovery of widespread transcription initiation at microsatellites predictable by sequence-based deep neural network
DOI: 10.1038/s41467-021-23143-7, Nature Communications 121329