7 research outputs found
Multiple Recurrent De Novo CNVs, Including Duplications of the 7q11.23 Williams Syndrome Region, Are Strongly Associated with Autism
SummaryWe have undertaken a genome-wide analysis of rare copy-number variation (CNV) in 1124 autism spectrum disorder (ASD) families, each comprised of a single proband, unaffected parents, and, in most kindreds, an unaffected sibling. We find significant association of ASD with de novo duplications of 7q11.23, where the reciprocal deletion causes Williams-Beuren syndrome, characterized by a highly social personality. We identify rare recurrent de novo CNVs at five additional regions, including 16p13.2 (encompassing genes USP7 and C16orf72) and Cadherin 13, and implement a rigorous approach to evaluating the statistical significance of these observations. Overall, large de novo CNVs, particularly those encompassing multiple genes, confer substantial risks (OR = 5.6; CI = 2.6–12.0, p = 2.4 × 10-7). We estimate there are 130–234 ASD-related CNV regions in the human genome and present compelling evidence, based on cumulative data, for association of rare de novo events at 7q11.23, 15q11.2-13.1, 16p11.2, and Neurexin 1
Representing Cells As Sentences Enables Natural Language Processing For Single Cell Transcriptomics
Gene expression matrices commonly used in single-cell transcriptomics cannot be directly analyzed with tools developed for natural languages. Restructuring these matrices as abundance-ordered sequences of genes allows the generation of cell sentences: rank-normalized, positionally encoded sequence-structured expression data. The rank-normalization procedure also minimizes batch effects from differential sequencing depth, and comparison of cell sentences against other tools for batch integration shows that cell sentences achieve comparable performance in batch effect removal and biological effect preservation. After transformation, cell sentences can be analyzed using any existing tools from natural language processing that take text as input, enabling a host of new ways to process and understand single-cell transcriptomics data. As an example, a machine translation approach is applied to cells from neural retina, unifying cell and gene representations across species. Finally, this approach can also be used to transform a number of other data modalities to sequential formats, including imaging data. Testing of neural network architectures pretrained on language tasks against those specialized for vision tasks shows that in low-data scenarios, language models can outperform vision models in image classification tasks. These findings suggest homology in the underlying structure of natural language and natural images which may be of particular interest in machine learning for medical imaging, where small datasets are the norm
Single-cell analysis reveals inflammatory interactions driving macular degeneration
Abstract Due to commonalities in pathophysiology, age-related macular degeneration (AMD) represents a uniquely accessible model to investigate therapies for neurodegenerative diseases, leading us to examine whether pathways of disease progression are shared across neurodegenerative conditions. Here we use single-nucleus RNA sequencing to profile lesions from 11 postmortem human retinas with age-related macular degeneration and 6 control retinas with no history of retinal disease. We create a machine-learning pipeline based on recent advances in data geometry and topology and identify activated glial populations enriched in the early phase of disease. Examining single-cell data from Alzheimer’s disease and progressive multiple sclerosis with our pipeline, we find a similar glial activation profile enriched in the early phase of these neurodegenerative diseases. In late-stage age-related macular degeneration, we identify a microglia-to-astrocyte signaling axis mediated by interleukin-1β which drives angiogenesis characteristic of disease pathogenesis. We validated this mechanism using in vitro and in vivo assays in mouse, identifying a possible new therapeutic target for AMD and possibly other neurodegenerative conditions. Thus, due to shared glial states, the retina provides a potential system for investigating therapeutic approaches in neurodegenerative diseases
Clinical Characterization and Genomic Analysis of Samples from COVID-19 Breakthrough Infections during the Second Wave among the Various States of India
From March to June 2021, India experienced a deadly second wave of COVID-19, with an increased number of post-vaccination breakthrough infections reported across the country. To understand the possible reason for these breakthroughs, we collected 677 clinical samples (throat swab/nasal swabs) of individuals from 17 states/Union Territories of the country who had received two doses (n = 592) and one dose (n = 85) of vaccines and tested positive for COVID-19. These cases were telephonically interviewed and clinical data were analyzed. A total of 511 SARS-CoV-2 genomes were recovered with genome coverage of higher than 98% from both groups. Analysis of both groups determined that 86.69% (n = 443) of them belonged to the Delta variant, along with Alpha, Kappa, Delta AY.1, and Delta AY.2. The Delta variant clustered into four distinct sub-lineages. Sub-lineage I had mutations in ORF1ab A1306S, P2046L, P2287S, V2930L, T3255I, T3446A, G5063S, P5401L, and A6319V, and in N G215C; Sub-lineage II had mutations in ORF1ab P309L, A3209V, V3718A, G5063S, P5401L, and ORF7a L116F; Sub-lineage III had mutations in ORF1ab A3209V, V3718A, T3750I, G5063S, and P5401L and in spike A222V; Sub-lineage IV had mutations in ORF1ab P309L, D2980N, and F3138S and spike K77T. This study indicates that majority of the breakthrough COVID-19 clinical cases were infected with the Delta variant, and only 9.8% cases required hospitalization, while fatality was observed in only 0.4% cases. This clearly suggests that the vaccination does provide reduction in hospital admission and mortality
Congenital rubella syndrome surveillance in India, 2016–21: Analysis of five years surveillance data
Background: In India, facility-based surveillance for congenital rubella syndrome (CRS) was initiated in 2016 to estimate the burden and monitor the progress made in rubella control. We analyzed the surveillance data for 2016–2021 from 14 sentinel sites to describe the epidemiology of CRS. Method: We analyzed the surveillance data to describe the distribution of suspected and laboratory confirmed CRS patients by time, place and person characteristics. We compared clinical signs of laboratory confirmed CRS and discarded case-patients to find independent predictors of CRS using logistic regression analysis and developed a risk prediction model. Results: During 2016–21, surveillance sites enrolled 3940 suspected CRS case-patients (Age 3.5 months, SD: 3.5). About one-fifth (n = 813, 20.6%) were enrolled during newborn examination. Of the suspected CRS patients, 493 (12.5%) had laboratory evidence of rubella infection. The proportion of laboratory confirmed CRS cases declined from 26% in 2017 to 8.7% in 2021. Laboratory confirmed patients had higher odds of having hearing impairment (Odds ratio [OR] = 9.5, 95% confidence interval [CI]: 5.6–16.2), cataract (OR = 7.8, 95% CI: 5.4–11.2), pigmentary retinopathy (OR = 6.7, 95 CI: 3.3–13.6), structural heart defect with hearing impairment (OR = 3.8, 95% CI: 1.2–12.2) and glaucoma (OR = 3.1, 95% CI: 1.2–8.1). Nomogram, along with a web version, was developed. Conclusions: Rubella continues to be a significant public health issue in India. The declining trend of test positivity among suspected CRS case-patients needs to be monitored through continued surveillance in these sentinel sites