137 research outputs found
An Integrated TCGA Pan-Cancer Clinical Data Resource to Drive High-Quality Survival Outcome Analytics
For a decade, The Cancer Genome Atlas (TCGA) program collected clinicopathologic annotation data along with multi-platform molecular profiles of more than 11,000 human tumors across 33 different cancer types. TCGA clinical data contain key features representing the democratized nature of the data collection process. To ensure proper use of this large clinical dataset associated with genomic features, we developed a standardized dataset named the TCGA Pan-Cancer Clinical Data Resource (TCGA-CDR), which includes four major clinical outcome endpoints. In addition to detailing major challenges and statistical limitations encountered during the effort of integrating the acquired clinical data, we present a summary that includes endpoint usage recommendations for each cancer type. These TCGA-CDR findings appear to be consistent with cancer genomics studies independent of the TCGA effort and provide opportunities for investigating cancer biology using clinical correlates at an unprecedented scale. Analysis of clinicopathologic annotations for over 11,000 cancer patients in the TCGA program leads to the generation of TCGA Clinical Data Resource, which provides recommendations of clinical outcome endpoint usage for 33 cancer types
A Pre-Landing Assessment of Regolith Properties at the InSight Landing Site
This article discusses relevant physical properties of the regolith at the Mars InSight landing site as understood prior to landing of the spacecraft. InSight will land in the northern lowland plains of Mars, close to the equator, where the regolith is estimated to be ≥3--5 m thick. These investigations of physical properties have relied on data collected from Mars orbital measurements, previously collected lander and rover data, results of studies of data and samples from Apollo lunar missions, laboratory measurements on regolith simulants, and theoretical studies. The investigations include changes in properties with depth and temperature. Mechanical properties investigated include density, grain-size distribution, cohesion, and angle of internal friction. Thermophysical properties include thermal inertia, surface emissivity and albedo, thermal conductivity and diffusivity, and specific heat. Regolith elastic properties not only include parameters that control seismic wave velocities in the immediate vicinity of the Insight lander but also coupling of the lander and other potential noise sources to the InSight broadband seismometer. The related properties include Poisson’s ratio, P- and S-wave velocities, Young’s modulus, and seismic attenuation. Finally, mass diffusivity was investigated to estimate gas movements in the regolith driven by atmospheric pressure changes. Physical properties presented here are all to some degree speculative. However, they form a basis for interpretation of the early data to be returned from the InSight mission.Additional co-authors: Nick Teanby and Sharon Keda
Track D Social Science, Human Rights and Political Science
Peer Reviewedhttps://deepblue.lib.umich.edu/bitstream/2027.42/138414/1/jia218442.pd
Driver Fusions and Their Implications in the Development and Treatment of Human Cancers.
Gene fusions represent an important class of somatic alterations in cancer. We systematically investigated fusions in 9,624 tumors across 33 cancer types using multiple fusion calling tools. We identified a total of 25,664 fusions, with a 63% validation rate. Integration of gene expression, copy number, and fusion annotation data revealed that fusions involving oncogenes tend to exhibit increased expression, whereas fusions involving tumor suppressors have the opposite effect. For fusions involving kinases, we found 1,275 with an intact kinase domain, the proportion of which varied significantly across cancer types. Our study suggests that fusions drive the development of 16.5% of cancer cases and function as the sole driver in more than 1% of them. Finally, we identified druggable fusions involving genes such as TMPRSS2, RET, FGFR3, ALK, and ESR1 in 6.0% of cases, and we predicted immunogenic peptides, suggesting that fusions may provide leads for targeted drug and immune therapy
Association of variants in the SPTLC1 gene with juvenile amyotrophic lateral sclerosis
IMPORTANCE Juvenile amyotrophic lateral sclerosis (ALS) is a rare form of ALS characterized by age of symptom onset less than 25 years and a variable presentation.OBJECTIVE To identify the genetic variants associated with juvenile ALS.DESIGN, SETTING, AND PARTICIPANTS In this multicenter family-based genetic study, trio whole-exome sequencing was performed to identify the disease-associated gene in a case series of unrelated patients diagnosed with juvenile ALS and severe growth retardation. The patients and their family members were enrolled at academic hospitals and a government research facility between March 1, 2016, and March 13, 2020, and were observed until October 1, 2020. Whole-exome sequencing was also performed in a series of patients with juvenile ALS. A total of 66 patients with juvenile ALS and 6258 adult patients with ALS participated in the study. Patients were selected for the study based on their diagnosis, and all eligible participants were enrolled in the study. None of the participants had a family history of neurological disorders, suggesting de novo variants as the underlying genetic mechanism.MAIN OUTCOMES AND MEASURES De novo variants present only in the index case and not in unaffected family members.RESULTS Trio whole-exome sequencing was performed in 3 patients diagnosed with juvenile ALS and their parents. An additional 63 patients with juvenile ALS and 6258 adult patients with ALS were subsequently screened for variants in the SPTLC1 gene. De novo variants in SPTLC1 (p. Ala20Ser in 2 patients and p.Ser331Tyr in 1 patient) were identified in 3 unrelated patients diagnosed with juvenile ALS and failure to thrive. A fourth variant (p.Leu39del) was identified in a patient with juvenile ALS where parental DNA was unavailable. Variants in this gene have been previously shown to be associated with autosomal-dominant hereditary sensory autonomic neuropathy, type 1A, by disrupting an essential enzyme complex in the sphingolipid synthesis pathway.CONCLUSIONS AND RELEVANCE These data broaden the phenotype associated with SPTLC1 and suggest that patients presenting with juvenile ALS should be screened for variants in this gene.Genetics of disease, diagnosis and treatmen
Whole-exome sequencing study identifies four novel gene loci associated with diabetic kidney disease
Diabetic kidney disease (DKD) is recognized as an important public health challenge. However, its genomic mechanisms are poorly understood. To identify rare variants for DKD, we conducted a whole-exome sequencing (WES) study leveraging large cohorts well-phenotyped for chronic kidney disease and diabetes. Our two-stage WES study included 4372 European and African ancestry participants from the Chronic Renal Insufficiency Cohort and Atherosclerosis Risk in Communities studies (stage 1) and 11 487 multi-ancestry Trans-Omics for Precision Medicine participants (stage 2). Generalized linear mixed models, which accounted for genetic relatedness and adjusted for age, sex and ancestry, were used to test associations between single variants and DKD. Gene-based aggregate rare variant analyses were conducted using an optimized sequence kernel association test implemented within our mixed model framework. We identified four novel exome-wide significant DKD-related loci through initiating diabetes. In single-variant analyses, participants carrying a rare, in-frame insertion in the DIS3L2 gene (rs141560952) exhibited a 193-fold increased odds [95% confidence interval (CI): 33.6, 1105] of DKD compared with noncarriers (P = 3.59 × 10-9). Likewise, each copy of a low-frequency KRT6B splice-site variant (rs425827) conferred a 5.31-fold higher odds (95% CI: 3.06, 9.21) of DKD (P = 2.72 × 10-9). Aggregate gene-based analyses further identified ERAP2 (P = 4.03 × 10-8) and NPEPPS (P = 1.51 × 10-7), which are both expressed in the kidney and implicated in renin-angiotensin-aldosterone system modulated immune response. In the largest WES study of DKD, we identified novel rare variant loci attaining exome-wide significance. These findings provide new insights into the molecular mechanisms underlying DKD
Mapping and characterization of structural variation in 17,795 human genomes
A key goal of whole-genome sequencing for studies of human genetics is to interrogate all forms of variation, including single-nucleotide variants, small insertion or deletion (indel) variants and structural variants. However, tools and resources for the study of structural variants have lagged behind those for smaller variants. Here we used a scalable pipeline1 to map and characterize structural variants in 17,795 deeply sequenced human genomes. We publicly release site-frequency data to create the largest, to our knowledge, whole-genome-sequencing-based structural variant resource so far. On average, individuals carry 2.9 rare structural variants that alter coding regions; these variants affect the dosage or structure of 4.2 genes and account for 4.0–11.2% of rare high-impact coding alleles. Using a computational model, we estimate that structural variants account for 17.2% of rare alleles genome-wide, with predicted deleterious effects that are equivalent to loss-of-function coding alleles; approximately 90% of such structural variants are noncoding deletions (mean 19.1 per genome). We report 158,991 ultra-rare structural variants and show that 2% of individuals carry ultra-rare megabase-scale structural variants, nearly half of which are balanced or complex rearrangements. Finally, we infer the dosage sensitivity of genes and noncoding elements, and reveal trends that relate to element class and conservation. This work will help to guide the analysis and interpretation of structural variants in the era of whole-genome sequencing
Scalable Open Science Approach for Mutation Calling of Tumor Exomes Using Multiple Genomic Pipelines
The Cancer Genome Atlas (TCGA) cancer genomics dataset includes over 10,000 tumor-normal exome pairs across 33 different cancer types, in total >400 TB of raw data files requiring analysis. Here we describe the Multi-Center Mutation Calling in Multiple Cancers project, our effort to generate a comprehensive encyclopedia of somatic mutation calls for the TCGA data to enable robust cross-tumor-type analyses. Our approach accounts for variance and batch effects introduced by the rapid advancement of DNA extraction, hybridization-capture, sequencing, and analysis methods over time. We present best practices for applying an ensemble of seven mutation-calling algorithms with scoring and artifact filtering. The dataset created by this analysis includes 3.5 million somatic variants and forms the basis for PanCan Atlas papers. The results have been made available to the research community along with the methods used to generate them. This project is the result of collaboration from a number of institutes and demonstrates how team science drives extremely large genomics projects
Perspective on Oncogenic Processes at the End of the Beginning of Cancer Genomics
The Cancer Genome Atlas (TCGA) has catalyzed systematic characterization of diverse genomic alterations underlying human cancers. At this historic junction marking the completion of genomic characterization of over 11,000 tumors from 33 cancer types, we present our current understanding of the molecular processes governing oncogenesis. We illustrate our insights into cancer through synthesis of the findings of the TCGA PanCancer Atlas project on three facets of oncogenesis: (1) somatic driver mutations, germline pathogenic variants, and their interactions in the tumor; (2) the influence of the tumor genome and epigenome on transcriptome and proteome; and (3) the relationship between tumor and the microenvironment, including implications for drugs targeting driver events and immunotherapies. These results will anchor future characterization of rare and common tumor types, primary and relapsed tumors, and cancers across ancestry groups and will guide the deployment of clinical genomic sequencing
- …