118 research outputs found
The Spitzer Survey of Interstellar Clouds in the Gould Belt. VI. The Auriga-California Molecular Cloud observed with IRAC and MIPS
We present observations of the Auriga-California Molecular Cloud (AMC) at
3.6, 4.5, 5.8, 8.0, 24, 70 and 160 micron observed with the IRAC and MIPS
detectors as part of the Spitzer Gould Belt Legacy Survey. The total mapped
areas are 2.5 sq-deg with IRAC and 10.47 sq-deg with MIPS. This giant molecular
cloud is one of two in the nearby Gould Belt of star-forming regions, the other
being the Orion A Molecular Cloud (OMC). We compare source counts, colors and
magnitudes in our observed region to a subset of the SWIRE data that was
processed through our pipeline. Using color-magnitude and color-color diagrams,
we find evidence for a substantial population of 166 young stellar objects
(YSOs) in the cloud, many of which were previously unknown. Most of this
population is concentrated around the LkHalpha 101 cluster and the filament
extending from it. We present a quantitative description of the degree of
clustering and discuss the fraction of YSOs in the region with disks relative
to an estimate of the diskless YSO population. Although the AMC is similar in
mass, size and distance to the OMC, it is forming about 15-20 times fewer
stars.
Comment: 30 pages, 17 figures (2 multipage figures), accepted for publication
in ApJ
Erratum: Author Correction: Identification of genes required for eye development by high-throughput screening of mouse knockouts.
[This corrects the article DOI: 10.1038/s42003-018-0226-0.]
Genome modeling system: A knowledge management platform for genomics
In this work, we present the Genome Modeling System (GMS), an analysis information management system capable of executing automated genome analysis pipelines at a massive scale. The GMS framework provides detailed tracking of samples and data coupled with reliable and repeatable analysis pipelines. The GMS also serves as a platform for bioinformatics development, allowing a large team to collaborate on data analysis, or an individual researcher to leverage the work of others effectively within its data management system. Rather than separating ad-hoc analysis from rigorous, reproducible pipelines, the GMS promotes systematic integration between the two. As a demonstration of the GMS, we performed an integrated analysis of whole genome, exome and transcriptome sequencing data from a breast cancer cell line (HCC1395) and matched lymphoblastoid line (HCC1395BL). These data are available for users to test the software, complete tutorials and develop novel GMS pipeline configurations. The GMS is available at https://github.com/genome/gms
Whole genome analysis for 163 gRNAs in Cas9-edited mice reveals minimal off-target activity.
Genome editing with CRISPR-associated (Cas) proteins holds exceptional promise for correcting variants causing genetic disease. To realize this promise, off-target genomic changes cannot occur during the editing process. Here, we use whole genome sequencing to compare the genomes of 50 Cas9-edited founder mice to 28 untreated control mice to assess the occurrence of S. pyogenes Cas9-induced off-target mutagenesis. Computational analysis of whole-genome sequencing data detects 26 unique sequence variants at 23 predicted off-target sites for 18/163 guides used. While computationally detected variants are identified in 30% (15/50) of Cas9 gene-edited founder animals, only 38% (10/26) of the variants in 8/15 founders validate by Sanger sequencing. In vitro assays for Cas9 off-target activity identify only two unpredicted off-target sites present in genome sequencing data. In total, only 4.9% (8/163) of guides tested have detectable off-target activity, a rate of 0.2 Cas9 off-target mutations per founder analyzed. In comparison, we observe ~1,100 unique variants in each mouse regardless of genome exposure to Cas9, indicating that off-target variants comprise a small fraction of genetic heterogeneity in Cas9-edited mice. These findings will inform future design and use of Cas9-edited animal models as well as provide context for evaluating off-target potential in genetically diverse patient populations.
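The percentages quoted in this abstract can be re-derived from the raw counts it gives. The sketch below is only an arithmetic check, not the authors' analysis. The variable names are illustrative, and reading the 0.2 figure as validated variants divided by founders analyzed (10/50) is an assumption, since the abstract does not spell out that calculation.

```python
# Counts taken from the abstract above; variable names are illustrative only.
founders_total = 50
founders_with_calls = 15
variants_called = 26
variants_validated = 10
guides_total = 163
guides_with_off_target = 8


def pct(numerator, denominator):
    """Return numerator/denominator as a percentage, rounded to one decimal."""
    return round(100 * numerator / denominator, 1)


# Share of founders with computationally detected variants (quoted as 30%).
founder_call_rate = pct(founders_with_calls, founders_total)

# Share of called variants confirmed by Sanger sequencing (quoted as 38%).
validation_rate = pct(variants_validated, variants_called)

# Share of guides with detectable off-target activity (quoted as 4.9%).
guide_off_target_rate = pct(guides_with_off_target, guides_total)

# ASSUMPTION: the quoted 0.2 per founder is validated variants / founders.
mutations_per_founder = variants_validated / founders_total

print(founder_call_rate, validation_rate, guide_off_target_rate,
      mutations_per_founder)
```

Under that reading, each computed value agrees with the abstract to the precision it quotes.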
Genome-wide screening reveals the genetic basis of mammalian embryonic eye development
BACKGROUND: Microphthalmia, anophthalmia, and coloboma (MAC) spectrum disease encompasses a group of eye malformations which play a role in childhood visual impairment. Although the predominant cause of eye malformations is known to be heritable in nature, with 80% of cases displaying loss-of-function mutations in the ocular developmental genes OTX2 or SOX2, the genetic abnormalities underlying the remaining cases of MAC are incompletely understood. Additionally, the pathways involved in eye formation during embryogenesis are incompletely understood. This study aims to identify the novel genes and pathways required for early eye development through systematic forward screening of the mammalian genome.
RESULTS: Query of the International Mouse Phenotyping Consortium (IMPC) database (data release 17.0, August 01, 2022) identified 74 unique knockout lines (genes) with genetically associated eye defects in mouse embryos. The vast majority of eye abnormalities were small or absent eyes, findings most relevant to MAC spectrum disease in humans. A literature search showed that 27 of the 74 lines had previously published knockout mouse models, of which only 15 had ocular defects identified in the original publications. These 12 previously published gene knockouts with no reported ocular abnormalities and the 47 unpublished knockouts with ocular abnormalities identified by the IMPC represent 59 genes not previously associated with early eye development in mice. Of these 59, we identified 19 genes with a reported human eye phenotype. Overall, mining of the IMPC data yielded 40 previously unimplicated genes linked to mammalian eye development. Bioinformatic analysis showed that several of the IMPC genes colocalized to several protein anabolic and pluripotency pathways in early eye development. Of note, our analysis suggests that the serine-glycine pathway producing glycine, a mitochondrial one-carbon donor to folate one-carbon metabolism (FOCM), is essential for eye formation.
CONCLUSIONS: Using genome-wide phenotype screening of single-gene knockout mouse lines, STRING analysis, and bioinformatic methods, this study identified genes heretofore unassociated with MAC phenotypes, providing models to research novel molecular and cellular mechanisms involved in eye development. These findings have the potential to hasten the diagnosis and treatment of this congenital blinding disease.
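The gene counts in this abstract follow from a short chain of subtractions and one addition. The sketch below only replays that arithmetic with the numbers quoted above; the variable names are illustrative and are not taken from the paper.

```python
# Counts quoted in the abstract above; variable names are illustrative only.
lines_with_eye_defects = 74
previously_published = 27
published_with_ocular_defects = 15
with_human_eye_phenotype = 19

# Knockout lines with no previously published mouse model.
unpublished = lines_with_eye_defects - previously_published

# Published knockouts whose original reports described no ocular defect.
published_without_defects = previously_published - published_with_ocular_defects

# Genes not previously associated with early eye development in mice.
new_in_mice = unpublished + published_without_defects

# Genes left after removing those with a reported human eye phenotype.
previously_unimplicated = new_in_mice - with_human_eye_phenotype

print(unpublished, published_without_defects, new_in_mice,
      previously_unimplicated)
```

The intermediate values reproduce the 47, 12, 59 and 40 stated in the abstract.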
Mendelian gene identification through mouse embryo viability screening.
BACKGROUND: The diagnostic rate of Mendelian disorders in sequencing studies continues to increase, along with the pace of novel disease gene discovery. However, variant interpretation in novel genes not currently associated with disease is particularly challenging and strategies combining gene functional evidence with approaches that evaluate the phenotypic similarities between patients and model organisms have proven successful. A full spectrum of intolerance to loss-of-function variation has been previously described, providing evidence that gene essentiality should not be considered as a simple and fixed binary property.
METHODS: Here we further dissected this spectrum by assessing the embryonic stage at which homozygous loss-of-function results in lethality in mice from the International Mouse Phenotyping Consortium, classifying the set of lethal genes into one of three windows of lethality: early, mid, or late gestation lethal. We studied the correlation between these windows of lethality and various gene features including expression across development, paralogy and constraint metrics together with human disease phenotypes. We explored a gene similarity approach for novel gene discovery and investigated unsolved cases from the 100,000 Genomes Project.
RESULTS: We found that genes in the early gestation lethal category have distinct characteristics and are enriched for genes linked with recessive forms of inherited metabolic disease. We identified several genes sharing multiple features with known biallelic forms of inborn errors of metabolism and found signs of enrichment of biallelic predicted pathogenic variants among early gestation lethal genes in patients recruited under this disease category. We highlight two novel gene candidates with phenotypic overlap between the patients and the mouse knockouts.
CONCLUSIONS: Information on the developmental period at which embryonic lethality occurs in the knockout mouse may be used for novel disease gene discovery that helps to prioritise variants in unsolved rare disease cases
Impact of essential genes on the success of genome editing experiments generating 3313 new genetically engineered mouse lines
The International Mouse Phenotyping Consortium (IMPC) systematically produces and phenotypes mouse lines with presumptive null mutations to provide insight into gene function. The IMPC now uses the programmable RNA-guided nuclease Cas9 for its increased capacity and flexibility to efficiently generate null alleles in the C57BL/6N strain. In addition to being a valuable novel and accessible research resource, the production of 3313 knockout mouse lines using comparable protocols provides a rich dataset to analyze experimental and biological variables affecting in vivo gene engineering with Cas9. Mouse line production has two critical steps: generation of founders with the desired allele, and germline transmission (GLT) of that allele from founders to offspring. A systematic evaluation of the variables impacting success rates identified gene essentiality as the primary factor influencing successful production of null alleles. Collectively, our findings provide best practice recommendations for using Cas9 to generate alleles in mouse essential genes, many of which are orthologs of genes linked to human disease.
RNA-Seq Mapping and Detection of Gene Fusions with a Suffix Array Algorithm
High-throughput RNA sequencing enables quantification of transcripts (both known and novel), exon/exon junctions and fusions of exons from different genes. Discovery of gene fusions, particularly those expressed at low abundance, is a challenge with short- and medium-length sequencing reads. To address this challenge, we implemented an RNA-Seq mapping pipeline within the LifeScope software. We introduced new features including filter and junction mapping, annotation-aided pairing rescue and accurate mapping quality values. We combined this pipeline with a Suffix Array Spliced Read (SASR) aligner to detect chimeric transcripts. Performing paired-end RNA-Seq of the breast cancer cell line MCF-7 using the SOLiD system, we called 40 gene fusions among over 120,000 splicing junctions. We validated 36 of these 40 fusions with TaqMan assays, of which 25 were expressed in MCF-7 but not the Human Brain Reference. An intra-chromosomal gene fusion involving the estrogen receptor alpha gene ESR1, and another involving the RPS6KB1 (ribosomal protein S6 kinase beta-1) gene, were recurrently expressed in a number of breast tumor cell lines and a clinical tumor sample.
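For readers unfamiliar with the data structure named in this abstract, the sketch below shows a generic suffix array with exact-match lookup by binary search. It is not the SASR aligner or any part of LifeScope, whose internals the abstract does not describe. The function names and the toy sequence are hypothetical, and the naive construction is chosen for readability, not speed.

```python
def build_suffix_array(text):
    """Return suffix start positions in lexicographic order (naive build)."""
    return sorted(range(len(text)), key=lambda i: text[i:])


def find_occurrences(text, suffix_array, pattern):
    """Return sorted start positions where pattern occurs exactly in text."""
    m = len(pattern)

    def prefix(rank):
        # First m characters of the suffix at this rank in the array.
        start = suffix_array[rank]
        return text[start:start + m]

    # Lower bound: first rank whose m-character prefix is >= pattern.
    lo, hi = 0, len(suffix_array)
    while lo < hi:
        mid = (lo + hi) // 2
        if prefix(mid) < pattern:
            lo = mid + 1
        else:
            hi = mid
    first = lo

    # Upper bound: first rank whose m-character prefix is > pattern.
    hi = len(suffix_array)
    while lo < hi:
        mid = (lo + hi) // 2
        if prefix(mid) <= pattern:
            lo = mid + 1
        else:
            hi = mid

    return sorted(suffix_array[first:lo])


# Toy reference sequence, hypothetical and unrelated to the study's data.
reference = "ACGTACGA"
sa = build_suffix_array(reference)
hits = find_occurrences(reference, sa, "ACG")
print(sa, hits)
```

All suffixes that begin with the query sit next to each other in the sorted array, so two binary searches find every exact occurrence. A spliced-read aligner would additionally need to handle mismatches and reads that span exon junctions, which this sketch does not attempt.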
