107 research outputs found
Depth, Highness and DNR Degrees
A sequence is Bennett deep [5] if every recursive approximation of the
Kolmogorov complexity of its initial segments from above satisfies that the difference
between the approximation and the actual value of the Kolmogorov complexity of
the initial segments dominates every constant function. We study for different lower
bounds r of this difference between approximation and actual value of the initial segment
complexity, which properties the corresponding r(n)-deep sets have. We prove
that for r(n) = εn, depth coincides with highness on the Turing degrees. For smaller
choices of r, i.e., r is any recursive order function, we show that depth implies either
highness or diagonally-non-recursiveness (DNR). In particular, for left-r.e. sets, order
depth already implies highness. As a corollary, we obtain that weakly-useful sets are
either high or DNR. We prove that not all deep sets are high by constructing a low
order-deep set.
Bennett's depth is defined using prefix-free Kolmogorov complexity. We show that
if one replaces prefix-free by plain Kolmogorov complexity in Bennett's depth definition,
one obtains a notion which no longer satisfies the slow growth law (which
stipulates that no shallow set truth-table computes a deep set); however, under this
notion, random sets are not deep (at the unbounded recursive order magnitude). We
improve Bennett's result that recursive sets are shallow by proving all K-trivial sets
are shallow; our result is close to optimal.
For Bennett's depth, the magnitude of compression improvement has to be achieved
almost everywhere on the set. Bennett observed that relaxing to infinitely often is
meaningless because every recursive set is infinitely often deep. We propose an alternative
infinitely often depth notion that doesn't suffer this limitation (called i.o.
depth).We show that every hyperimmune degree contains a i.o. deep set of magnitude
εn, and construct a π01- class where every member is an i.o. deep set of magnitude
εn. We prove that every non-recursive, non-DNR hyperimmune-free set is i.o. deep
of constant magnitude, and that every nonrecursive many-one degree contains such
a set
Infantile Convulsions with Paroxysmal Dyskinesia (ICCA Syndrome) and Copy Number Variation at Human Chromosome 16p11
BACKGROUND: Benign infantile convulsions and paroxysmal dyskinesia are episodic cerebral disorders that can share common genetic bases. They can be co-inherited as one single autosomal dominant trait (ICCA syndrome); the disease ICCA gene maps at chromosome 16p12-q12. Despite intensive and conventional mutation screening, the ICCA gene remains unknown to date. The critical area displays highly complicated genomic architecture and is the site of deletions and duplications associated with various diseases. The possibility that the ICCA syndrome is related to the existence of large-scale genomic alterations was addressed in the present study. METHODOLOGY/PRINCIPAL FINDINGS: A combination of whole genome and dedicated oligonucleotide array comparative genomic hybridization coupled with quantitative polymerase chain reaction was used. Low copy number of a region corresponding to a genomic variant (Variation_7105) located at 16p11 nearby the centromere was detected with statistical significance at much higher frequency in patients from ICCA families than in ethnically matched controls. The genomic variant showed no apparent difference in size and copy number between patients and controls, making it very unlikely that the genomic alteration detected here is ICCA-specific. Furthermore, no other genomic alteration that would directly cause the ICCA syndrome in those nine families was detected in the ICCA critical area. CONCLUSIONS/SIGNIFICANCE: Our data excluded that inherited genomic deletion or duplication events directly cause the ICCA syndrome; rather, they help narrowing down the critical ICCA region dramatically and indicate that the disease ICCA genetic defect lies very close to or within Variation_7105 and hence should now be searched in the corresponding genomic area and its surrounding regions
A first generation BAC-based physical map of the rainbow trout genome
Background: Rainbow trout (Oncorhynchus mykiss) are the most-widely cultivated cold freshwater fish in the world and an important model species for many research areas. Coupling great interest in this species as a research model with the need for genetic improvement of aquaculture production efficiency traits justifies the continued development of genomics research resources. Many quantitative trait loci (QTL) have been identified for production and life-history traits in rainbow trout. A bacterial artificial chromosome (BAC) physical map is needed to facilitate fine mapping of QTL and the selection of positional candidate genes for incorporation in marker-assisted selection (MAS) for improving rainbow trout aquaculture production. This resource will also facilitate efforts to obtain and assemble a whole-genome reference sequence for this species.[br/] Results: The physical map was constructed from DNA fingerprinting of 192,096 BAC clones using the 4-color high-information content fingerprinting (HICF) method. The clones were assembled into physical map contigs using the finger-printing contig (FPC) program. The map is composed of 4,173 contigs and 9,379 singletons. The total number of unique fingerprinting fragments (consensus bands) in contigs is 1,185,157, which corresponds to an estimated physical length of 2.0 Gb. The map assembly was validated by 1) comparison with probe hybridization results and agarose gel fingerprinting contigs; and 2) anchoring large contigs to the microsatellite-based genetic linkage map.[br/] Conclusion: The production and validation of the first BAC physical map of the rainbow trout genome is described in this paper. We are currently integrating this map with the NCCCWA genetic map using more than 200 microsatellites isolated from BAC end sequences and by identifying BACs that harbor more than 300 previously mapped markers. The availability of an integrated physical and genetic map will enable detailed comparative genome analyses, fine mapping of QTL, positional cloning, selection of positional candidate genes for economically important traits and the incorporation of MAS into rainbow trout breeding programs
A second generation genetic map for rainbow trout (Oncorhynchus mykiss)
<p>Abstract</p> <p>Background</p> <p>Genetic maps characterizing the inheritance patterns of traits and markers have been developed for a wide range of species and used to study questions in biomedicine, agriculture, ecology and evolutionary biology. The status of rainbow trout genetic maps has progressed significantly over the last decade due to interest in this species in aquaculture and sport fisheries, and as a model research organism for studies related to carcinogenesis, toxicology, comparative immunology, disease ecology, physiology and nutrition. We constructed a second generation genetic map for rainbow trout using microsatellite markers to facilitate the identification of quantitative trait loci for traits affecting aquaculture production efficiency and the extraction of comparative information from the genome sequences of model fish species.</p> <p>Results</p> <p>A genetic map ordering 1124 microsatellite loci spanning a sex-averaged distance of 2927.10 cM (Kosambi) and having 2.6 cM resolution was constructed by genotyping 10 parents and 150 offspring from the National Center for Cool and Cold Water Aquaculture (NCCCWA) reference family mapping panel. Microsatellite markers, representing pairs of loci resulting from an evolutionarily recent whole genome duplication event, identified 180 duplicated regions within the rainbow trout genome. Microsatellites associated with genes through expressed sequence tags or bacterial artificial chromosomes produced comparative assignments with tetraodon, zebrafish, fugu, and medaka resulting in assignments of homology for 199 loci.</p> <p>Conclusion</p> <p>The second generation NCCCWA genetic map provides an increased microsatellite marker density and quantifies differences in recombination rate between the sexes in outbred populations. It has the potential to integrate with cytogenetic and other physical maps, identifying paralogous regions of the rainbow trout genome arising from the evolutionarily recent genome duplication event, and anchoring a comparative map with the zebrafish, medaka, tetraodon, and fugu genomes. This resource will facilitate the identification of genes affecting traits of interest through fine mapping and positional cloning of candidate genes.</p
Estimating Contact Process Saturation in Sylvatic Transmission of Trypanosoma cruzi in the United States
Although it has been known for nearly a century that strains of Trypanosoma cruzi, the etiological agent for Chagas' disease, are enzootic in the southern U.S., much remains unknown about the dynamics of its transmission in the sylvatic cycles that maintain it, including the relative importance of different transmission routes. Mathematical models can fill in gaps where field and lab data are difficult to collect, but they need as inputs the values of certain key demographic and epidemiological quantities which parametrize the models. In particular, they determine whether saturation occurs in the contact processes that communicate the infection between the two populations. Concentrating on raccoons, opossums, and woodrats as hosts in Texas and the southeastern U.S., and the vectors Triatoma sanguisuga and Triatoma gerstaeckeri, we use an exhaustive literature review to derive estimates for fundamental parameters, and use simple mathematical models to illustrate a method for estimating infection rates indirectly based on prevalence data. Results are used to draw conclusions about saturation and which population density drives each of the two contact-based infection processes (stercorarian/bloodborne and oral). Analysis suggests that the vector feeding process associated with stercorarian transmission to hosts and bloodborne transmission to vectors is limited by the population density of vectors when dealing with woodrats, but by that of hosts when dealing with raccoons and opossums, while the predation of hosts on vectors which drives oral transmission to hosts is limited by the population density of hosts. Confidence in these conclusions is limited by a severe paucity of data underlying associated parameter estimates, but the approaches developed here can also be applied to the study of other vector-borne infections
Pan-cancer analysis of whole genomes
Cancer is driven by genetic change, and the advent of massively parallel sequencing has enabled systematic documentation of this variation at the whole-genome scale(1-3). Here we report the integrative analysis of 2,658 whole-cancer genomes and their matching normal tissues across 38 tumour types from the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA). We describe the generation of the PCAWG resource, facilitated by international data sharing using compute clouds. On average, cancer genomes contained 4-5 driver mutations when combining coding and non-coding genomic elements; however, in around 5% of cases no drivers were identified, suggesting that cancer driver discovery is not yet complete. Chromothripsis, in which many clustered structural variants arise in a single catastrophic event, is frequently an early event in tumour evolution; in acral melanoma, for example, these events precede most somatic point mutations and affect several cancer-associated genes simultaneously. Cancers with abnormal telomere maintenance often originate from tissues with low replicative activity and show several mechanisms of preventing telomere attrition to critical levels. Common and rare germline variants affect patterns of somatic mutation, including point mutations, structural variants and somatic retrotransposition. A collection of papers from the PCAWG Consortium describes non-coding mutations that drive cancer beyond those in the TERT promoter(4); identifies new signatures of mutational processes that cause base substitutions, small insertions and deletions and structural variation(5,6); analyses timings and patterns of tumour evolution(7); describes the diverse transcriptional consequences of somatic mutation on splicing, expression levels, fusion genes and promoter activity(8,9); and evaluates a range of more-specialized features of cancer genomes(8,10-18).Peer reviewe
Stroke genetics informs drug discovery and risk prediction across ancestries
Previous genome-wide association studies (GWASs) of stroke — the second leading cause of death worldwide — were conducted predominantly in populations of European ancestry1,2. Here, in cross-ancestry GWAS meta-analyses of 110,182 patients who have had a stroke (five ancestries, 33% non-European) and 1,503,898 control individuals, we identify association signals for stroke and its subtypes at 89 (61 new) independent loci: 60 in primary inverse-variance-weighted analyses and 29 in secondary meta-regression and multitrait analyses. On the basis of internal cross-ancestry validation and an independent follow-up in 89,084 additional cases of stroke (30% non-European) and 1,013,843 control individuals, 87% of the primary stroke risk loci and 60% of the secondary stroke risk loci were replicated (P < 0.05). Effect sizes were highly correlated across ancestries. Cross-ancestry fine-mapping, in silico mutagenesis analysis3, and transcriptome-wide and proteome-wide association analyses revealed putative causal genes (such as SH3PXD2A and FURIN) and variants (such as at GRK5 and NOS3). Using a three-pronged approach4, we provide genetic evidence for putative drug effects, highlighting F11, KLKB1, PROC, GP1BA, LAMC2 and VCAM1 as possible targets, with drugs already under investigation for stroke for F11 and PROC. A polygenic score integrating cross-ancestry and ancestry-specific stroke GWASs with vascular-risk factor GWASs (integrative polygenic scores) strongly predicted ischaemic stroke in populations of European, East Asian and African ancestry5. Stroke genetic risk scores were predictive of ischaemic stroke independent of clinical risk factors in 52,600 clinical-trial participants with cardiometabolic disease. Our results provide insights to inform biology, reveal potential drug targets and derive genetic risk prediction tools across ancestries
- …