412 research outputs found

    Unsupervised feature selection for noisy data

    Feature selection techniques are widely applied in a variety of data analysis tasks in order to reduce dimensionality. According to the type of learning, feature selection algorithms are categorized as supervised or unsupervised. In unsupervised learning scenarios, selecting features is a much harder problem, due to the lack of class labels that would facilitate the search for relevant features. The difficulty of selecting features is amplified when the data are corrupted by different kinds of noise. Almost all traditional unsupervised feature selection methods are not robust against noise in the samples. These approaches have no explicit mechanism for detaching and isolating the noise, and thus they cannot produce an optimal feature subset. In this article, we propose an unsupervised approach for feature selection on noisy data, called Robust Independent Feature Selection (RIFS). Specifically, we choose a feature subset that contains most of the underlying information, using the same criteria as independent component analysis (ICA). Simultaneously, the noise is separated as an independent component. The isolation of representative noise samples is achieved using factor oblique rotation, whereas noise identification is performed using factor pattern loadings. Extensive experimental results over diverse real-life data sets have shown the efficiency and advantage of the proposed algorithm. We thankfully acknowledge the support of the Comisión Interministerial de Ciencia y Tecnología (CICYT) under contract No. TIN2015-65316-P, which has partially funded this work. Peer Reviewed. Postprint (author's final draft).

    Structure of the ATP synthase catalytic complex (F(1)) from Escherichia coli in an autoinhibited conformation.

    ATP synthase is a membrane-bound rotary motor enzyme that is critical for cellular energy metabolism in all kingdoms of life. Despite conservation of its basic structure and function, autoinhibition by one of its rotary stalk subunits occurs in bacteria and chloroplasts but not in mitochondria. The crystal structure of the ATP synthase catalytic complex (F(1)) from Escherichia coli described here reveals the structural basis for this inhibition. The C-terminal domain of subunit ɛ adopts a heretofore unknown, highly extended conformation that inserts deeply into the central cavity of the enzyme and engages both rotor and stator subunits in extensive contacts that are incompatible with functional rotation. As a result, the three catalytic subunits are stabilized in a set of conformations and rotational positions distinct from previous F(1) structures

    Genome of the facultative scuticociliatosis pathogen Pseudocohnilembus persalinus provides insight into its virulence through horizontal gene transfer

    This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ The attached file is the published version of the article

    Genetic Variants in Nuclear-Encoded Mitochondrial Genes Influence AIDS Progression

    Background: The human mitochondrial genome includes only 13 coding genes, while nuclear-encoded genes account for 99% of proteins responsible for mitochondrial morphology, redox regulation, and energetics. Mitochondrial pathogenesis occurs in HIV patients and, genetically, mitochondrial DNA haplogroups with presumed functional differences have been associated with differential AIDS progression. Methodology/Principal Findings: Here we explore whether single nucleotide polymorphisms (SNPs) within 904 of the estimated 1,500 genes that specify nuclear-encoded mitochondrial proteins (NEMPs) influence AIDS progression among HIV-1 infected patients. We examined NEMPs for association with the rate of AIDS progression using genotypes generated by an Affymetrix 6.0 genotyping array of 1,455 European American patients from five US AIDS cohorts. Successfully genotyped SNPs gave 50% or better haplotype coverage for 679 of known NEMP genes. With a Bonferroni adjustment for the number of genes and tests examined, multiple SNPs within two NEMP genes showed significant association with AIDS progression: acyl-CoA synthetase medium-chain family member 4 (ACSM4) on chromosome 12 and peroxisomal D3,D2-enoyl-CoA isomerase (PECI) on chromosome 6. Conclusions: Our previous studies on mitochondrial DNA showed that European haplogroups with presumed functional differences were associated with AIDS progression and HAART-mediated adverse events. The modest influences of nuclear-encoded mitochondrial genes found in the current study add support to the idea that mitochondrial function plays a role in AIDS pathogenesis.
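
    The Bonferroni adjustment mentioned in this abstract can be illustrated with a minimal sketch. It is not the authors' analysis pipeline: the function names, the gene labels, the p-values, and the test count of 1,000 are all hypothetical placeholders chosen for illustration.

```python
def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance threshold under a Bonferroni adjustment."""
    if n_tests < 1:
        raise ValueError("n_tests must be at least 1")
    return alpha / n_tests


def significant_after_bonferroni(p_values, alpha=0.05, n_tests=None):
    """Return the names whose p-value passes the adjusted threshold.

    p_values maps a label (e.g. a gene or SNP) to its unadjusted p-value.
    n_tests defaults to the number of p-values supplied.
    """
    m = len(p_values) if n_tests is None else n_tests
    threshold = bonferroni_threshold(alpha, m)
    return sorted(name for name, p in p_values.items() if p <= threshold)


if __name__ == "__main__":
    # Hypothetical p-values; not taken from the study.
    example = {"gene_a": 1e-6, "gene_b": 0.01, "gene_c": 0.2}
    print(significant_after_bonferroni(example, alpha=0.05, n_tests=1000))
```

    Dividing the family-wise alpha by the number of tests keeps the chance of any false positive at or below alpha, at the cost of reduced power when many tests are run.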

    Assessing Historical Fish Community Composition Using Surveys, Historical Collection Data, and Species Distribution Models

    Accurate establishment of baseline conditions is critical to successful management and habitat restoration. We demonstrate the ability to robustly estimate historical fish community composition and assess the current status of the urbanized Barton Creek watershed in central Texas, U.S.A. Fish species were surveyed in 2008 and the resulting data compared to three sources of fish occurrence information: (i) historical records from a museum specimen database and literature searches; (ii) a nearly identical survey conducted 15 years earlier; and (iii) a modeled historical community constructed with species distribution models (SDMs). This holistic approach, and especially the application of SDMs, allowed us to discover that the fish community in Barton Creek was more diverse than the historical data and survey methods alone indicated. Sixteen native species with high modeled probability of occurrence within the watershed were not found in the 2008 survey; seven of these were not found in either survey or in any of the historical collection records. Our approach allowed us to more rigorously establish the true baseline for the pre-development fish fauna and then to more accurately assess trends and develop hypotheses regarding factors driving current fish community composition to better inform management decisions and future restoration efforts. Smaller, urbanized freshwater systems, like Barton Creek, typically have a relatively poor historical biodiversity inventory coupled with long histories of alteration, and thus there is a propensity for land managers and researchers to apply inaccurate baseline standards. Our methods provide a way around that limitation by using SDMs derived from larger and richer biodiversity databases of a broader geographic scope. Broadly applied, we propose that this technique has potential to overcome limitations of popular bioassessment metrics (e.g., IBI) to become a versatile and robust management tool for determining status of freshwater biotic communities.

    Ku Regulates the Non-Homologous End Joining Pathway Choice of DNA Double-Strand Break Repair in Human Somatic Cells

    The repair of DNA double-strand breaks (DSBs) is critical for the maintenance of genomic integrity and viability for all organisms. Mammals have evolved at least two genetically discrete ways to mediate DNA DSB repair: homologous recombination (HR) and non-homologous end joining (NHEJ). In mammalian cells, most DSBs are preferentially repaired by NHEJ. Recent work has demonstrated that NHEJ consists of at least two sub-pathways—the main Ku heterodimer-dependent or “classic” NHEJ (C-NHEJ) pathway and an “alternative” NHEJ (A-NHEJ) pathway, which usually generates microhomology-mediated signatures at repair junctions. In our study, recombinant adeno-associated virus knockout vectors were utilized to construct a series of isogenic human somatic cell lines deficient in the core C-NHEJ factors (Ku, DNA-PKcs, XLF, and LIGIV), and the resulting cell lines were characterized for their ability to carry out DNA DSB repair. The absence of DNA-PKcs, XLF, or LIGIV resulted in cell lines that were profoundly impaired in DNA DSB repair activity. Unexpectedly, Ku86-null cells showed wild-type levels of DNA DSB repair activity that was dominated by microhomology joining events indicative of A-NHEJ. Importantly, A-NHEJ DNA DSB repair activity could also be efficiently de-repressed in LIGIV-null and DNA-PKcs-null cells by subsequently reducing the level of Ku70. These studies demonstrate that in human cells C-NHEJ is the major DNA DSB repair pathway and they show that Ku is the critical C-NHEJ factor that regulates DNA NHEJ DSB pathway choice

    Gene expression fingerprint of uterine serous papillary carcinoma: identification of novel molecular markers for uterine serous cancer diagnosis and therapy

    Uterine serous papillary cancer (USPC) represents a rare but highly aggressive variant of endometrial cancer, the most common gynecologic tumour in women. We used oligonucleotide microarrays that interrogate the expression of some 10 000 known genes to profile 10 highly purified primary USPC cultures and five normal endometrial cells (NEC). We report that unsupervised analysis of mRNA fingerprints readily distinguished USPC from normal endometrial epithelial cells and identified 139 and 390 genes that exhibited >5-fold upregulation and downregulation, respectively, in primary USPC when compared to NEC. Many of the genes upregulated in USPC were found to represent adhesion molecules, secreted proteins and oncogenes, such as L1 cell adhesion molecule, claudin-3 and claudin-4, kallikrein 6 (protease M) and kallikrein 10 (NES1), interleukin-6 and c-erbB2. Downregulated genes in USPC included SEMACAP3, ras homolog gene family, member I (ARHI), and differentially downregulated in ovarian carcinoma gene 1. Quantitative RT–PCR was used to validate differences in gene expression between USPC and NEC for several of these genes. Owing to its potential as a novel therapeutic marker, expression of the high-affinity epithelial receptor for Clostridium perfringens enterotoxin (CPE) claudin-4 was further validated through immunohistochemical analysis of formalin-fixed paraffin-embedded specimens from which the primary USPC cultures were obtained, as well as an independent set of archival USPC specimens. Finally, the sensitivity of primary USPC to the administration of scalar doses of CPE in vitro was also demonstrated. Our results highlight the novel molecular features of USPC and provide a foundation for the development of new type-specific therapies against this highly aggressive variant of endometrial cancer
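
    The >5-fold up- and downregulation criterion described in this abstract can be sketched as a simple ratio filter. This is an illustrative assumption about how such a cutoff might be applied, not the authors' microarray pipeline: the function name, the use of group means, and the sample values are hypothetical.

```python
from statistics import mean


def classify_fold_changes(tumor, normal, min_fold=5.0):
    """Split genes into up- and downregulated lists by fold change.

    tumor and normal map gene names to lists of positive expression
    values (one per sample). A gene is 'up' if mean(tumor)/mean(normal)
    exceeds min_fold and 'down' if the ratio is below 1/min_fold.
    """
    up, down = [], []
    for gene in sorted(tumor.keys() & normal.keys()):
        t, n = mean(tumor[gene]), mean(normal[gene])
        if t <= 0 or n <= 0:
            continue  # a ratio is undefined for non-positive means
        ratio = t / n
        if ratio > min_fold:
            up.append(gene)
        elif ratio < 1.0 / min_fold:
            down.append(gene)
    return up, down


if __name__ == "__main__":
    # Hypothetical expression values; not taken from the study.
    tumor = {"gene_a": [60.0, 60.0], "gene_b": [10.0, 10.0], "gene_c": [1.0, 1.0]}
    normal = {"gene_a": [10.0, 10.0], "gene_b": [10.0, 10.0], "gene_c": [10.0, 10.0]}
    print(classify_fold_changes(tumor, normal))
```

    A real analysis would typically pair such a cutoff with a statistical test and normalization; the sketch only shows the thresholding step.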

    Contemporary Evolutionary Divergence for a Protected Species following Assisted Colonization

    Contemporary evolution following assisted colonization may increase the probability of persistence for refuge populations established as a bet-hedge for protected species. Such refuge populations are considered "genetic replicates" that might be used for future re-colonization in the event of a catastrophe in the native site. Although maladaptive evolutionary divergence of captive populations is well recognized, evolutionary divergence of wild refuge populations may also occur on contemporary time scales. Thus, refuge populations may lose their "value" as true genetic replicates of the native population. Here, we show contemporary evolutionary divergence in body shape in an approximately 30-year old refuge population of the protected White Sands pupfish (Cyprinodon tularosa) resulting in a body-shape mismatch with its native environment. Geometric morphometric data were collected from C. tularosa cultures raised in experimental mesocosms. Cultures were initiated with fish from the two native populations, plus hybrids, in high or low salinity treatments representing the salinities of the two native habitats. We found that body shape was heritable and that shape variation due to phenotypic plasticity was small compared to shape variation due to population source. C. tularosa from the high salinity population retained slender body shapes and fish from the low salinity population retained deep body shapes, irrespective of mesocosm salinity. These data suggest that the observed divergence of a recently established pupfish population was not explained by plasticity. An analysis of microsatellite variation indicated that no significant genetic drift occurred in the refuge population, further supporting the adaptive nature of changes in body shape. These lines of evidence suggest that body shape divergence of the refuge population reflects a case of contemporary evolution (over a 30-year period). These results suggest assisted colonization can introduce novel and/or relaxed selection, and lead to unintended evolutionary divergence.

    Expression of Nestin by Neural Cells in the Adult Rat and Human Brain

    Neurons and glial cells in the developing brain arise from neural progenitor cells (NPCs). Nestin, an intermediate filament protein, is thought to be expressed exclusively by NPCs in the normal brain, and is replaced by the expression of proteins specific for neurons or glia in differentiated cells. Nestin-expressing NPCs are found in the adult brain in the subventricular zone (SVZ) of the lateral ventricle and the subgranular zone (SGZ) of the dentate gyrus. While significant attention has been paid to studying NPCs in the SVZ and SGZ in the adult brain, relatively little attention has been paid to determining whether nestin-expressing neural cells (NECs) exist outside of the SVZ and SGZ. We therefore stained sections immunocytochemically from the adult rat and human brain for NECs, observed four distinct classes of these cells, and present here the first comprehensive report on these cells. Class I cells are among the smallest neural cells in the brain and are widely distributed. Class II cells are located in the walls of the aqueduct and third ventricle. Class IV cells are found throughout the forebrain and typically reside immediately adjacent to a neuron. Class III cells are observed only in the basal forebrain and closely related areas such as the hippocampus and corpus striatum. Class III cells resemble neurons structurally and co-express markers associated exclusively with neurons. Cell proliferation experiments demonstrate that Class III cells are not recently born. Instead, these cells appear to be mature neurons in the adult brain that express nestin. Neurons that express nestin are not supposed to exist in the brain at any stage of development. That these unique neurons are found only in brain regions involved in higher order cognitive function suggests that they may be remodeling their cytoskeleton in supporting the neural plasticity required for these functions.