390 research outputs found

    Block CUR: Decomposing Matrices using Groups of Columns

    Full text link
    A common problem in large-scale data analysis is to approximate a matrix using a combination of specifically sampled rows and columns, known as CUR decomposition. Unfortunately, in many real-world environments, the ability to sample specific individual rows or columns of the matrix is limited by either system constraints or cost. In this paper, we consider matrix approximation by sampling predefined \emph{blocks} of columns (or rows) from the matrix. We present an algorithm for sampling useful column blocks and provide novel guarantees for the quality of the approximation. This algorithm has application in problems as diverse as biometric data analysis to distributed computing. We demonstrate the effectiveness of the proposed algorithms for computing the Block CUR decomposition of large matrices in a distributed setting with multiple nodes in a compute cluster, where such blocks correspond to columns (or rows) of the matrix stored on the same node, which can be retrieved with much less overhead than retrieving individual columns stored across different nodes. In the biometric setting, the rows correspond to different users and columns correspond to users' biometric reaction to external stimuli, {\em e.g.,}~watching video content, at a particular time instant. There is significant cost in acquiring each user's reaction to lengthy content so we sample a few important scenes to approximate the biometric response. An individual time sample in this use case cannot be queried in isolation due to the lack of context that caused that biometric reaction. Instead, collections of time segments ({\em i.e.,} blocks) must be presented to the user. The practical application of these algorithms is shown via experimental results using real-world user biometric data from a content testing environment.Comment: shorter version to appear in ECML-PKDD 201

    Evolution of Mycobacterium ulcerans and other mycolactone-producing mycobacteria from a common Mycobacterium marinum progenitor

    Get PDF
    It had been assumed that production of the cytotoxic polyketide mycolactone was strictly associated with Mycobacterium ulcerans, the causative agent of Buruli ulcer. However, a recent study has uncovered a broader distribution of mycolactone-producing mycobacteria (MPM) that includes mycobacteria cultured from diseased fish and frogs in the United States and from diseased fish in the Red and Mediterranean Seas. All of these mycobacteria contain versions of the M. ulcerans pMUM plasmid, produce mycolactones, and show a high degree of genetic relatedness to both M. ulcerans and Mycobacterium marinum. Here, we show by multiple genetic methods, including multilocus sequence analysis and DNA-DNA hybridization, that all MPM have evolved from a common M. marinum progenitor to form a genetically cohesive group among a more diverse assemblage of M. marinum strains. Like M. ulcerans, the fish and frog MPM show multiple copies of the insertion sequence IS2404. Comparisons of pMUM and chromosomal gene sequences demonstrate that plasmid acquisition and the subsequent ability to produce mycolactone were probably the key drivers of speciation. Ongoing evolution among MPM has since produced at least two genetically distinct ecotypes that can be broadly divided into those typically causing disease in ectotherms (but also having a high zoonotic potential) and those causing disease in endotherms, such as humans

    Human papillomavirus status in southern Chinese women

    Get PDF
    1. The overall and type-specific human papillomavirus (HPV) prevalence differed between Hong Kong and Guangzhou healthy women. The prevalence of HPV was significantly higher in Guangzhou than Hong Kong women. Younger women had significantly higher risk of HPV infection. 2. HPV16 remained the most common type detected in both regions; the frequency increased with increasing disease severity. The prevalence of HPV58 and HPV52 was relatively high in women with normal cervix and precancerous lesions.published_or_final_versio

    Distinct DNA methylation epigenotypes in bladder cancer from different Chinese sub-populations and its implication in cancer detection using voided urine

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Bladder cancer is the sixth most common cancer in the world and the incidence is particularly high in southwestern Taiwan. Previous studies have identified several tumor-related genes that are hypermethylated in bladder cancer; however the DNA methylation profile of bladder cancer in Taiwan is not fully understood.</p> <p>Methods</p> <p>In this study, we compared the DNA methylation profile of multiple tumor suppressor genes (<it>APC</it>, <it>DAPK</it>, <it>E-cadherin</it>, <it>hMLH1</it>, <it>IRF8</it>, <it>p14</it>, <it>p15</it>, <it>RASSF1A</it>, <it>SFRP1 </it>and <it>SOCS-1</it>) in bladder cancer patients from different Chinese sub-populations including Taiwan (104 cases), Hong Kong (82 cases) and China (24 cases) by MSP. Two normal human urothelium were also included as control. To investigate the diagnostic potential of using DNA methylation in non-invasive detection of bladder cancer, degree of methylation of <it>DAPK</it>, <it>IRF8</it>, <it>p14</it>, <it>RASSF1A </it>and <it>SFRP1 </it>was also accessed by quantitative MSP in urine samples from thirty bladder cancer patients and nineteen non-cancer controls.</p> <p>Results</p> <p>There were distinct DNA methylation epigenotypes among the different sub-populations. Further, samples from Taiwan and China demonstrated a bimodal distribution suggesting that CpG island methylator phentotype (CIMP) is presented in bladder cancer. Moreover, the number of methylated genes in samples from Taiwan and Hong Kong were significantly correlated with histological grade (P < 0.01) and pathological stage (P < 0.01). Regarding the samples from Taiwan, methylation of <it>SFRP1</it>, <it>IRF8</it>, <it>APC </it>and <it>RASSF1A </it>were significantly associated with increased tumor grade, stage. Methylation of <it>RASSF1A </it>was associated with tumor recurrence. Patients with methylation of <it>APC </it>or <it>RASSF1A </it>were also significantly associated with shorter recurrence-free survival. For methylation detection in voided urine samples of cancer patients, the sensitivity and specificity of using any of the methylated genes (<it>IRF8</it>, <it>p14 </it>or <it>sFRP1</it>) by qMSP was 86.7% and 94.7%.</p> <p>Conclusions</p> <p>Our results indicate that there are distinct methylation epigenotypes among different Chinese sub-populations. These profiles demonstrate gradual increases with cancer progression. Finally, detection of gene methylation in voided urine with these distinct DNA methylation markers is more sensitive than urine cytology.</p

    Gene network interconnectedness and the generalized topological overlap measure

    Get PDF
    BACKGROUND: Network methods are increasingly used to represent the interactions of genes and/or proteins. Genes or proteins that are directly linked may have a similar biological function or may be part of the same biological pathway. Since the information on the connection (adjacency) between 2 nodes may be noisy or incomplete, it can be desirable to consider alternative measures of pairwise interconnectedness. Here we study a class of measures that are proportional to the number of neighbors that a pair of nodes share in common. For example, the topological overlap measure by Ravasz et al. [1] can be interpreted as a measure of agreement between the m = 1 step neighborhoods of 2 nodes. Several studies have shown that two proteins having a higher topological overlap are more likely to belong to the same functional class than proteins having a lower topological overlap. Here we address the question whether a measure of topological overlap based on higher-order neighborhoods could give rise to a more robust and sensitive measure of interconnectedness. RESULTS: We generalize the topological overlap measure from m = 1 step neighborhoods to m ≥ 2 step neighborhoods. This allows us to define the m-th order generalized topological overlap measure (GTOM) by (i) counting the number of m-step neighbors that a pair of nodes share and (ii) normalizing it to take a value between 0 and 1. Using theoretical arguments, a yeast co-expression network application, and a fly protein network application, we illustrate the usefulness of the proposed measure for module detection and gene neighborhood analysis. CONCLUSION: Topological overlap can serve as an important filter to counter the effects of spurious or missing connections between network nodes. The m-th order topological overlap measure allows one to trade-off sensitivity versus specificity when it comes to defining pairwise interconnectedness and network modules

    Biofortified red mottled beans (Phaseolus vulgaris L.) in a maize and bean diet provide more bioavailable iron than standard red mottled beans: Studies in poultry (Gallus gallus) and an in vitro digestion/Caco-2 model

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Our objective was to compare the capacities of biofortified and standard colored beans to deliver iron (Fe) for hemoglobin synthesis. Two isolines of large-seeded, red mottled Andean beans (<it>Phaseolus vulgaris </it>L.), one standard ("Low Fe") and the other biofortified ("High Fe") in Fe (49 and 71 μg Fe/g, respectively) were used. This commercial class of red mottled beans is the preferred varietal type for most of the Caribbean and Eastern and Southern Africa where almost three quarters of a million hectares are grown. Therefore it is important to know the affect of biofortification of these beans on diets that simulate human feeding studies.</p> <p>Methods</p> <p>Maize-based diets containing the beans were formulated to meet the nutrient requirements for broiler except for Fe (Fe concentrations in the 2 diets were 42.9 ± 1.2 and 54.6 ± 0.9 mg/kg). One day old chicks (<it>Gallus gallus</it>) were allocated to the experimental diets (n = 12). For 4 wk, hemoglobin, feed-consumption and body-weights were measured.</p> <p>Results</p> <p>Hemoglobin maintenance efficiencies (HME) (means ± SEM) were different between groups on days 14 and 21 of the experiment (P < 0.05). Final total body hemoglobin Fe contents were different between the standard (12.58 ± 1.0 mg {0.228 ± 0.01 μmol}) and high Fe (15.04 ± 0.65 mg {0.273 ± 0.01 μmol}) bean groups (P < 0.05). At the end of the experiment, tissue samples were collected from the intestinal duodenum and liver for further analyses. Divalent-metal-transporter-1, duodenal-cytochrome-B, and ferroportin expressions were higher and liver ferritin was lower (P < 0.05) in the standard group vs. the biofortified group. <it>In-vitro </it>analysis showed lower iron bioavailability in cells exposed to standard ("Low Fe") bean based diet.</p> <p>Conclusions</p> <p>We conclude that the <it>in-vivo </it>results support the <it>in-vitro </it>observations; biofortified colored beans contain more bioavailable-iron than standard colored beans. In addition, biofortified beans seems to be a promising vehicle for increasing intakes of bioavailable Fe in human populations that consume these beans as a dietary staple. This justifies further work on the large-seeded Andean beans which are the staple of a large-region of Africa where iron-deficiency anemia is a primary cause of infant death and poor health status.</p

    Measurement of the top quark mass using the matrix element technique in dilepton final states

    Get PDF
    We present a measurement of the top quark mass in pp¯ collisions at a center-of-mass energy of 1.96 TeV at the Fermilab Tevatron collider. The data were collected by the D0 experiment corresponding to an integrated luminosity of 9.7  fb−1. The matrix element technique is applied to tt¯ events in the final state containing leptons (electrons or muons) with high transverse momenta and at least two jets. The calibration of the jet energy scale determined in the lepton+jets final state of tt¯ decays is applied to jet energies. This correction provides a substantial reduction in systematic uncertainties. We obtain a top quark mass of mt=173.93±1.84  GeV

    Reduced Risk of Malaria Parasitemia Following Household Screening and Treatment: A Cross-Sectional and Longitudinal Cohort Study

    Get PDF
    BACKGROUND: In regions of declining malaria transmission, new strategies for control are needed to reduce transmission and achieve elimination. Artemisinin-combination therapy (ACT) is active against immature gametocytes and can reduce the risk of transmission. We sought to determine whether household screening and treatment of infected individuals provides protection against infection for household members. METHODOLOGY/PRINCIPAL FINDINGS: The study was conducted in two areas in Southern Province, Zambia in 2007 and 2008/2009. To determine the impact of proactive case detection, households were randomly selected either to join a longitudinal cohort, in which participants were repeatedly screened throughout the year and those infected treated with artemether-lumefantrine, or a cross-sectional survey, in which participants were visited only once. Cross-sectional surveys were conducted throughout the year. The prevalence of RDT positivity was compared between the longitudinal and cross-sectional households at baseline and during follow-up using multilevel logistic regression. In the 2007 study area, 174 and 156 participants enrolled in the cross-sectional and longitudinal groups, respectively. In the 2008/2009 study area, 917 and 234 participants enrolled in the cross-sectional and longitudinal groups, respectively. In both study areas, participants and households in the longitudinal and cross-sectional groups were similar on demographic characteristics and prevalence of RDT positivity at baseline (2007: OR = 0.97; 95% CI:0.46, 2.03 | 2008/2009: OR = 1.28; 95% CI:0.44, 3.79). After baseline, the prevalence of RDT positivity was significantly lower in longitudinal compared to cross-sectional households in both study areas (2007: OR = 0.44; 95% CI:0.20, 0.96 | 2008/2009: OR = 0.16; 95% CI:0.05, 0.55). CONCLUSIONS/SIGNIFICANCE: Proactive case detection, consisting of screening household members with an RDT and treating those positive with ACT, can reduce transmission and provide indirect protection to household members. A targeted test and treat strategy could contribute to the elimination of malaria in regions of low transmission
    corecore