
    Computing Web-scale Topic Models using an Asynchronous Parameter Server

    Topic models such as Latent Dirichlet Allocation (LDA) have been widely used in information retrieval for tasks ranging from smoothing and feedback methods to tools for exploratory search and discovery. However, classical methods for inferring topic models do not scale up to the massive size of today's publicly available Web-scale data sets. The state-of-the-art approaches rely on custom strategies, implementations and hardware to facilitate their asynchronous, communication-intensive workloads. We present APS-LDA, which integrates state-of-the-art topic modeling with cluster computing frameworks such as Spark using a novel asynchronous parameter server. Advantages of this integration include convenient usage of existing data processing pipelines and eliminating the need for disk writes as data can be kept in memory from start to finish. Our goal is not to outperform highly customized implementations, but to propose a general high-performance topic modeling framework that can easily be used in today's data processing pipelines. We compare APS-LDA to the existing Spark LDA implementations and show that our system can, on a 480-core cluster, process up to 135 times more data and 10 times more topics without sacrificing model quality. Comment: To appear in SIGIR 201

    Tools in Spanish Universities to facilitate the transmission from High School to University and their assessment.

    Incoming university students often have educational deficiencies, while university studies require a sound basis of scientific knowledge. This project analyses the instruments that public Spanish universities offer to reinforce knowledge in the areas related to the studies that students are about to embark on. There are important differences among universities and, within each university, great differences among degree programs. Initial courses (cursos cero) are more widespread (offered at 50% of universities) than self-evaluation instruments (14% of universities). The dissemination of these instruments needs to improve, because at present it is not possible to evaluate them. The following actions are therefore proposed: to conduct regular standard surveys of professors and students; to publish the results of the surveys; and for public universities to institutionalize their basic training offer and improve its dissemination, especially through the web. This paper presents a questionnaire to assess student opinion about these tools. To analyze its effectiveness and make an initial estimate of how these tools are evaluated, we conducted a pilot test of the questionnaire with 68 students at the University of Extremadura. The results of a preliminary statistical analysis of the pilot test indicate that the survey results are reliable. A global evaluation of both tools, on a scale of 1 to 5, gave an average score of 3.29 for initial courses and 3.41 for self-evaluation. Of the students, 72.9% consider the "self assessment" more effective than the "initial course

    Ethanol Affects the Development of Sensory Hair Cells in Larval Zebrafish (Danio rerio)

    Children born to mothers with substantial alcohol consumption during pregnancy can present a number of morphological, cognitive, and sensory abnormalities, including hearing deficits, collectively known as fetal alcohol syndrome (FAS). The goal of this study was to determine if the zebrafish lateral line could be used to study sensory hair cell abnormalities caused by exposure to ethanol during embryogenesis. Some lateral line sensory hair cells are present at 2 days post-fertilization (dpf) and are functional by 5 dpf. Zebrafish embryos were raised in fish water supplemented with varying concentrations of ethanol (0.75%–1.75% by volume) from 2 dpf through 5 dpf. Ethanol treatment during development resulted in many physical abnormalities characteristic of FAS in humans. Also, the number of sensory hair cells decreased as the concentration of ethanol increased in a dose-dependent manner. The dye FM 1-43FX was used to detect the presence of functional mechanotransduction channels. The percentage of FM 1-43-labeled hair cells decreased as the concentration of ethanol increased. Methanol treatment did not affect the development of hair cells. The cell cycle markers proliferating cell nuclear antigen (PCNA) and bromodeoxyuridine (BrdU) demonstrated that ethanol reduced the number of sensory hair cells, as a consequence of decreased cellular proliferation. There was also a significant increase in the rate of apoptosis, as determined by TUNEL-labeling, in neuromasts following ethanol treatment during larval development. Therefore, zebrafish are a useful animal model to study the effects of hair cell developmental disorders associated with FAS

    Thermal Jeans fragmentation within 1000 AU in OMC-1S

    We present subarcsecond 1.3 mm continuum ALMA observations towards the Orion Molecular Cloud 1 South (OMC-1S) region, down to a spatial resolution of 74 AU, which reveal a total of 31 continuum sources. We also present subarcsecond 7 mm continuum VLA observations of the same region, which allow us to study fragmentation further, down to a spatial resolution of 40 AU. By applying a Mean Surface Density of Companions method we find a characteristic spatial scale at ~560 AU, and we use this spatial scale to define the boundary of 19 'cores' in OMC-1S as groupings of millimeter sources. We find an additional characteristic spatial scale at ~2900 AU, which is the typical scale of the filaments in OMC-1S, suggesting a two-level fragmentation process. We measured the fragmentation level within each core and find a higher fragmentation towards the southern filament. In addition, the cores of the southern filament are also the densest (within 1100 AU) cores in OMC-1S. This is fully consistent with previous studies of fragmentation at spatial scales one order of magnitude larger, and suggests that fragmentation down to 40 AU seems to be governed by thermal Jeans processes in OMC-1S. Comment: Accepted to Ap

    Contexts of diffusion: Adoption of research synthesis in Social Work and Women's Studies

    Texts reveal the subjects of interest in research fields, and the values, beliefs, and practices of researchers. In this study, texts are examined through bibliometric mapping and topic modeling to provide a bird's-eye view of the social dynamics associated with the diffusion of research synthesis methods in the contexts of Social Work and Women's Studies. Research synthesis texts are especially revealing because the methods, which include meta-analysis and systematic review, are reliant on the availability of past research and data, sometimes idealized as objective, egalitarian approaches to research evaluation, fundamentally tied to past research practices, and performed with the goal of informing future research and practice. This study highlights the co-influence of past and subsequent research within research fields; illustrates dynamics of the diffusion process; and provides insight into the cultural contexts of research in Social Work and Women's Studies. This study suggests the potential to further develop bibliometric mapping and topic modeling techniques to inform research problem selection and resource allocation. Comment: To appear in proceedings of the 2014 International Conference on Social Computing, Behavioral-Cultural Modeling, and Prediction (SBP2014

    Serum Biochemical Profile of Captive-Bred Philippine Crocodiles (Crocodylus mindorensis Schmidt, 1935) Sub-adults

    The Philippine crocodile (Crocodylus mindorensis Schmidt, 1935) is a critically endangered species that requires minimally invasive diagnostic tools for assessing its physiological state and health. In the current study, we determined the reference ranges for the serum biochemistry of male and female captive-bred C. mindorensis sub-adults. We collected blood samples from the post-occipital venous sinus of six male and seven female captive-bred crocodile sub-adults at the Palawan Wildlife Rescue and Conservation Center and quantified the serum biochemical values for cholesterol, triglycerides, uric acid, glucose, creatinine, aspartate aminotransferase (AST), alanine aminotransferase (ALT), albumin, total protein, and globulin. We defined reference ranges through the central 95% of the concentration values obtained. The uric acid concentrations were significantly different between male and female C. mindorensis sub-adults. Uric acid levels were higher (P = 0.035) in male sub-adults because of their higher food intake resulting from dominance and aggression during feeding times. Serum biochemical values of C. mindorensis exhibited similarities with and variations from other crocodile species. We briefly discussed the differences with other species and the influence of factors such as field and laboratory methodologies, environmental conditions, nutritional status, and size class
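    A reference range defined as "the central 95% of the concentration values" can be read as the interval between the 2.5th and 97.5th percentiles of the observed values. The sketch below illustrates that reading only; the abstract does not state which percentile estimator the authors used, so the linear interpolation here is an assumption, and the function name `central_interval` and the sample values are hypothetical.

```python
def central_interval(values, coverage=0.95):
    """Return (low, high) bounds enclosing the central `coverage` share of values.

    Assumption: percentiles are estimated by linear interpolation between
    order statistics; the source abstract does not specify an estimator.
    """
    if not values:
        raise ValueError("at least one value is required")
    xs = sorted(values)
    tail = (1.0 - coverage) / 2.0  # share cut from each end, e.g. 0.025

    def quantile(q):
        # Fractional position of the q-th quantile in the sorted list.
        pos = q * (len(xs) - 1)
        lo = int(pos)
        hi = min(lo + 1, len(xs) - 1)
        return xs[lo] + (pos - lo) * (xs[hi] - xs[lo])

    return quantile(tail), quantile(1.0 - tail)


# Hypothetical concentrations, for illustration only (not data from the study).
sample = [2.1, 2.4, 2.6, 3.0, 3.3, 3.9, 4.2]
low, high = central_interval(sample)
print(f"reference range: {low:.2f} to {high:.2f}")
```

    With only six or seven animals per group, as in the study, the bounds of such an interval lie very close to the smallest and largest observations, so they are sensitive to individual values.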

    A Novel Clustering Algorithm Based on Quantum Games

    Quantum algorithms have achieved enormous successes during the last decade. In this paper, we combine the quantum game with the problem of data clustering, and then develop a quantum-game-based clustering algorithm, in which data points in a dataset are considered as players who can make decisions and implement quantum strategies in quantum games. After each round of a quantum game, each player's expected payoff is calculated. The player then uses a link-removing-and-rewiring (LRR) function to change its neighbors and adjust the strength of the links connecting to them in order to maximize its payoff. Further, the algorithms are discussed and analyzed for two cases of strategies, two payoff matrices and two LRR functions. The simulation results demonstrate that data points in datasets are clustered reasonably and efficiently, and that the clustering algorithms have fast rates of convergence. Moreover, the comparison with other algorithms also provides an indication of the effectiveness of the proposed approach. Comment: 19 pages, 5 figures, 5 table