Computing Web-scale Topic Models using an Asynchronous Parameter Server
Topic models such as Latent Dirichlet Allocation (LDA) have been widely used
in information retrieval for tasks ranging from smoothing and feedback methods
to tools for exploratory search and discovery. However, classical methods for
inferring topic models do not scale up to the massive size of today's publicly
available Web-scale data sets. The state-of-the-art approaches rely on custom
strategies, implementations and hardware to facilitate their asynchronous,
communication-intensive workloads.
We present APS-LDA, which integrates state-of-the-art topic modeling with
cluster computing frameworks such as Spark using a novel asynchronous parameter
server. Advantages of this integration include convenient usage of existing
data processing pipelines and eliminating the need for disk writes as data can
be kept in memory from start to finish. Our goal is not to outperform highly
customized implementations, but to propose a general high-performance topic
modeling framework that can easily be used in today's data processing
pipelines. We compare APS-LDA to the existing Spark LDA implementations and
show that our system can, on a 480-core cluster, process up to 135 times more
data and 10 times more topics without sacrificing model quality.
Comment: To appear in SIGIR 201
Tools in Spanish Universities to facilitate the transmission from High School to University and their assessment.
Incoming university students have educational deficiencies, yet university studies require a sound basis of scientific knowledge. This project analyzes instruments for reinforcing knowledge in the areas related to the studies that students are about to embark on at public Spanish universities. There are important differences among universities and, within each university, great differences among degree programs. Initial courses (cursos cero) are more widespread (offered at 50% of universities) than self-evaluation instruments (14% of universities). The dissemination of these instruments needs to improve, because at present it is not possible to evaluate them. The following actions are therefore proposed: conduct regular standardized surveys of professors and students; publish the survey results; and have public universities institutionalize their basic training offer and improve its dissemination, especially through the web. This paper presents a questionnaire to assess student opinion about these tools. To analyze its effectiveness and make an initial estimate of how students rate these tools, we conducted a pilot test of the questionnaire with 68 students at the University of Extremadura. Preliminary statistical analysis of the pilot test indicates that the survey results are reliable. A global evaluation of both tools, on a scale of 1 to 5, gave an average score of 3.29 for initial courses and 3.41 for self-evaluation. 72.9% of the students consider the "self assessment" more effective than the "initial course
Ethanol Affects the Development of Sensory Hair Cells in Larval Zebrafish (Danio rerio)
Children born to mothers with substantial alcohol consumption during pregnancy can present a number of morphological, cognitive, and sensory abnormalities, including hearing deficits, collectively known as fetal alcohol syndrome (FAS). The goal of this study was to determine if the zebrafish lateral line could be used to study sensory hair cell abnormalities caused by exposure to ethanol during embryogenesis. Some lateral line sensory hair cells are present at 2 days post-fertilization (dpf) and are functional by 5 dpf. Zebrafish embryos were raised in fish water supplemented with varying concentrations of ethanol (0.75%–1.75% by volume) from 2 dpf through 5 dpf. Ethanol treatment during development resulted in many physical abnormalities characteristic of FAS in humans. Also, the number of sensory hair cells decreased as the concentration of ethanol increased in a dose-dependent manner. The dye FM 1-43FX was used to detect the presence of functional mechanotransduction channels. The percentage of FM 1-43-labeled hair cells decreased as the concentration of ethanol increased. Methanol treatment did not affect the development of hair cells. The cell cycle markers proliferating cell nuclear antigen (PCNA) and bromodeoxyuridine (BrdU) demonstrated that ethanol reduced the number of sensory hair cells, as a consequence of decreased cellular proliferation. There was also a significant increase in the rate of apoptosis, as determined by TUNEL-labeling, in neuromasts following ethanol treatment during larval development. Therefore, zebrafish are a useful animal model to study the effects of hair cell developmental disorders associated with FAS
Thermal Jeans fragmentation within 1000 AU in OMC-1S
We present subarcsecond 1.3 mm continuum ALMA observations towards the Orion
Molecular Cloud 1 South (OMC-1S) region, down to a spatial resolution of 74 AU,
which reveal a total of 31 continuum sources. We also present subarcsecond 7 mm
continuum VLA observations of the same region, which allow us to further study
fragmentation down to a spatial resolution of 40 AU. By applying a Mean Surface
Density of Companions method we find a characteristic spatial scale at ~560 AU,
and we use this spatial scale to define the boundary of 19 `cores' in OMC-1S as
groupings of millimeter sources. We find an additional characteristic spatial
scale at ~2900 AU, which is the typical scale of the filaments in OMC-1S,
suggesting a two-level fragmentation process. We measured the fragmentation
level within each core and found higher fragmentation towards the southern
filament. In addition, the cores of the southern filament are also the densest
(within 1100 AU) cores in OMC-1S. This is fully consistent with previous
studies of fragmentation at spatial scales one order of magnitude larger, and
suggests that fragmentation down to 40 AU is governed by thermal Jeans
processes in OMC-1S.
Comment: Accepted to Ap
Contexts of diffusion: Adoption of research synthesis in Social Work and Women's Studies
Texts reveal the subjects of interest in research fields, and the values,
beliefs, and practices of researchers. In this study, texts are examined
through bibliometric mapping and topic modeling to provide a bird's-eye view of
the social dynamics associated with the diffusion of research synthesis methods
in the contexts of Social Work and Women's Studies. Research synthesis texts
are especially revealing because the methods, which include meta-analysis and
systematic review, are reliant on the availability of past research and data,
sometimes idealized as objective, egalitarian approaches to research
evaluation, fundamentally tied to past research practices, and performed with
the goal of informing future research and practice. This study highlights the
co-influence of past and subsequent research within research fields;
illustrates dynamics of the diffusion process; and provides insight into the
cultural contexts of research in Social Work and Women's Studies. This study
suggests the potential to further develop bibliometric mapping and topic
modeling techniques to inform research problem selection and resource
allocation.
Comment: To appear in proceedings of the 2014 International Conference on
Social Computing, Behavioral-Cultural Modeling, and Prediction (SBP2014
Serum Biochemical Profile of Captive-Bred Philippine Crocodiles (Crocodylus Mindorensis Schmidt, 1935) Sub-adults
The Philippine crocodile (Crocodylus mindorensis Schmidt, 1935) is a critically endangered species that requires minimally invasive diagnostic tools for assessing its physiological state and health. In the current study, we determined reference ranges for the serum biochemistry of male and female captive-bred C. mindorensis sub-adults. We collected blood samples from the post-occipital venous sinus of six male and seven female captive-bred crocodile sub-adults at the Palawan Wildlife Rescue and Conservation Center and quantified the serum biochemical values for cholesterol, triglycerides, uric acid, glucose, creatinine, aspartate aminotransferase (AST), alanine aminotransferase (ALT), albumin, total protein, and globulin. We defined reference ranges as the central 95% of the concentration values obtained. Uric acid concentrations differed significantly between male and female C. mindorensis sub-adults: levels were higher (P = 0.035) in male sub-adults because of their higher food intake resulting from dominance and aggression during feeding times. Serum biochemical values of C. mindorensis showed both similarities with and variations from those of other crocodile species. We briefly discuss the differences from other species and the influence of factors such as field and laboratory methodologies, environmental conditions, nutritional status, and size class
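The "central 95% of the concentration values" mentioned in the abstract above can be illustrated with a short sketch. It assumes the range is bounded by the 2.5th and 97.5th percentiles with linear interpolation between ordered values; the abstract does not say which estimator the authors used. The function names and the sample numbers are hypothetical and are not data from the study.

```python
def percentile(sorted_values, q):
    """Linearly interpolated q-th percentile (0 <= q <= 100) of a sorted list."""
    if not sorted_values:
        raise ValueError("need at least one value")
    # Fractional position of the percentile within the ordered sample.
    pos = (len(sorted_values) - 1) * q / 100.0
    lo = int(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    frac = pos - lo
    return sorted_values[lo] + (sorted_values[hi] - sorted_values[lo]) * frac


def reference_range(values, coverage=0.95):
    """Return (lower, upper) bounds enclosing the central `coverage` fraction."""
    ordered = sorted(float(v) for v in values)
    # For 95% coverage, 2.5% is cut from each tail.
    tail = (1.0 - coverage) / 2.0 * 100.0
    return percentile(ordered, tail), percentile(ordered, 100.0 - tail)


# Hypothetical concentrations in arbitrary units (illustration only).
example = [1.0, 2.0, 3.0, 4.0, 5.0]
low, high = reference_range(example)
print(f"central 95% range: {low:.2f} to {high:.2f}")
```

With samples as small as the six males and seven females reported, percentile bounds of this kind sit very close to the observed minimum and maximum, so they are best read as rough estimates.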
A Novel Clustering Algorithm Based on Quantum Games
Quantum algorithms have achieved enormous successes during the last
decade. In this paper, we combine the quantum game with the problem of data
clustering, and then develop a quantum-game-based clustering algorithm, in
which data points in a dataset are considered as players who can make decisions
and implement quantum strategies in quantum games. After each round of a
quantum game, each player's expected payoff is calculated. The player then uses a
link-removing-and-rewiring (LRR) function to change its neighbors and adjust
the strength of the links connecting to them in order to maximize its payoff.
Further, the algorithms are discussed and analyzed for two cases of strategies, two
payoff matrices and two LRR functions. The simulation results
demonstrate that data points in datasets are clustered reasonably and
efficiently, and that the clustering algorithms converge quickly.
Moreover, the comparison with other algorithms also provides an indication of
the effectiveness of the proposed approach.
Comment: 19 pages, 5 figures, 5 table