521 research outputs found
Formation of Compressed Flat Electron Beams with High Transverse-Emittance Ratios
Flat beams -- beams with asymmetric transverse emittances -- have important
applications in novel light-source concepts, advanced-acceleration schemes and
could possibly alleviate the need for damping rings in lepton colliders. Over
the last decade, a flat-beam-generation technique based on the conversion of an
angular-momentum-dominated beam was proposed and experimentally tested. In this
paper we explore the production of compressed flat beams. We especially
investigate and optimize the flat-beam transformation for beams with
substantial fractional energy spread. We use as a simulation example the
photoinjector of the Fermilab's Advanced Superconducting Test Accelerator
(ASTA). The optimizations of the flat beam generation and compression at ASTA
were done via start-to-end numerical simulations for bunch charges of 3.2 nC,
1.0 nC and 20 pC at ~37 MeV. The optimized emittances of flat beams with
different bunch charges were found to be 0.25 {\mu}m (emittance ratio is ~400),
0.13 {\mu}m, 15 nm before compression, and 0.41 {\mu}m, 0.20 {\mu}m, 16 nm
after full compression, respectively with peak currents as high as 5.5 kA for a
3.2-nC flat beam. These parameters are consistent with requirements needed to
excite wakefields in asymmetric dielectric-lined waveguides or produce
significant photon flux using small-gap micro-undulators.Comment: 17
Is This a Joke? Detecting Humor in Spanish Tweets
While humor has been historically studied from a psychological, cognitive and
linguistic standpoint, its study from a computational perspective is an area
yet to be explored in Computational Linguistics. There exist some previous
works, but a characterization of humor that allows its automatic recognition
and generation is far from being specified. In this work we build a
crowdsourced corpus of labeled tweets, annotated according to its humor value,
letting the annotators subjectively decide which are humorous. A humor
classifier for Spanish tweets is assembled based on supervised learning,
reaching a precision of 84% and a recall of 69%.Comment: Preprint version, without referra
Exploratory Analysis of Highly Heterogeneous Document Collections
We present an effective multifaceted system for exploratory analysis of
highly heterogeneous document collections. Our system is based on intelligently
tagging individual documents in a purely automated fashion and exploiting these
tags in a powerful faceted browsing framework. Tagging strategies employed
include both unsupervised and supervised approaches based on machine learning
and natural language processing. As one of our key tagging strategies, we
introduce the KERA algorithm (Keyword Extraction for Reports and Articles).
KERA extracts topic-representative terms from individual documents in a purely
unsupervised fashion and is revealed to be significantly more effective than
state-of-the-art methods. Finally, we evaluate our system in its ability to
help users locate documents pertaining to military critical technologies buried
deep in a large heterogeneous sea of information.Comment: 9 pages; KDD 2013: 19th ACM SIGKDD Conference on Knowledge Discovery
and Data Minin
Detecting Singleton Review Spammers Using Semantic Similarity
Online reviews have increasingly become a very important resource for
consumers when making purchases. Though it is becoming more and more difficult
for people to make well-informed buying decisions without being deceived by
fake reviews. Prior works on the opinion spam problem mostly considered
classifying fake reviews using behavioral user patterns. They focused on
prolific users who write more than a couple of reviews, discarding one-time
reviewers. The number of singleton reviewers however is expected to be high for
many review websites. While behavioral patterns are effective when dealing with
elite users, for one-time reviewers, the review text needs to be exploited. In
this paper we tackle the problem of detecting fake reviews written by the same
person using multiple names, posting each review under a different name. We
propose two methods to detect similar reviews and show the results generally
outperform the vectorial similarity measures used in prior works. The first
method extends the semantic similarity between words to the reviews level. The
second method is based on topic modeling and exploits the similarity of the
reviews topic distributions using two models: bag-of-words and
bag-of-opinion-phrases. The experiments were conducted on reviews from three
different datasets: Yelp (57K reviews), Trustpilot (9K reviews) and Ott dataset
(800 reviews).Comment: 6 pages, WWW 201
Three-Dimensional Analysis of Wakefields Generated by Flat Electron Beams in Planar Dielectric-Loaded Structures
An electron bunch passing through dielectric-lined waveguide generates
erenkov radiation that can result in high-peak axial electric field
suitable for acceleration of a subsequent bunch. Axial field beyond
Gigavolt-per-meter are attainable in structures with sub-mm sizes depending on
the achievement of suitable electron bunch parameters. A promising
configuration consists of using planar dielectric structure driven by flat
electron bunches. In this paper we present a three-dimensional analysis of
wakefields produced by flat beams in planar dielectric structures thereby
extending the work of Reference [A. Tremaine, J. Rosenzweig, and P. Schoessow,
Phys. Rev. E 56, No. 6, 7204 (1997)] on the topic. We especially provide
closed-form expressions for the normal frequencies and field amplitudes of the
excited modes and benchmark these analytical results with finite-difference
time-domain particle-in-cell numerical simulations. Finally, we implement a
semi-analytical algorithm into a popular particle tracking program thereby
enabling start-to-end high-fidelity modeling of linear accelerators based on
dielectric-lined planar waveguides.Comment: 12 pages, 2 tables, 10 figure
Recommended from our members
Networks and Natural Language Processing
Article discussing networks and natural language processing. The authors present some of the most successful graph-based representations and algorithms used in language processing and try to explain how and why they work
Visual Affect Around the World: A Large-scale Multilingual Visual Sentiment Ontology
Every culture and language is unique. Our work expressly focuses on the
uniqueness of culture and language in relation to human affect, specifically
sentiment and emotion semantics, and how they manifest in social multimedia. We
develop sets of sentiment- and emotion-polarized visual concepts by adapting
semantic structures called adjective-noun pairs, originally introduced by Borth
et al. (2013), but in a multilingual context. We propose a new
language-dependent method for automatic discovery of these adjective-noun
constructs. We show how this pipeline can be applied on a social multimedia
platform for the creation of a large-scale multilingual visual sentiment
concept ontology (MVSO). Unlike the flat structure in Borth et al. (2013), our
unified ontology is organized hierarchically by multilingual clusters of
visually detectable nouns and subclusters of emotionally biased versions of
these nouns. In addition, we present an image-based prediction task to show how
generalizable language-specific models are in a multilingual context. A new,
publicly available dataset of >15.6K sentiment-biased visual concepts across 12
languages with language-specific detector banks, >7.36M images and their
metadata is also released.Comment: 11 pages, to appear at ACM MM'1
OpenEssayist: a supply and demand learning analytics tool for drafting academic essays
This paper focuses on the use of a natural language analytics engine to provide feedback to students when preparing an essay for summative assessment. OpenEssayist is a real-time learning analytics tool, which operates through the combination of a linguistic analysis engine that processes the text in the essay, and a web application that uses the output of the linguistic analysis engine to generate the feedback. We outline the system itself and present analysis of observed patterns of activity as a cohort of students engaged with the system for their module assignments. We report a significant positive correlation between the number of drafts submitted to the system and the grades awarded for the first assignment. We can also report that this cohort of students gained significantly higher overall grades than the students in the previous cohort, who had no access to OpenEssayist. As a system that is content free, OpenEssayist can be used to support students working in any domain that requires the writing of essays
Prediction of future hydrological regimes in poorly gauged high altitude basins: the case study of the upper Indus, Pakistan
In the mountain regions of the Hindu Kush, Karakoram and Himalaya (HKH) the "third polar ice cap" of our planet, glaciers play the role of "water towers" by providing significant amount of melt water, especially in the dry season, essential for agriculture, drinking purposes, and hydropower production. Recently, most glaciers in the HKH have been retreating and losing mass, mainly due to significant regional warming, thus calling for assessment of future water resources availability for populations down slope. However, hydrology of these high altitude catchments is poorly studied and little understood. Most such catchments are poorly gauged, thus posing major issues in flow prediction therein, and representing in fact typical grounds of application of PUB concepts, where simple and portable hydrological modeling based upon scarce data amount is necessary for water budget estimation, and prediction under climate change conditions. In this preliminarily study, future (2060) hydrological flows in a particular watershed (Shigar river at Shigar, ca. 7000 km<sup>2</sup>), nested within the upper Indus basin and fed by seasonal melt from major glaciers, are investigated. <br><br> The study is carried out under the umbrella of the SHARE-Paprika project, aiming at evaluating the impact of climate change upon hydrology of the upper Indus river. We set up a minimal hydrological model, tuned against a short series of observed ground climatic data from a number of stations in the area, in situ measured ice ablation data, and remotely sensed snow cover data. The future, locally adjusted, precipitation and temperature fields for the reference decade 2050–2059 from <i>CCSM3</i> model, available within the IPCC's panel, are then fed to the hydrological model. We adopt four different glaciers' cover scenarios, to test sensitivity to decreased glacierized areas. The projected flow duration curves, and some selected flow descriptors are evaluated. The uncertainty of the results is then addressed, and use of the model for nearby catchments discussed. The proposed approach is valuable as a tool to investigate the hydrology of poorly gauged high altitude areas, and to project forward their hydrological behavior pending climate change
- …