521 research outputs found

    Formation of Compressed Flat Electron Beams with High Transverse-Emittance Ratios

    Full text link
    Flat beams -- beams with asymmetric transverse emittances -- have important applications in novel light-source concepts, advanced-acceleration schemes and could possibly alleviate the need for damping rings in lepton colliders. Over the last decade, a flat-beam-generation technique based on the conversion of an angular-momentum-dominated beam was proposed and experimentally tested. In this paper we explore the production of compressed flat beams. We especially investigate and optimize the flat-beam transformation for beams with substantial fractional energy spread. We use as a simulation example the photoinjector of the Fermilab's Advanced Superconducting Test Accelerator (ASTA). The optimizations of the flat beam generation and compression at ASTA were done via start-to-end numerical simulations for bunch charges of 3.2 nC, 1.0 nC and 20 pC at ~37 MeV. The optimized emittances of flat beams with different bunch charges were found to be 0.25 {\mu}m (emittance ratio is ~400), 0.13 {\mu}m, 15 nm before compression, and 0.41 {\mu}m, 0.20 {\mu}m, 16 nm after full compression, respectively with peak currents as high as 5.5 kA for a 3.2-nC flat beam. These parameters are consistent with requirements needed to excite wakefields in asymmetric dielectric-lined waveguides or produce significant photon flux using small-gap micro-undulators.Comment: 17

    Is This a Joke? Detecting Humor in Spanish Tweets

    Full text link
    While humor has been historically studied from a psychological, cognitive and linguistic standpoint, its study from a computational perspective is an area yet to be explored in Computational Linguistics. There exist some previous works, but a characterization of humor that allows its automatic recognition and generation is far from being specified. In this work we build a crowdsourced corpus of labeled tweets, annotated according to its humor value, letting the annotators subjectively decide which are humorous. A humor classifier for Spanish tweets is assembled based on supervised learning, reaching a precision of 84% and a recall of 69%.Comment: Preprint version, without referra

    Exploratory Analysis of Highly Heterogeneous Document Collections

    Full text link
    We present an effective multifaceted system for exploratory analysis of highly heterogeneous document collections. Our system is based on intelligently tagging individual documents in a purely automated fashion and exploiting these tags in a powerful faceted browsing framework. Tagging strategies employed include both unsupervised and supervised approaches based on machine learning and natural language processing. As one of our key tagging strategies, we introduce the KERA algorithm (Keyword Extraction for Reports and Articles). KERA extracts topic-representative terms from individual documents in a purely unsupervised fashion and is revealed to be significantly more effective than state-of-the-art methods. Finally, we evaluate our system in its ability to help users locate documents pertaining to military critical technologies buried deep in a large heterogeneous sea of information.Comment: 9 pages; KDD 2013: 19th ACM SIGKDD Conference on Knowledge Discovery and Data Minin

    Detecting Singleton Review Spammers Using Semantic Similarity

    Full text link
    Online reviews have increasingly become a very important resource for consumers when making purchases. Though it is becoming more and more difficult for people to make well-informed buying decisions without being deceived by fake reviews. Prior works on the opinion spam problem mostly considered classifying fake reviews using behavioral user patterns. They focused on prolific users who write more than a couple of reviews, discarding one-time reviewers. The number of singleton reviewers however is expected to be high for many review websites. While behavioral patterns are effective when dealing with elite users, for one-time reviewers, the review text needs to be exploited. In this paper we tackle the problem of detecting fake reviews written by the same person using multiple names, posting each review under a different name. We propose two methods to detect similar reviews and show the results generally outperform the vectorial similarity measures used in prior works. The first method extends the semantic similarity between words to the reviews level. The second method is based on topic modeling and exploits the similarity of the reviews topic distributions using two models: bag-of-words and bag-of-opinion-phrases. The experiments were conducted on reviews from three different datasets: Yelp (57K reviews), Trustpilot (9K reviews) and Ott dataset (800 reviews).Comment: 6 pages, WWW 201

    Three-Dimensional Analysis of Wakefields Generated by Flat Electron Beams in Planar Dielectric-Loaded Structures

    Full text link
    An electron bunch passing through dielectric-lined waveguide generates Cˇ\check{C}erenkov radiation that can result in high-peak axial electric field suitable for acceleration of a subsequent bunch. Axial field beyond Gigavolt-per-meter are attainable in structures with sub-mm sizes depending on the achievement of suitable electron bunch parameters. A promising configuration consists of using planar dielectric structure driven by flat electron bunches. In this paper we present a three-dimensional analysis of wakefields produced by flat beams in planar dielectric structures thereby extending the work of Reference [A. Tremaine, J. Rosenzweig, and P. Schoessow, Phys. Rev. E 56, No. 6, 7204 (1997)] on the topic. We especially provide closed-form expressions for the normal frequencies and field amplitudes of the excited modes and benchmark these analytical results with finite-difference time-domain particle-in-cell numerical simulations. Finally, we implement a semi-analytical algorithm into a popular particle tracking program thereby enabling start-to-end high-fidelity modeling of linear accelerators based on dielectric-lined planar waveguides.Comment: 12 pages, 2 tables, 10 figure

    Visual Affect Around the World: A Large-scale Multilingual Visual Sentiment Ontology

    Get PDF
    Every culture and language is unique. Our work expressly focuses on the uniqueness of culture and language in relation to human affect, specifically sentiment and emotion semantics, and how they manifest in social multimedia. We develop sets of sentiment- and emotion-polarized visual concepts by adapting semantic structures called adjective-noun pairs, originally introduced by Borth et al. (2013), but in a multilingual context. We propose a new language-dependent method for automatic discovery of these adjective-noun constructs. We show how this pipeline can be applied on a social multimedia platform for the creation of a large-scale multilingual visual sentiment concept ontology (MVSO). Unlike the flat structure in Borth et al. (2013), our unified ontology is organized hierarchically by multilingual clusters of visually detectable nouns and subclusters of emotionally biased versions of these nouns. In addition, we present an image-based prediction task to show how generalizable language-specific models are in a multilingual context. A new, publicly available dataset of >15.6K sentiment-biased visual concepts across 12 languages with language-specific detector banks, >7.36M images and their metadata is also released.Comment: 11 pages, to appear at ACM MM'1

    OpenEssayist: a supply and demand learning analytics tool for drafting academic essays

    Get PDF
    This paper focuses on the use of a natural language analytics engine to provide feedback to students when preparing an essay for summative assessment. OpenEssayist is a real-time learning analytics tool, which operates through the combination of a linguistic analysis engine that processes the text in the essay, and a web application that uses the output of the linguistic analysis engine to generate the feedback. We outline the system itself and present analysis of observed patterns of activity as a cohort of students engaged with the system for their module assignments. We report a significant positive correlation between the number of drafts submitted to the system and the grades awarded for the first assignment. We can also report that this cohort of students gained significantly higher overall grades than the students in the previous cohort, who had no access to OpenEssayist. As a system that is content free, OpenEssayist can be used to support students working in any domain that requires the writing of essays

    Prediction of future hydrological regimes in poorly gauged high altitude basins: the case study of the upper Indus, Pakistan

    Get PDF
    In the mountain regions of the Hindu Kush, Karakoram and Himalaya (HKH) the "third polar ice cap" of our planet, glaciers play the role of "water towers" by providing significant amount of melt water, especially in the dry season, essential for agriculture, drinking purposes, and hydropower production. Recently, most glaciers in the HKH have been retreating and losing mass, mainly due to significant regional warming, thus calling for assessment of future water resources availability for populations down slope. However, hydrology of these high altitude catchments is poorly studied and little understood. Most such catchments are poorly gauged, thus posing major issues in flow prediction therein, and representing in fact typical grounds of application of PUB concepts, where simple and portable hydrological modeling based upon scarce data amount is necessary for water budget estimation, and prediction under climate change conditions. In this preliminarily study, future (2060) hydrological flows in a particular watershed (Shigar river at Shigar, ca. 7000 km<sup>2</sup>), nested within the upper Indus basin and fed by seasonal melt from major glaciers, are investigated. <br><br> The study is carried out under the umbrella of the SHARE-Paprika project, aiming at evaluating the impact of climate change upon hydrology of the upper Indus river. We set up a minimal hydrological model, tuned against a short series of observed ground climatic data from a number of stations in the area, in situ measured ice ablation data, and remotely sensed snow cover data. The future, locally adjusted, precipitation and temperature fields for the reference decade 2050–2059 from <i>CCSM3</i> model, available within the IPCC's panel, are then fed to the hydrological model. We adopt four different glaciers' cover scenarios, to test sensitivity to decreased glacierized areas. The projected flow duration curves, and some selected flow descriptors are evaluated. The uncertainty of the results is then addressed, and use of the model for nearby catchments discussed. The proposed approach is valuable as a tool to investigate the hydrology of poorly gauged high altitude areas, and to project forward their hydrological behavior pending climate change
    • …
    corecore