Search CORE

180 research outputs found

MultiLexNorm: A Shared Task on Multilingual Lexical Normalization

Author: Baldwin T
Caselli T
Ljubešić N
Mahendra R
Muller B
Plank B
Ramponi A
Roncal ISV
Sidorenko W
van der Goot R
Workshop on Noisy User-Generated Text
Zubiaga A
Çetinoğlu Ö
Çolakoğlu T
Publication venue
Publication date: 01/01/2021
Field of study

Lexical normalization is the task of transforming an utterance into its standardized form. This task is beneficial for downstream analysis, as it provides a way to harmonize (often spontaneous) linguistic variation. Such variation is typical for social media on which information is shared in a multitude of ways, including diverse languages and code-switching. Since the seminal work of Han and Baldwin (2011) a decade ago, lexical normalization has attracted attention in English and multiple other languages. However, there exists a lack of a common benchmark for comparison of systems across languages with a homogeneous data and evaluation setup. The MULTILEXNORM shared task sets out to fill this gap. We provide the largest publicly available multilingual lexical normalization benchmark including 12 language variants. We propose a homogenized evaluation setup with both intrinsic and extrinsic evaluation. As extrinsic evaluation, we use dependency parsing and part-of-speech tagging with adapted evaluation metrics (a-LAS, a-UAS, and a-POS) to account for alignment discrepancies. The shared task hosted at W-NUT 2021 attracted 9 participants and 18 submissions. The results show that neural normalization systems outperform the previous state-of-the-art system by a large margin. Downstream parsing and part-of-speech tagging performance is positively affected but to varying degrees, with improvements of up to 1.72 a-LAS, 0.85 a-UAS, and 1.54 a-POS for the winning system

Queen Mary Research Online

Authenticating the Query Results of Text Search Engines

Author: Baeza-Yates R.
Cheng W.
Devanbu P. T.
Li F.
Merkle R.
Papadopoulos S.
Pfleeger C. P.
Proposed Federal Information Processing DSS.
Text TREC.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2008
Field of study

The number of successful attacks on the Internet shows that it is very difficult to guarantee the security of online search engines. A breached server that is not detected in time may return incorrect results to the users. To prevent that, we introduce a methodology for generating an integrity proof for each search result. Our solution is targeted at search engines that perform similarity-based document retrieval, and utilize an inverted list implementation (as most search engines do). We formulate the properties that define a correct result, map the task of processing a text search query to adaptations of existing threshold-based algorithms, and devise an authentication scheme for checking the validity of a result. Finally, we confirm the efficiency and practicality of our solution through an empirical evaluation with real documents and benchmark queries. 1

CiteSeerX

Crossref

Institutional Knowledge at Singapore Management University

Direct, Indirect and Collider Detection of Neutralino Dark Matter In SUSY Models with Non-universal Higgs Masses

Author: A. Bottino
A. Burkert
A. Djouadi
A. El-Zant
A. Morselli
A.S. Eddington
Alexander Belyaev
Atlas collaboration
Azar Mustafayev
B. Allanach
CMS collaboration
D. Auto
D. Denegri .
D.G. Cerdeno
D.G. Cerdeno
D.N. Spergel .
Dawson
F. Moortgat
F. Moortgat
F.E. Paige
for a review
for a summary
For a text book review of supersymmetry see M. Drees
for reviews of SUSY phenomenology
Galprop numerical package
H. Baer
H. Baer
H. Baer
H. Baer
H. Baer
H. Baer
H. Baer
H. Baer
H. Baer
H. Baer
H. Baer
H. Baer .
Higgs Working Group collaboration
Howard Baer
J. Edsjo
J. Edsjö
J. Edsjö
J.A. Bagger
J.F. Navarro .
L. Bergstrom
L. Roszkowski
M. Battaglia
M. Bisset .
M. Drees
M. Drees
N. Ohta
P. Gondolo .
P. Ullio
R. Jeannerot
S. Profumo
S. Profumo
S. Profumo
S. Profumo
see also
see also M. Misiak
See constraints in e.g. H. Baer
Stefano Profumo
Super-Kamiokande collaboration
T. Sjostrand
WMAP collaboration
X. Tata
Xerxes Tata
Y. Mambrini
Publication venue: 'IOP Publishing'
Publication date: 01/04/2005
Field of study

In supersymmetric models with gravity-mediated SUSY breaking, universality of soft SUSY breaking sfermion masses m_0 is motivated by the need to suppress unwanted flavor changing processes. The same motivation, however, does not apply to soft breaking Higgs masses, which may in general have independent masses from matter scalars at the GUT scale. We explore phenomenological implications of both the one-parameter and two-parameter non-universal Higgs mass models (NUHM1 and NUHM2), and examine the parameter ranges compatible with Omega_CDM h^2, BF(b --> s,gamma) and (g-2)_mu constraints. In contrast to the mSUGRA model, in both NUHM1 and NUHM2 models, the dark matter A-annihilation funnel can be reached at low values of tan(beta), while the higgsino dark matter annihilation regions can be reached for low values of m_0. We show that there may be observable rates for indirect and direct detection of neutralino cold dark matter in phenomenologically aceptable ranges of parameter space. We also examine implications of the NUHM models for the Fermilab Tevatron, the CERN LHC and a Sqrt(s)=0.5-1 TeV e+e- linear collider. Novel possibilities include: very light s-top_R, s-charm_R squark and slepton_L masses as well as light charginos and neutralinos and H, A and H^+/- Higgs bosons.Comment: LaTeX, 48pages, 26 Figures. The version with high resolution Figures is available at http://hep.pa.msu.edu/belyaev/public/projects/nuhm/nuhm.p

arXiv.org e-Print Archive

Crossref

CERN Document Server

Embellishing Text Search Queries to Protect User Privacy

Author: Adar E.
Baeza-Yates R.
Barbaro M.
Benaloh J. C.
Dumais S. T.
Husbands P.
Joho H.
Kushilevitz E.
Song D. X.
Text TREC.
Publication venue: 'VLDB Endowment'
Publication date: 01/01/2010
Field of study

Users of text search engines are increasingly wary that their activities may disclose confidential information about their business or personal profiles. It would be desirable for a search engine to perform document retrieval for users while protecting their intent. In this paper, we identify the privacy risks arising from semantically related search terms within a query, and from recurring highspecificity query terms in a search session. To counter the risks, we propose a solution for a similarity text retrieval system to offer anonymity and plausible deniability for the query terms, and hence the user intent, without degrading the system’s precision-recall performance. The solution comprises a mechanism that embellishes each user query with decoy terms that exhibit similar specificity spread as the genuine terms, but point to plausible alternative topics. We also provide an accompanying retrieval scheme that enables the search engine to compute the encrypted document relevance scores from only the genuine search terms, yet remain oblivious to their distinction from the decoys. Empirical evaluation results are presented to substantiate the effectiveness of our solution. 1

CiteSeerX

Crossref

Institutional Knowledge at Singapore Management University

Comment to the SEC in Support of the Enhanced Disclosure of Patent and Technology License Information

Author: === Rfc Text
Aneta Ferguson
Arti K. Rai
Arti Rai
Bharat N Anand
Bronwyn H Hall
Bronwyn H Hall
Carol Corrado
Colleen Chien
Colleen Chien
Colleen V. Chien
Deepak Hegde
Dotan Oliar
Jorge L Contreras
Jorge L. Contreras
P G Sandner
Saurabh Vishnubhakat
Stuart Graham
Stuart J.H. Graham
Thomas R Varner
Zvi Griliches
Publication venue: 'Elsevier BV'
Publication date: 01/01/2016
Field of study

Crossref

Managing Spoilers in a Hybrid War: The Democratic Republic of Congo (1996-2010)

Author: All-Inclusive Agreement (Full Text in French Version)
Alusala N.
Annette Seegers
Boshoff Henri
Carayannis T.
Cilliers J.
Dagne T.
David Fuamba
Douma P.
Doyle M. W.
Hampson F. O.
Hibou B.
Hoebeke H.
Hugo J.-F.
Jackson S.
Khadiagala G.
Kisangani E.
Lemarchand R.
Mamdani M.
Masako Yonekawa
Nest M. W.
Nzongola-Ntalaja G.
Nzongola-Ntalaja G.
Prunier G.
Rafti M.
Reed W. C.
Renton D.
Rogier E.
Specker L.
Stedman S.
Swart G.
Swart G.
Thom W. G.
Turner T.
UNHCR
Vlassenroot K.
Vlassenroot Koen
Wadada-Nabudere D.
Weiss H.
Wolters S.
Young C.
Publication venue: 'Informa UK Limited'
Publication date: 01/01/2013
Field of study

Scholarship on the management of spoilers in a hybrid type of conflict is almost non-existent. Through an examination of the recent Congolese wars and peace efforts (1996–2010), we develop an understanding of how spoilers are managed in a conflict characterised by both interstate and intrastate dynamics. Certainly, more strategies of dealing with spoiler behaviours in this type of conflict are likely to emerge as similar cases are investigated, but our discussion recommends these non-related, but strongly interacting principles: the practice of inclusivity, usually preferred in the management of spoilers, is more complex, and in fact ineffective, particularly when concerned groups’ internal politics and supportive alliances are unconventional. Because holding elections is often deemed indispensable in peacemaking efforts, it is vital that total spoilers be prevented from winning or disrupting them. The toughest challenge is the protection of civilians, especially when the state lacks a monopoly on the use of violence and governance remains partitioned across the country

Cape Town University OpenUCT

Crossref

A robustness testing approach for SOAP Web services

Author: A Avizienis
A Mukherjee
Bartolini C Bertolino A, Marchetti E, Polini A (2009) WS-TAXI: A WSDL-based testing tool for Web services. In: International conference on software testing verification and validation, ICST
D Stuttard
DA Chappel
E Weyuker
F Curbera
F Sebastiani
I Lee
Laranjeiro N Oliveira R, Vieira M (2010) Applying text classification algorithms in Web services robustness testing. In: 29th IEEE international symposium on reliable distributed systems (SRDS
MG Fugini
T Erl
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Lisbon in the sixteenth century: decoding the Chafariz d’el Rei

Author: Ariès and Duby
Being a galley slave was no monopoly of ‘Moorish’ or ‘white slaves’
Bodian Miriam
Brandão João
Brearley Mary
Caetano Joaquim Oliveira
Cf. in the Seville census of 1565 slaves accounted for 7.4 per cent of the population, Antonio Domínguez Ortiz
Correa Manuel
Dacos Nicole
de Oliveira Marques H.
Durão Vitor C. M.
Dutra Francis A.
For a critical discussion of this text
Francisco de Quevedo
Gibson Walter S.
Jorge Ferreira de Vasconcellos
Laura R. Bass
Lavanha J. B.
Lavanha João Baptista
Noted in Júlio de Castilho
Repr
Repr. in Isabel Castro
Saunders
Serrão ‘A Imagem do Mar e da capital do império no século XVI’
Silva Rodrigo Banha da
Stefan Halikowski Smith
Sullivan M. A.
Vecellio
‘[T]here are to be seen in that city beautiful jennets which the Portuguese buy for any price’ cited in Herculano
‘Annotationes in abusus sacramentorum’
‘Letter to Jacques Latomus’
‘Majestade e Grandezas’
Publication venue: 'SAGE Publications'
Publication date: 01/01/2018
Field of study

Crossref

Cronfa at Swansea University