Search CORE

1,138 research outputs found

SECaps: A Sequence Enhanced Capsule Model for Charge Prediction

Author: Chao-Lin Liu
CL Liu
DM Katz
F Kort
G Salton
JA Suykens
R Keown
S Hochreiter
SS Nagel
YH Liu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 25/05/2019
Field of study

Automatic charge prediction aims to predict appropriate final charges according to the fact descriptions for a given criminal case. Automatic charge prediction plays a critical role in assisting judges and lawyers to improve the efficiency of legal decisions, and thus has received much attention. Nevertheless, most existing works on automatic charge prediction perform adequately on high-frequency charges but are not yet capable of predicting few-shot charges with limited cases. In this paper, we propose a Sequence Enhanced Capsule model, dubbed as SECaps model, to relieve this problem. Specifically, following the work of capsule networks, we propose the seq-caps layer, which considers sequence information and spatial information of legal texts simultaneously. Then we design a attention residual unit, which provides auxiliary information for charge prediction. In addition, our SECaps model introduces focal loss, which relieves the problem of imbalanced charges. Comparing the state-of-the-art methods, our SECaps model obtains 4.5% and 6.4% absolutely considerable improvements under Macro F1 in Criminal-S and Criminal-L respectively. The experimental results consistently demonstrate the superiorities and competitiveness of our proposed model.Comment: 13 pages, 3figures, 5 table

arXiv.org e-Print Archive

Crossref

Non-Compositional Term Dependence for Information Retrieval

Author: Fujita S.
Jeffreys H.
Jurafsky D.
Katz G.
Kiela D.
Krcmár L.
Metzler D. P.
Michelbacher L.
Pederson J.
Reddy S.
Reddy S.
Salehi B.
Salton G.
Salton G.
Singhal A.
Sparck-Jones K.
Strzalkowski T.
Thomason R. H.
Walde S. Schulte
Yu C. T.
Zhai C.
Publication venue
Publication date: 01/01/2015
Field of study

Modelling term dependence in IR aims to identify co-occurring terms that are too heavily dependent on each other to be treated as a bag of words, and to adapt the indexing and ranking accordingly. Dependent terms are predominantly identified using lexical frequency statistics, assuming that (a) if terms co-occur often enough in some corpus, they are semantically dependent; (b) the more often they co-occur, the more semantically dependent they are. This assumption is not always correct: the frequency of co-occurring terms can be separate from the strength of their semantic dependence. E.g. "red tape" might be overall less frequent than "tape measure" in some corpus, but this does not mean that "red"+"tape" are less dependent than "tape"+"measure". This is especially the case for non-compositional phrases, i.e. phrases whose meaning cannot be composed from the individual meanings of their terms (such as the phrase "red tape" meaning bureaucracy). Motivated by this lack of distinction between the frequency and strength of term dependence in IR, we present a principled approach for handling term dependence in queries, using both lexical frequency and semantic evidence. We focus on non-compositional phrases, extending a recent unsupervised model for their detection [21] to IR. Our approach, integrated into ranking using Markov Random Fields [31], yields effectiveness gains over competitive TREC baselines, showing that there is still room for improvement in the very well-studied area of term dependence in IR

arXiv.org e-Print Archive

CiteSeerX

Crossref

Copenhagen University Research Information System

VBN

Automatic Discovery of Complementary Learning Resources

Author: B. Ashraf
C. Romero
G. Adomavicius
G. Salton
G. Salton
H. Cunningham
H. Drachsler
H.-C. Schmitz
H.C. Schmitz
M. Wolpers
P. Lops
P. Resnick
R. Burke
R. Mazza
R. Mazza
S. Valsamidis
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011
Field of study

Proceedings of: 6th European Conference of Technology Enhanced Learning, EC-TEL 2011, Palermo, Italy, September 20-23, 2011.Students in a learning experience can be seen as a community working simultaneously (and in some cases collaboratively) in a set of activities. During these working sessions, students carry out numerous actions that affect their learning. But those actions happening outside a class or the Learning Management System cannot be easily observed. This paper presents a technique to widen the observability of these actions. The set of documents browsed by the students in a course was recorded during a period of eight weeks. These documents are then processed and the set with highest similarity with the course notes are selected and recommended back to all the students. The main problem is that this user community visits thousands of documents and only a small percent of them are suitable for recommendation. Using a combination of lexican analysis and information retrieval techniques, a fully automatic procedure to analyze these documents, classify them and select the most relevant ones is presented. The approach has been validated with an empirical study in an undergraduate engineering course with more than one hundred students. The recommended resources were rated as "relevant to the course" by the seven instructors with teaching duties in the course.Work partially funded by the Learn3 project, “Plan Nacional de I+D+I TIN2008-05163/TSI”, the Acción Integrada Ref. DE2009-0051, the “Emadrid: Investigación y desarrollo de tecnologías para el e-learning en la Comunidad de Madrid” project (S2009/TIC-1650) and TELMA Project (Plan Avanza TSI-020110-2009-85)

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Universidad Carlos III de Madrid e-Archivo

Off the Beaten Path: Let's Replace Term-Based Retrieval with k-NN Search

Author: Andoni A.
Beyer K.
Broder A. Z.
Brown P. F.
Fried D.
Le Q.
Mikolov T.
Mu Y.
Muja M.
Petrović S.
Riezler S.
Salton G.
Wang J.
Weber R.
Yang L.
Yao X.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 30/10/2016
Field of study

Retrieval pipelines commonly rely on a term-based search to obtain candidate records, which are subsequently re-ranked. Some candidates are missed by this approach, e.g., due to a vocabulary mismatch. We address this issue by replacing the term-based search with a generic k-NN retrieval algorithm, where a similarity function can take into account subtle term associations. While an exact brute-force k-NN search using this similarity function is slow, we demonstrate that an approximate algorithm can be nearly two orders of magnitude faster at the expense of only a small loss in accuracy. A retrieval pipeline using an approximate k-NN search can be more effective and efficient than the term-based pipeline. This opens up new possibilities for designing effective retrieval pipelines. Our software (including data-generating code) and derivative data based on the Stack Overflow collection is available online

arXiv.org e-Print Archive

Crossref

Scipedia

Using Synchronic and Diachronic Relations for Summarizing Multiple Documents Describing Evolving Events

Author: B. Endres-Niggemeyer
Constantin Halatsis
D. Marcu
D. Marcu
D. R. Radev
E. Reiter
E. Reiter
G. Salton
H. P. Edmundson
H. P. Luhn
H. S. Pinto
I. H. Witten
I. Mani
I. Mani
M. Taboada
Panagiotis Stamatopoulos
R. Grishman
S. D. Afantenos
S. D. Afantenos
S. D. Afantenos
S. D. Afantenos
S. Pinker
Stergos D. Afantenos
Vangelis Karkaletsis
W. C. Mann
W. G. Lehnert
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 18/10/2007
Field of study

In this paper we present a fresh look at the problem of summarizing evolving events from multiple sources. After a discussion concerning the nature of evolving events we introduce a distinction between linearly and non-linearly evolving events. We present then a general methodology for the automatic creation of summaries from evolving events. At its heart lie the notions of Synchronic and Diachronic cross-document Relations (SDRs), whose aim is the identification of similarities and differences between sources, from a synchronical and diachronical perspective. SDRs do not connect documents or textual elements found therein, but structures one might call messages. Applying this methodology will yield a set of messages and relations, SDRs, connecting them, that is a graph which we call grid. We will show how such a grid can be considered as the starting point of a Natural Language Generation System. The methodology is evaluated in two case-studies, one for linearly evolving events (descriptions of football matches) and another one for non-linearly evolving events (terrorist incidents involving hostages). In both cases we evaluate the results produced by our computational systems.Comment: 45 pages, 6 figures. To appear in the Journal of Intelligent Information System

arXiv.org e-Print Archive

A non-intrusive movie recommendation system

Author: A. Kontostathis
G. Adomavicius
G. Linden
G. Salton
G. Yunhua
L. Po
M. Balabanovic
R. Burke
R. Gemulla
R. Trillo
R.J. Mooney
S. Debnath
S. Deerwester
S. Sorrentino
S.T. Dumais
Y. Koren
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

Several recommendation systems have been developed to support the user in choosing an interesting movie from multimedia repositories. The widely utilized collaborative-filtering systems focus on the analysis of user profiles or user ratings of the items. However, these systems decrease their performance at the start-up phase and due to privacy issues, when a user hides most of his personal data. On the other hand, content-based recommendation systems compare movie features to suggest similar multimedia contents; these systems are based on less invasive observations, however they find some difficulties to supply tailored suggestions. In this paper, we propose a plot-based recommendation system, which is based upon an evaluation of similarity among the plot of a video that was watched by the user and a large amount of plots that is stored in a movie database. Since it is independent from the number of user ratings, it is able to propose famous and beloved movies as well as old or unheard movies/programs that are still strongly related to the content of the video the user has watched. We experimented different methodologies to compare natural language descriptions of movies (plots) and evaluated the Latent Semantic Analysis (LSA) to be the superior one in supporting the selection of similar plots. In order to increase the efficiency of LSA, different models have been experimented and in the end, a recommendation system that is able to compare about two hundred thousands movie plots in less than a minute has been developed

Crossref

Archivio istituzionale della ricerca - Università di Modena e Reggio Emilia

Multi-score Learning for Affect Recognition: the Case of Body Postures

Author: A. Argyriou
A. Camurri
A. Kleinsmith
A. Kleinsmith
D.F. Specht
G. Salton
H.M. Paterson
J. Fürnkranz
J. Moody
J. Wagner
K.V. Mardia
M. Pantic
M. Zhang
P.R. Silva De
R. Rosipal
R.L. Mandryk
R.W. Picard
S. Jong
Z. Zeng
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011
Field of study

An important challenge in building automatic affective state recognition systems is establishing the ground truth. When the groundtruth is not available, observers are often used to label training and testing sets. Unfortunately, inter-rater reliability between observers tends to vary from fair to moderate when dealing with naturalistic expressions. Nevertheless, the most common approach used is to label each expression with the most frequent label assigned by the observers to that expression. In this paper, we propose a general pattern recognition framework that takes into account the variability between observers for automatic affect recognition. This leads to what we term a multi-score learning problem in which a single expression is associated with multiple values representing the scores of each available emotion label. We also propose several performance measurements and pattern recognition methods for this framework, and report the experimental results obtained when testing and comparing these methods on two affective posture datasets

Crossref

Brunel University Research Archive

The molecular identity of the TLQP-21 peptide receptor

Author: Bartolomucci A.
Ghosh V.
Nguyen M. E.
Pallais J. P.
Razzoli M.
Rodriguez P.
Sahu B. S.
Salton S. R.
Sham Y. Y.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2021
Field of study

The TLQP-21 neuropeptide has been implicated in functions as diverse as lipolysis, neurodegeneration and metabolism, thus suggesting an important role in several human diseases. Three binding targets have been proposed for TLQP-21: C3aR1, gC1qR and HSPA8. The aim of this review is to critically evaluate the molecular identity of the TLQP-21 receptor and the proposed multi-receptor mechanism of action. Several studies confirm a critical role for C3aR1 in TLQP-21 biological activity and a largely conserved mode of binding, receptor activation and signaling with C3a, its first-identified endogenous ligand. Conversely, data supporting a role of gC1qR and HSPA8 in TLQP-21 activity remain limited, with no signal transduction pathways being described. Overall, C3aR1 is the only receptor for which a necessary and sufficient role in TLQP-21 activity has been confirmed thus far. This conclusion calls into question the validity of a multi-receptor mechanism of action for TLQP-21 and should inform future studies

Archivio istituzionale della Ricerca - Università degli Studi di Parma

Host-Based Treatments for Severe COVID-19

Author: Barbieri M.
Bozzi C.
Confalonieri M.
Confalonieri P.
Hughes M.
Lerda S.
Mondini L.
Pozzan R.
Ruaro B.
Salton F.
Tavano S.
Trotta L.
Publication venue
Publication date: 01/01/2023
Field of study

COVID-19 has been a global health problem since 2020. There are different spectrums of manifestation of this disease, ranging from asymptomatic to extremely severe forms requiring admission to intensive care units and life-support therapies, mainly due to severe pneumonia. The progressive understanding of this disease has allowed researchers and clinicians to implement different therapeutic alternatives, depending on both the severity of clinical involvement and the causative molecular mechanism that has been progressively explored. In this review, we analysed the main therapeutic options available to date based on modulating the host inflammatory response to SARS-CoV-2 infection in patients with severe and critical illness. Although current guidelines are moving toward a personalised treatment approach titrated on the timing of presentation, disease severity, and laboratory parameters, future research is needed to identify additional biomarkers that can anticipate the disease course and guide targeted interventions on an individual basis

Archivio istituzionale della ricerca - Università di Trieste