5 research outputs found

Distributional models in the task of hypernym discovery

Author: Ryzhova A.
Sochenkov I.
Yadrintsev V.
Publication venue: 'Springer Science and Business Media LLC'

An approach to the first task of automatic taxonomy construction for the Russian language is described. The task consists of matching unknown input words with hypernyms from an existing taxonomy. We show that useful results can be attained using pre-trained distributional models without additional training. © Springer Nature Switzerland AG 2020

RUDN Repository

Anomaly detection for short texts: Identifying whether your chatbot should switch from goal-oriented conversation to chit-chatting

Author: Bakarov A.
Sochenkov I.
Yadrintsev V.
Publication venue: 'Springer Science and Business Media LLC'

Goal-oriented conversational agents are systems able to converse with humans in natural language to help them reach a certain goal. The number of goals (or domains) about which an agent can converse is limited, and one of the issues is to identify whether a user is talking about an unknown domain (in order to report a misunderstanding or switch to chit-chatting mode). We argue that this issue can be resolved by treating it as an anomaly detection task, a task from the field of machine learning. The scientific community has developed a broad range of methods for this task, but their applicability to short text data has not been investigated before. The aim of this work is to compare the performance of 6 different anomaly detection methods on Russian and English short texts modeling conversational utterances, proposing the first evaluation framework for this task. As a result of the study, we find that a simple threshold on cosine similarity works better than the other methods for both of the considered languages. © Springer Nature Switzerland AG 2018
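
The cosine-similarity threshold named in the abstract could look roughly like the sketch below. The abstract does not say what is compared with what, so this is an illustration under stated assumptions rather than the authors' method: the utterance and the known domains are assumed to be already embedded as vectors, and the names `cosine_similarity`, `is_out_of_domain` and the default threshold of 0.5 are hypothetical.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        # A zero vector has no direction; treat it as dissimilar.
        return 0.0
    return dot / (norm_u * norm_v)


def is_out_of_domain(utterance_vec, domain_vecs, threshold=0.5):
    """Flag an utterance as anomalous when its best similarity to any
    known-domain vector falls below the threshold.

    Assumption: domain_vecs holds one representative vector per known
    domain (for example a centroid); the threshold value is illustrative.
    """
    best = max(
        (cosine_similarity(utterance_vec, d) for d in domain_vecs),
        default=0.0,
    )
    return best < threshold
```

In practice the threshold would be tuned on held-out data for each language; the value used here is only a placeholder.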

RUDN Repository

Fast and Accurate Patent Classification in Search Engines

Author: Bakarov A.
Sochenkov I.
Suvorov R.
Yadrintsev V.
Publication venue: 'IOP Publishing'

This article presents a new approach to large-scale patent classification. The need to classify documents often arises in professional information retrieval systems. In this paper we describe our approach, based on linguistically supported k-nearest neighbors. We experimentally evaluate it on Russian and English datasets and compare it with the modern classification technique fastText. We show that KNN is a viable alternative to traditional text classifiers, achieving comparable accuracy while using fewer additional hardware resources. © Published under licence by IOP Publishing Ltd

RUDN Repository

The Hybrid Method for Accurate Patent Classification

Author: A J Trappey
A Shvets
C D Manning
C J Fall
D Eisinger
E D’hondt
F Piroi
H Schutze
I Moloshnikov
I V Sochenkov
I. V. Sochenkov
K V Vorontsov
M Krier
M Nokel
M S Ageev
P Bojanowski
P Glauner
S Arts
S Ilyinsky
S Verberne
T Grainger
V Yadrintsev
V. V. Yadrintsev
X Zhang
Y-L Chen
Publication venue: 'Pleiades Publishing Ltd'

Crossref

THE SETTLEMENT OF SIBERIA AND THE FAR EAST FROM THE LATE 18TH TO THE EARLY 20TH CENTURY (1795-1917)

Author: Kabuzan V. M.
Kaufman A. A.
Kolesnikov A. D.
Lenin V. I.
Menshikov A. A.
Sklyarov L. F.
Stavrovskiy Ya. F.
Sychevskiy Ye. P.
Tikhonov B. V.
Turchaninov I.
Velichko A. P.
Yadrintsev N. M.
Publication venue: 'Informa UK Limited'

Crossref