Search CORE

11 research outputs found

A Spinning Wheel for YARN: User Interface for a Crowdsourced Thesaurus

Author: Braslavski P.
Mukhin M.
Ustalov D.
Браславский П. И.
Мухин М. Ю.
Усталов Д. А.
Publication venue
Publication date: 01/01/2014
Field of study

YARN (Yet Another RussNet) project started in 2013 aims at creating a large open thesaurus for Russian using crowdsourcing. This paper describes synset assembly interface developed within the project — motivation behind it, design, usage scenarios, implementation details, and first experimental results

Institutional repository of Ural Federal University named after the first President of Russia B.N.Yeltsin

TOWARDS WORD SENSES AND LINKS BETWEEN THEM

Author: Ustalov D. A.
Усталов Д. A.
Publication venue: УрФУ
Publication date: 01/01/2017
Field of study

In this study, we demonstrate an unsupervised approach for constructing a semantic network uniting word senses (or word concepts) rather than the coarse-grained con-cepts. The reported study was funded by RFBR (project no. 16-37-00354 мол_a) and by RFH (project no. 16-04-12019).Исследование выполнено при финансовой поддержке РФФИ в рамках науч-ного проекта № 16-37-00354 мол_а и при финансовой поддержке РГНФ в рамках научного проекта № 16-04-12019 «Интеграция тезаурусов RussNet и YARN»

Institutional repository of Ural Federal University named after the first President of Russia B.N.Yeltsin

CROWDSOURCING AS A HUMAN-COMPUTER SYSTEM WITH FEEDBACK

Author: Ustalov D. A.
Усталов Д. А.
Publication venue: Уральский федеральный университет
Publication date: 01/01/2015
Field of study

Crowdsourcing is an established approach for such problems as data gathering, annotation, cleaning, etc. Given a set of simple and verifiable tasks, many participants execute them voluntarily or on a paid basis. Since the resources are constrained, it is crucial to evaluate the effort of each participant and to focus the crowdsourcing process. We discuss the representation of crowdsourcing as a human-computer system with feedback and propose a reference model of such a system.Реализация предложенного подхода выполняется в рамках открытого проекта Yet Another RussNet [1]. Работа поддержана грантом РГНФ № 13-04-12020 «Новый открытый электронный тезаурус русского языка»

Institutional repository of Ural Federal University named after the first President of Russia B.N.Yeltsin

Коллективные потоковые вычисления: реляционные модели и алгоритмы

Author: D. Ustalov A.
Д. Усталов А.
Publication venue: 'P.G. Demidov Yaroslavl State University'
Publication date: 20/04/2016
Field of study

Recently, microtask crowdsourcing has become a popular approach for addressing various data mining problems. Crowdsourcing workflows for approaching such problems are composed of several data processing stages which require consistent representation for making the work reproducible. This paper is devoted to the problem of reproducibility and formalization of the microtask crowdsourcing process. A computational model for microtask crowdsourcing based on an extended relational model and a dataflow computational model has been proposed. The proposed collaborative dataflow computational model is designed for processing the input data sources by executing annotation stages and automatic synchronization stages simultaneously. Data processing stages and connections between them are expressed by using collaborative computation workflows represented as loosely connected directed acyclic graphs. A synchronous algorithm for executing such workflows has been described. The computational model has been evaluated by applying it to two tasks from the computational linguistics field: concept lexicalization refining in electronic thesauri and establishing hierarchical relations between such concepts. The “Add–Remove–Confirm” procedure is designed for adding the missing lexemes to the concepts while removing the odd ones. The “Genus–Species–Match” procedure is designed for establishing “is-a” relations between the concepts provided with the corresponding word pairs. The experiments involving both volunteers from popular online social networks and paid workers from crowdsourcing marketplaces confirm applicability of these procedures for enhancing lexical resources. В последнее время краудсорсинг на основе выполения микрозадач получил широкое применение в области анализа неструктурированных данных. Разрабатываются специализированные методики, состоящие из множества этапов обработки исходных данных, требующих согласованности их представления для обеспечения воспроизводимости работы. Данная статья посвящена решению проблемы воспроизводимости и формализации процесса краудсорсинга микрозадачами. Предложена модель коллективных потоковых вычислений на основе расширенной реляционной модели и потоковой модели вычислений. Модель предназначена для обработки исходных данных в виде реляционных отношений путем параллельного выполнения этапов разметки микрозадачами и этапов автоматической синхронизации. Этапы обработки данных и связи между ними записываются с использованием схемы коллективных вычислений, представляющей собой слабо связный ориентированный ациклический граф. Описан синхронный алгоритм выполнения схем коллективных вычислений. Продемонстрированы приложения модели в области компьютерной лингвистики для уточнения лексикализации понятий в электронных тезаурусах и построения родо-видовых отношений между понятиями при помощи краудсорсинга. Процедура «добавить–удалить–подтвердить» позволяет внести в лексикализацию понятий недостающие лексемы и исключить посторонние. Процедура «род–вид–сопоставить» позволяет сформировать гипо-гиперонимические отношения между понятиями на основе соответствующих родо-видовых пар слов. Результаты экспериментов на материалах открытого электронного тезауруса русского языка подтверждают применимость разработанных процедур для развития лексических ресурсов. В экспериментах приняли участие как волонтеры из популярных социальных сетей, так и пользователи бирж краудсорсинга (за вознаграждение в форме микроплатежей).

Modeling and Analysis of Information Systems / Моделирование и анализ информационных систем (МАИС)

Summary of Tutorials at The Web Conference 2021

Author: Albert J.
Altunina O.
Aref S.
Aspert N.
Avram T.M.
Baidakova D.
Benhalloum A.
Bhagat S.
Bian Y.
Chen J.
Cheng H.
Courdier E.
Couto F.M.
Cvetinovic D.
Defferrard M.
Diesner J.
Dinh L.
Drutsa A.
Dy J.
Fakhrei S.
Faloutsos C.
Fan W.
Fan Y.
Feng F.
Ferng C.-S.
Geng X.
Gessert F.
Goldenberg D.
Gong M.
Gopalan A.
Groth P.
He X.
Heydon A.
Howell R.
Huang J.
Huang W.
Ilharco Magalhaes C.
Ioannidis S.
Jeunen O.
Jiang D.
Jose J.
Juan D.-C.
Kenthapadi K.
Koenig M.
Laurent F.
Lisena P.
Lu C.-T.
Meroño-Peñuela A.
Mishra S.
Miz V.
Mohanty S.
Müller M.
Packer B.
Pei J.
Pham P.
Popov N.
Rezapour R.
Ricaud B.
Ritter N.
Rohde D.
Rong Y.
Sakhi O.
Sameki M.
Scheller C.
Schneider M.
Schraner Y.
Sephus N.
Shou L.
Succo S.
Sun F.
Tang J.
Teinemaa I.
Tsinadze L.
Ustalov D.
Vasile F.
Wang X.
Wang Y.
West R.
Wingerath W.
Wollmer B.
Xu T.
Yıldız İ.
Yin D.
Yu G.
Zhao X.
Zhou X.
Zitnik M.
Çelebi O.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2021
Field of study

International Migration, Integration and Social Cohesion online publications

Summary of Tutorials at The Web Conference 2021

Author: Albert J.
Altunina O.
Aref S.
Aspert N.
Avram T.M.
Baidakova D.
Benhalloum A.
Bhagat S.
Bian Y.
Chen J.
Cheng H.
Courdier E.
Couto F.M.
Cvetinovic D.
Defferrard M.
Diesner J.
Dinh L.
Drutsa A.
Dy J.
Fakhrei S.
Faloutsos C.
Fan W.
Fan Y.
Feng F.
Ferng C.-S.
Geng X.
Gessert F.
Goldenberg D.
Gong M.
Gopalan A.
Groth P.
He X.
Heydon A.
Howell R.
Huang J.
Huang W.
Ilharco Magalhaes C.
Ioannidis S.
Jeunen O.
Jiang D.
Jose J.
Juan D.-C.
Kenthapadi K.
Koenig M.
Laurent F.
Lisena P.
Lu C.-T.
Meroño-Peñuela A.
Mishra S.
Miz V.
Mohanty S.
Müller M.
Packer B.
Pei J.
Pham P.
Popov N.
Rezapour R.
Ricaud B.
Ritter N.
Rohde D.
Rong Y.
Sakhi O.
Sameki M.
Scheller C.
Schneider M.
Schraner Y.
Sephus N.
Shou L.
Succo S.
Sun F.
Tang J.
Teinemaa I.
Tsinadze L.
Ustalov D.
Vasile F.
Wang X.
Wang Y.
West R.
Wingerath W.
Wollmer B.
Xu T.
Yıldız İ.
Yin D.
Yu G.
Zhao X.
Zhou X.
Zitnik M.
Çelebi O.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2021
Field of study

International Migration, Integration and Social Cohesion online publications

UvA-DARE

What can crowd computing do for the next generation of AI systems?

Author: Baidakova D.
Casati F.
Drutsa A.
Gadiraju Ujwal
Ustalov D.
Yang J.
Publication venue: CEUR
Publication date: 01/01/2020
Field of study

The unprecedented rise in the adoption of artificial intelligence techniques and automation in many contexts is concomitant with shortcomings of such technology with respect to robustness, interpretability, usability, and trustworthiness. Crowd computing offers a viable means to leverage human intelligence at scale for data creation, enrichment, and interpretation, demonstrating a great potential to improve the performance of AI systems and increase the adoption of AI in general. Existing research and practice has mainly focused on leveraging crowd computing for training data creation. However, this perspective is rather limiting in terms of how AI can fully benefit from crowd computing. In this vision paper, we identify opportunities in crowd computing to propel better AI technology, and argue that to make such progress, fundamental problems need to be tackled from both computation and interaction standpoints. We discuss important research questions in both these themes, with an aim to shed light on the research needed to pave a future where humans and AI can work together seamlessly, while benefiting from each other.</p

Improving hypernymy extraction with distributional semantic classes

Author: Biemann C.
Faralli S.
Panchenko A.
Ponzetto S. P.
Ustalov D.
Publication venue: European Language Resources Association (ELRA)
Publication date: 01/01/2019
Field of study

In this paper, we show how distributionally-induced semantic classes can be helpful for extracting hypernyms. We present methods for inducing sense-aware semantic classes using distributional semantics and using these induced semantic classes for filtering noisy hypernymy relations. Denoising of hypernyms is performed by labeling each semantic class with its hypernyms. On the one hand, this allows us to filter out wrong extractions using the global structure of distributionally similar senses. On the other hand, we infer missing hypernyms via label propagation to cluster terms. We conduct a large-scale crowdsourcing study showing that processing of automatically extracted hypernyms using our approach improves the quality of the hypernymy extraction in terms of both precision and recall. Furthermore, we show the utility of our method in the domain taxonomy induction task, achieving the state-of-the-art results on a SemEval'16 task on taxonomy induction

Archivio della ricerca- Università di Roma La Sapienza

Unsupervised, knowledge-free, and interpretable word sense disambiguation

Author: Biemann C.
Faralli S.
Marten F.
Panchenko A.
Ponzetto S. P.
Ruppert E.
Ustalov D.
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2017
Field of study

Interpretability of a predictive model is a powerful feature that gains the trust of users in the correctness of the predictions. In word sense disambiguation (WSD), knowledge-based systems tend to be much more interpretable than knowledge-free counterparts as they rely on the wealth of manually-encoded elements representing word senses, such as hypernyms, usage examples, and images. We present a WSD system that bridges the gap between these two so far disconnected groups of methods. Namely, our system, providing access to several state-of-the-art WSD models, aims to be interpretable as a knowledge-based system while it remains completely unsupervised and knowledge-free. The presented tool features a Web interface for all-word disambiguation of texts that makes the sense predictions human readable by providing interpretable word sense inventories, sense representations, and disambiguation results. We provide a public API, enabling seamless integration

Archivio della ricerca- Università di Roma La Sapienza

Towards Automatic Text Adaptation in Russian

Author: A Panchenko
A Sokirco
D M Blei
D Ustalov
G Vera
I Oborneva
J Nivre
M Baroni
M Nevdah
M S Mackovskiy
N Andriushina
N Karpov
N Karpov
N Verhelst
Nikolai Karpov
P D Turney
R Chandrasekar
T L Franccois
Tuomo Korenius
V Bocharov
V Trishin
Vera Sibirtseva
Publication venue: 'Elsevier BV'
Publication date: 01/01/2014
Field of study

Crossref