A systematic literature review on Wikidata
This review examines the current status of research on Wikidata, in particular articles that either describe applications of Wikidata or provide empirical evidence, in order to uncover the topics of interest, the fields benefiting from its applications, and the researchers and institutions leading the work.
Multiple Texts as a Limiting Factor in Online Learning: Quantifying (Dis-)similarities of Knowledge Networks across Languages
We test the hypothesis that the extent to which one obtains information on a
given topic through Wikipedia depends on the language in which it is consulted.
Controlling for the size factor, we investigate this hypothesis for 25
subject areas. Since Wikipedia is a central part of the web-based information
landscape, such a dependence would indicate a language-related, linguistic bias. The article
therefore deals with the question of whether Wikipedia exhibits this kind of
linguistic relativity or not. From the perspective of educational science, the
article develops a computational model of the information landscape from which
multiple texts are drawn as typical input of web-based reading. For this
purpose, it develops a hybrid model of intra- and intertextual similarity of
different parts of the information landscape and tests this model on the
example of 35 languages and corresponding Wikipedias. In this way the article
builds a bridge between reading research, educational science, Wikipedia
research, and computational linguistics.
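The abstract does not spell out how the hybrid intra- and intertextual similarity model is computed. The sketch below only illustrates the general idea: comparing similarity within one language edition's article (intratextual) against similarity between aligned articles in two editions (intertextual), using a multilingual sentence-embedding model. The embedding model, the toy section texts, and the aggregation are illustrative assumptions, not the authors' pipeline.

```python
# Illustrative sketch only: compares intra- vs. intertextual similarity of
# Wikipedia article texts using multilingual sentence embeddings. The model
# name and the toy texts are assumptions, not the paper's actual setup.
from itertools import combinations

import numpy as np
from sentence_transformers import SentenceTransformer

# A multilingual encoder so that texts in different languages share one space.
model = SentenceTransformer("paraphrase-multilingual-MiniLM-L12-v2")

def cosine(u, v):
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

def mean_pairwise_similarity(texts):
    """Average cosine similarity over all pairs of texts (intratextual cohesion)."""
    emb = model.encode(texts)
    sims = [cosine(a, b) for a, b in combinations(emb, 2)]
    return sum(sims) / len(sims)

# Hypothetical section texts of the "same" article in two language editions.
sections_en = ["Photosynthesis converts light into chemical energy.",
               "Chlorophyll absorbs mostly blue and red light."]
sections_de = ["Photosynthese wandelt Licht in chemische Energie um.",
               "Chlorophyll absorbiert vor allem blaues und rotes Licht."]

# Intratextual similarity: cohesion within one language edition.
intra_en = mean_pairwise_similarity(sections_en)

# Intertextual similarity: agreement between the two editions, section by section.
inter = np.mean([cosine(a, b) for a, b in
                 zip(model.encode(sections_en), model.encode(sections_de))])

print(f"intratextual (en): {intra_en:.3f}  intertextual (en-de): {inter:.3f}")
```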
Analogy Training Multilingual Encoders
Language encoders encode words and phrases in ways that capture their local semantic relatedness, but are known to be globally inconsistent. Global inconsistency can seemingly be corrected for, in part, by leveraging signals from knowledge bases, but previous results are partial and limited to monolingual English encoders. We extract a large-scale multilingual, multi-word analogy dataset from Wikidata for diagnosing and correcting global inconsistencies, and implement a four-way Siamese BERT architecture for grounding multilingual BERT (mBERT) in Wikidata through analogy training. We show that analogy training not only improves the global consistency of mBERT and the isomorphism of language-specific subspaces, but also leads to significant gains on downstream tasks such as bilingual dictionary induction and sentence retrieval.
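The abstract names a four-way Siamese BERT architecture but not its training objective. The following minimal sketch assumes a simple vector-offset loss over mean-pooled mBERT embeddings (the offset a − b should match c − d for an analogy a:b :: c:d); the loss, pooling, and example analogy are assumptions for illustration, not the paper's implementation.

```python
# Minimal sketch (not the paper's code): one shared mBERT encoder embeds all
# four analogy terms a:b :: c:d, and a vector-offset loss pulls (a - b)
# towards (c - d). Sharing the encoder across the four inputs is the
# "four-way Siamese" part.
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-multilingual-cased")
encoder = AutoModel.from_pretrained("bert-base-multilingual-cased")

def embed(texts):
    """Mean-pool the last hidden states over non-padding tokens."""
    batch = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
    out = encoder(**batch).last_hidden_state              # (B, T, H)
    mask = batch["attention_mask"].unsqueeze(-1).float()  # (B, T, 1)
    return (out * mask).sum(dim=1) / mask.sum(dim=1)      # (B, H)

def analogy_loss(a, b, c, d):
    """Assumed objective: the offsets of the two analogy pairs should agree."""
    ea, eb, ec, ed = (embed(x) for x in (a, b, c, d))
    return torch.nn.functional.mse_loss(ea - eb, ec - ed)

# Hypothetical Wikidata-style analogy: (Paris, France) :: (Berlin, Germany).
loss = analogy_loss(["Paris"], ["France"], ["Berlin"], ["Germany"])
loss.backward()  # gradients flow into the single shared (Siamese) encoder
print(float(loss))
```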
Identifying and Consolidating Knowledge Engineering Requirements
Knowledge engineering is the process of creating and maintaining
knowledge-producing systems. Throughout the history of computer science and AI,
knowledge engineering workflows have been widely used because high-quality
knowledge is assumed to be crucial for reliable intelligent agents. However,
the landscape of knowledge engineering has changed, presenting four challenges:
unaddressed stakeholder requirements, mismatched technologies, adoption
barriers for new organizations, and misalignment with software engineering
practices. In this paper, we propose to address these challenges by developing
a reference architecture using a mainstream software methodology. By studying
the requirements of different stakeholders and eras, we identify 23 essential
quality attributes for evaluating reference architectures. We assess three
candidate architectures from recent literature based on these attributes.
Finally, we discuss the next steps towards a comprehensive reference
architecture, including prioritizing quality attributes, integrating components
with complementary strengths, and supporting missing socio-technical
requirements. As this endeavor requires a collaborative effort, we invite all
knowledge engineering researchers and practitioners to join us.
Wikipedia Citations: A comprehensive dataset of citations with identifiers extracted from English Wikipedia
Wikipedia's content is based on reliable, published sources. To
date, relatively little is known about which sources Wikipedia relies on, in
part because extracting citations and identifying cited sources is challenging.
To close this gap, we release Wikipedia Citations, a comprehensive dataset of
citations extracted from Wikipedia. A total of 29.3M citations were extracted
from 6.1M English Wikipedia articles as of May 2020, and classified as referring to
books, journal articles, or Web content. We were thus able to extract 4.0M
citations to scholarly publications with known identifiers -- including DOI,
PMC, PMID, and ISBN -- and further equip an extra 261K citations with DOIs from
Crossref. As a result, we find that 6.7% of Wikipedia articles cite at least
one journal article with an associated DOI, and that Wikipedia cites just 2% of
all articles with a DOI currently indexed in the Web of Science. We release our
code to allow the community to build upon our work and update the dataset in
the future.
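The authors release their own extraction code alongside the dataset; the sketch below is only a rough illustration of the kind of pipeline the abstract describes, parsing citation templates out of article wikitext with mwparserfromhell and, where no DOI is present, querying the Crossref API by title. Template and parameter names vary across Wikipedia, so the specifics here are assumptions.

```python
# Rough illustration (not the released pipeline): pull citation templates out
# of an article's wikitext and recover DOIs, falling back to a Crossref lookup.
import mwparserfromhell
import requests

def extract_citations(wikitext):
    """Yield (template_name, doi_or_None, title_or_None) for each citation template."""
    for tpl in mwparserfromhell.parse(wikitext).filter_templates():
        name = str(tpl.name).strip().lower()
        if not name.startswith("cite"):          # e.g. "cite journal", "cite book"
            continue
        doi = str(tpl.get("doi").value).strip() if tpl.has("doi") else None
        title = str(tpl.get("title").value).strip() if tpl.has("title") else None
        yield name, doi, title

def lookup_doi(title):
    """Best-effort Crossref search by bibliographic title."""
    resp = requests.get("https://api.crossref.org/works",
                        params={"query.bibliographic": title, "rows": 1},
                        timeout=10)
    items = resp.json().get("message", {}).get("items", [])
    return items[0].get("DOI") if items else None

wikitext = "{{cite journal |title=Example study |journal=Example |doi=10.1000/xyz}}"
for name, doi, title in extract_citations(wikitext):
    print(name, doi or (lookup_doi(title) if title else None))
```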
- …