Search CORE

7 research outputs found

Towards Name Disambiguation: Relational, Streaming, and Privacy-Preserving Text Data

Author: Zhang Baichuan
Publication venue: 'Purdue University (bepress)'
Publication date: 01/12/2017
Field of study

In the real world, our DNA is unique but many people share names. This phenomenon often causes erroneous aggregation of documents of multiple persons who are namesakes of one another. Such mistakes deteriorate the performance of document retrieval, web search, and more seriously, cause improper attribution of credit or blame in digital forensics. To resolve this issue, the name disambiguation task 1 is designed to partition the documents associated with a name reference such that each partition contains documents pertaining to a unique real-life person. Existing algorithms for this task mainly suffer from the following drawbacks. First, the majority of existing solutions substantially rely on feature engineering, such as biographical feature extraction, or construction of auxiliary features from Wikipedia. However, for many scenarios, such features may be costly to obtain or unavailable in privacy sensitive domains. Instead we solve the name disambiguation task in restricted setting by leveraging only the relational data in the form of anonymized graphs. Second, most of the existing works for this task operate in a batch mode, where all records to be disambiguated are initially available to the algorithm. However, more realistic settings require that the name disambiguation task should be performed in an online streaming fashion in order to identify records of new ambiguous entities having no preexisting records. Finally, we investigate the potential disclosure risk of textual features used in name disambiguation and propose several algorithms to tackle the task in a privacy-aware scenario. In summary, in this dissertation, we present a number of novel approaches to address name disambiguation tasks from the above three aspects independently, namely relational, streaming, and privacy preserving textual data

Purdue E-Pubs

B!SON: A Tool for Open Access Journal Recommendation

Author: Entrup Elias
Eppelin Anita
Ewerth Ralph
Hartwig Josephine
Hoppe Anett
Tullney Marco
Wohlgemuth Michael
Publication venue: Heidelberg : Springer
Publication date: 01/01/2022
Field of study

Finding a suitable open access journal to publish scientific work is a complex task: Researchers have to navigate a constantly growing number of journals, institutional agreements with publishers, funders’ conditions and the risk of Predatory Publishers. To help with these challenges, we introduce a web-based journal recommendation system called B!SON. It is developed based on a systematic requirements analysis, built on open data, gives publisher-independent recommendations and works across domains. It suggests open access journals based on title, abstract and references provided by the user. The recommendation quality has been evaluated using a large test set of 10,000 articles. Development by two German scientific libraries ensures the longevity of the project

Repositorium für Naturwissenschaften und Technik

Leveraging co-authorship and biographical information for author ambiguity resolution in DBLP

Author: Hazimeh Hussein
Khaled Omar Abou
Makki Jawad
Noureddine Hassan
Tscherrig Julien
Youness Iman
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 21/10/2022
Field of study

Many authors can share the same name and this constitutes a serious problem that affects the relevancy of retrieval results and constitutes our motivation of finding such approach to cover this issue at the author names entity level. Solving such a problem may return with positive gain at the level of document retrieval, web search and the quality of data. This entity resolution task can be tackled as an unsupervised problem, where there are set of features that can be employed for the resolution job, or as supervised problem to compute the similarities among two citations and then classify if they are the same or not. Recent approaches usually utilize features such as: co-author, venue, topic similarity, affiliations and title of publications to deal with author ambiguity. In this paper, three attributes are used to treat this problem sequentially. The co-authorship firstly which is a well-known attribute, and then the topic and affiliation extracted from biographies, which can be found inside the publication, and this is our novelty frame in this paper

Hes-so: ArODES Open Archive (University of Applied Sciences and Arts Western Switzerland / Haute école spécialisée de Suisse occidentale / FH Westschweiz)

Study on open science: The general state of the play in Open Science principles and practices at European life sciences institutes

Author: Foltynová Pavla
Ornerová Kateřina
Publication venue: International Society for Scientometrics and Informetrics
Publication date: 01/01/2019
Field of study

Nowadays, open science is a hot topic on all levels and also is one of the priorities of the European Research Area. Components that are commonly associated with open science are open access, open data, open methodology, open source, open peer review, open science policies and citizen science. Open science may a great potential to connect and influence the practices of researchers, funding institutions and the public. In this paper, we evaluate the level of openness based on public surveys at four European life sciences institute

Univerzitní repozitář Masarykovy univerzity

Proceedings of the Eighth Italian Conference on Computational Linguistics CliC-it 2021

Author
Publication venue: 'OpenEdition'
Publication date: 15/12/2022
Field of study

The eighth edition of the Italian Conference on Computational Linguistics (CLiC-it 2021) was held at Università degli Studi di Milano-Bicocca from 26th to 28th January 2022. After the edition of 2020, which was held in fully virtual mode due to the health emergency related to Covid-19, CLiC-it 2021 represented the first moment for the Italian research community of Computational Linguistics to meet in person after more than one year of full/partial lockdown

Directory of Open Access Books (DOAB)

Social informatics

Author: Bing Tian DAI
DIAS Gael
DING Ying
Ee-peng LIM
FLANAGIN Andrew J.
JATOWT Adam
MIURA Asako
TANAKA Katsumi
TEZUKA Taro
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/11/2013
Field of study

5th International Conference, SocInfo 2013, Kyoto, Japan, November 25-27, 2013, Proceedings</p

Institutional Knowledge at Singapore Management University

Proceedings of DRS Learn X Design 2019: Insider Knowledge

Author: Börekçi Naz Ayşe Güzide Z.
Jones Derek
Korkut Fatma
Özgen Koçyıldırım Dalsu
Publication venue: METU Department of Industrial Design / Design Research Society, UK
Publication date: 01/11/2019
Field of study

OpenMETU (Middle East Technical University)