Search CORE

159 research outputs found

Metadata Extraction from References of Different Styles

Author: I Ayansola Promise
Ibikunle Olatunde
Madamidola Olugbenga A
T Adeboje Olawale
Publication venue: 'International Journal of Computer Engineering and Applications'
Publication date: 04/05/2021

Metadata extraction is the process of describing the extrinsic and intrinsic qualities of a resource such as a document, image, or video, including obtaining data from references. References form an essential part of electronic scholarly publications. A reference acknowledges individuals for the creative and intellectual works that one has used in one's research. It can also be used to locate particular sources and to combat plagiarism. A reference style dictates the information necessary for a reference and how that information is ordered. Accurate and automatic reference metadata generation provides scalability, interoperability and usability for the digital libraries of both public and private institutions and their collections. Accurate reference metadata extraction is an intriguing task for researchers who want to collect data on scientific publications; therefore, this research work proposes metadata extraction from references of different styles using regular expressions. The work accurately extracts metadata such as author, article title, volume, year of publication and institution from references in different styles, limited to six referencing styles.

International Journal of Computer (IJC - Global Society of Scientific Research and Researchers, GSSRR)
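
The regular-expression approach named in the abstract above can be illustrated with a minimal sketch. This is not the authors' implementation: the pattern `APA_LIKE`, the function `extract_reference_metadata`, and the sample reference are all hypothetical, and the pattern covers only one APA-like layout rather than the six styles the paper reports.

```python
import re
from typing import Optional

# Hypothetical pattern for a single APA-like layout (an assumption, not from the paper):
#   "Authors (Year). Title. Venue, Volume(Issue), Pages."
APA_LIKE = re.compile(
    r"^(?P<authors>.+?)\s+\((?P<year>\d{4})\)\.\s+"  # authors, then "(2020). "
    r"(?P<title>.+?)\.\s+"                           # title up to the next ". "
    r"(?P<venue>[^,]+),\s*(?P<volume>\d+)"           # venue, then the volume number
)


def extract_reference_metadata(reference: str) -> Optional[dict]:
    """Return the named fields of one reference string, or None if it does not match."""
    match = APA_LIKE.match(reference.strip())
    return match.groupdict() if match else None


# Made-up reference used only to exercise the pattern.
example = (
    "Smith, J., & Lee, K. (2020). Parsing citations with patterns. "
    "Journal of Examples, 12(3), 45-67."
)
result = extract_reference_metadata(example)
print(result)
```

Supporting several reference styles would likely mean one such pattern per style, tried in turn until one matches. Titles that contain a full stop would defeat this simple pattern.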

Searching and Visualization of References in Research Documents

Author: Annisa Annisa
Nadirman Firnas
Ridha Ahmad
Publication venue: 'Universitas Ahmad Dahlan'
Publication date: 01/06/2014

This research aims to develop a module for information retrieval that can trace references from bibliography entries of research documents, specifically those based on the writing guidelines of Bogor Agricultural University (IPB). A total of 242 research documents in PDF from the Department of Computer Science IPB were used to generate parsing patterns to extract the bibliography entries. With modified ParaTools, automatic extraction of bibliography entries was performed on text files generated from the PDF files. The entries are stored in a database that is used to visualize author relationships as graphs. This module is supplemented by an information retrieval system based on the Sphinx search system and also provides information on authors’ publications and citations. Evaluation showed that (1) bibliography entry extraction missed only 5.37% of bibliography entries, owing to incorrect bibliography formatting, (2) 91.54% of bibliography entry attributes could be identified correctly, and (3) 90.31% of entries were successfully connected to other documents.

TELKOMNIKA (Telecommunication Computing Electronics and Control)

AI EDAM special issue: advances in implemented shape grammars: solutions and applications

Author: Economou A.
Eloy S.
Pauwels P.
Publication venue: 'Cambridge University Press (CUP)'
Publication date: 01/01/2018

This paper introduces the special issue “Advances in Implemented Shape Grammars: Solutions and Applications” and frames the topic of computer implementations of shape grammars, with both a theoretical and an applied focus. The special issue focuses on the current state of the art in computer implementations of shape grammars and opens a discussion about how those systems can evolve in the coming years so that they can be used in real-life design scenarios. This paper presents a brief state of the art of shape grammar implementations and an overview of the papers included in the special issue, categorized under technical design, interpreters and interface design, and use cases. The paper ends with a comprehensive outlook on the future of shape grammar implementations.

Repositório Institucional do ISCTE-IUL

Embedding a Creativity Support Tool within Computer Graphics Research

Author: Abgaz Yalemisew
Hurley Donny
O'Donoghue Diarmuid
Ronzano Francesco
Saggion Horacio
Smorodinnikov Dmitry
Publication date: 01/01/2016

We describe the Dr Inventor creativity support tool, which aims to support and even enhance the creativity of active research scientists by discovering unnoticed analogical similarities between publications. The tool combines text processing, lexical analysis and computational cognitive modelling to find comparisons with the greatest potential for a creative impact on the system users. A multi-year corpus of publications is used to drive the creativity of the system, with a central graph matching algorithm being adapted to identify the best analogy between any pair of papers. Dr Inventor has been developed for use by computer graphics researchers, with a particular focus on publications from the SIGGRAPH conference series, and it uses this context in three main ways. Firstly, the pragmatic context of creativity support requires the identification of comparisons that are unlike pre-existing information. Secondly, the suggested inferences are assessed for quality within the context of a corpus of graphics publications. Finally, expert users from this discipline were asked to identify the qualities of greatest concern to them, which then guided the subsequent evaluation task.

MURAL - Maynooth University Research Archive Library

B!SON: A Tool for Open Access Journal Recommendation

Author: Entrup Elias
Eppelin Anita
Ewerth Ralph
Hartwig Josephine
Hoppe Anett
Tullney Marco
Wohlgemuth Michael
Publication venue: Heidelberg : Springer
Publication date: 01/01/2022

Finding a suitable open access journal in which to publish scientific work is a complex task: researchers have to navigate a constantly growing number of journals, institutional agreements with publishers, funders’ conditions and the risk of predatory publishers. To help with these challenges, we introduce a web-based journal recommendation system called B!SON. It is developed based on a systematic requirements analysis, built on open data, gives publisher-independent recommendations and works across domains. It suggests open access journals based on the title, abstract and references provided by the user. The recommendation quality has been evaluated using a large test set of 10,000 articles. Development by two German scientific libraries ensures the longevity of the project.

Repositorium für Naturwissenschaften und Technik

Language Models for Citation Classification

Author: Nambanoor Kunnath Suchetha
Publication date: 31/01/2024

Authors reference academic works for a variety of reasons. As a result, not all citations in a research article have the same purpose. The need to understand and distinguish these citation purposes led to the development of automated approaches that consider semantic cues in the form of the context surrounding the citations. Identifying the semantic aspects of citations has proven valuable in various applications including research assessment, information retrieval, document summarisation, and more. While automated citation classification has been in progress since the early 2000s, current efforts to determine citation types based on their contexts remain largely domain-specific. Besides, there is a lack of standard benchmarks for evaluating models for citation classification. Extracting valuable metadata related to the reason behind citation in scientific articles, particularly across multiple domains, is laborious and researchers still lack consensus on what should be the optimal context size for effective detection of citation function. The current methods heavily rely on the amount of annotated data used for training, making them data-centric. The emergence of self-supervised language models, which efficiently learn contextual relationships from vast unannotated datasets, has brought about substantial changes in the realm of Natural Language Processing in recent years. Despite these advancements, the few-shot predictive capability of the language models remains under-utilised in this field. This thesis addresses the above shortcomings of citation classification. We systematically and comprehensively review the existing methodologies used by the previous works and identify the research gap and the potential future works. This meta-analysis forms the foundation for the research problems addressed in Chapters 3, 4, 5 and 6. 
Initially, we introduce a novel benchmark in the form of an open shared task competition for multi-disciplinary citation classification in Chapter 3. The methods submitted to this shared task highlight the superiority of deep learning-based approaches and hint at the importance of incorporating additional context to enhance the performance of citation classification models. Secondly, in Chapter 4, we create a new open access feature-enriched multi-disciplinary citation classification dataset to overcome the challenges associated with extracting metadata from both citing and cited articles. The feature extraction process, which draws on multiple sources, and the missing metadata values indicate the complexities involved in extracting features for a heterogeneous dataset. In Chapter 5, we assess domain-specific and multi-disciplinary datasets by fine-tuning pre-trained scientific language models on them, specifically exploring various fixed citation context windows. We introduce a new method for automatically extracting dynamic context windows in an unsupervised manner. Both sets of experiments emphasise the significance of additional context in citation context classification. Moreover, the experimental results also show the domain dependence of the citation context window, providing evidence for the benefit of extracting context dynamically. Lastly, Chapter 6 presents novel prompting strategies for scientific and general-purpose language models to reduce the dependence on labelled citation classification datasets. The analysis of model performance under zero-shot and few-shot settings reveals the effectiveness of large language models with minimal supervision, particularly when employing the newly proposed dynamic citation context-based prompting strategy.

Open Research Online (The Open University)
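
The notion of a fixed citation context window mentioned in the thesis abstract above can be illustrated with a small sketch. It is an assumption-laden illustration, not the thesis's method: the helper names `split_sentences` and `fixed_context_window`, the naive sentence splitter, and the sample text are all hypothetical, and the dynamic, unsupervised window extraction the thesis proposes is not reproduced here.

```python
import re
from typing import List


def split_sentences(text: str) -> List[str]:
    """Naive splitter (an assumption): break after '.', '!' or '?' followed by whitespace."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]


def fixed_context_window(text: str, marker: str, before: int = 1, after: int = 1) -> List[str]:
    """Return the first sentence containing `marker`, plus `before`/`after` neighbours."""
    sentences = split_sentences(text)
    for i, sentence in enumerate(sentences):
        if marker in sentence:
            start = max(0, i - before)  # clamp at the start of the text
            return sentences[start:i + after + 1]
    return []  # marker not found


# Made-up passage used only to exercise the helper.
passage = (
    "Prior work is broad. We follow [1] closely. "
    "Results improve. More text here."
)
citing_only = fixed_context_window(passage, "[1]", before=0, after=0)
one_each_side = fixed_context_window(passage, "[1]", before=1, after=1)
print(citing_only)
print(one_each_side)
```

Varying `before` and `after` gives the different fixed window sizes that a classifier could be trained on. A production system would need a sentence splitter that handles abbreviations and reference markers containing full stops.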

Information retrieval and text mining technologies for chemistry

Author: Alfonso Valencia
Anália Lourenço
Julen Oyarzabal
Martin Krallinger
Obdulia Rabal
Publication venue: 'American Chemical Society (ACS)'
Publication date: 01/01/2017

Efficient access to chemical information contained in scientific literature, patents, technical reports, or the web is a pressing need shared by researchers and patent attorneys from different chemical disciplines. Retrieval of important chemical information in most cases starts with finding relevant documents for a particular chemical compound or family. Targeted retrieval of chemical documents is closely connected to the automatic recognition of chemical entities in the text, which commonly involves the extraction of the entire list of chemicals mentioned in a document, including any associated information. In this Review, we provide a comprehensive and in-depth description of fundamental concepts, technical implementations, and current technologies for meeting these information demands. A strong focus is placed on community challenges addressing systems performance, more particularly the CHEMDNER and CHEMDNER patents tasks of BioCreative IV and V, respectively. Considering the growing interest in the construction of automatically annotated chemical knowledge bases that integrate chemical information and biological data, cheminformatics approaches for mapping the extracted chemical names into chemical structures and their subsequent annotation, together with text mining applications for linking chemistry with biological information, are also presented. Finally, future trends and current challenges are highlighted as a roadmap proposal for research in this emerging field. A.V. and M.K. acknowledge funding from the European Community’s Horizon 2020 Program (project reference: 654021 - OpenMinted). M.K. additionally acknowledges the Encomienda MINETAD-CNIO as part of the Plan for the Advancement of Language Technology. O.R. and J.O. thank the Foundation for Applied Medical Research (FIMA), University of Navarra (Pamplona, Spain).
This work was partially funded by Consellería de Cultura, Educación e Ordenación Universitaria (Xunta de Galicia), and FEDER (European Union), and the Portuguese Foundation for Science and Technology (FCT) under the scope of the strategic funding of UID/BIO/04469/2013 unit and COMPETE 2020 (POCI-01-0145-FEDER-006684). We thank Iñigo García-Yoldi for useful feedback and discussions during the preparation of the manuscript.

Universidade do Minho: RepositoriUM

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

UPCommons. Portal del coneixement obert de la UPC