Search CORE

31 research outputs found

Learning Residual Finite-State Automata Using Observation Tables

Author: Anna Kasprzik
Giovanni Pighizzini
Ian McQuillan
Publication venue: 'Open Publishing Association'
Publication date: 01/08/2010
Field of study

We define a two-step learner for RFSAs based on an observation table by using an algorithm for minimal DFAs to build a table for the reversal of the language in question and showing that we can derive the minimal RFSA from it after some simple modifications. We compare the algorithm to two other table-based ones of which one (by Bollig et al. 2009) infers a RFSA directly, and the other is another two-step learner proposed by the author. We focus on the criterion of query complexity.Comment: In Proceedings DCFS 2010, arXiv:1008.127

arXiv.org e-Print Archive

Crossref

Directory of Open Access Journals

Recommended from our members

Knowledge organization systems in mathematics and in libraries

Author: Kasprzik Anna
Publication venue: Zenodo
Publication date: 01/01/2017
Field of study

Based on the project activities planned in the context of the Specialized Information Service for Mathematics (TIB Hannover, FAU Erlangen, L3S, SUB Göttingen) we give an overview over the history and interplay of subject cataloguing in libraries, the development of computerized methods for metadata processing and the rise of the Semantic Web. We survey various knowledge organization systems such as the Mathematics Subject Classification, the German Authority File, the clustering International Authority File VIAF, and lexical databases such as WordNet and their potential use for mathematics in education and research. We briefly address the difference between thesauri and ontologies and the relations they typically contain from a linguistic perspective. We will then discuss with the audience how the current efforts to represent and handle mathematical theories as semantic objects can help deflect the decline of semantic resource annotation in libraries that has been predicted by some due to the existence of highly performant retrieval algorithms (based on statistical, neuronal, or other big data methods). We will also explore the potential characteristics of a fruitful symbiosis between carefully cultivated kernels of semantic structure and automated methods in order to scale those structures up to the level that is necessary in order to cope with the amounts of digital data found in libraries and in (mathematical) research (e.g., in simulations) today

Repositorium für Naturwissenschaften und Technik

Recommended from our members

Methoden und Denkweisen aus der Theoretischen Informatik in Bibliotheken

Author: Kasprzik Anna
Publication venue: Zenodo
Publication date: 01/01/2017
Field of study

Praxisbericht einer Formalsprachlerin aus dem Arbeitsalltag in der Forschungs- und Entwicklungsabteilung einer Bibliothek

Repositorium für Naturwissenschaften und Technik

Knowledge organization systems in mathematics and in libraries

Author: Kasprzik Anna
Publication venue: Zenodo
Publication date: 01/01/2017
Field of study

ZENODO

Repositorium für Naturwissenschaften und Technik

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Community-getriebene Entwicklung von Sacherschließungswerkzeugen rund um die GND : Von der Normdatei zum Thesaurus, und vom Thesaurus zur Ontologie – strukturelle Stolpersteine

Author: Kasprzik Anna
Publication venue
Publication date: 18/06/2018
Field of study

KITopen

Die Bereinigung der Systematik an der Bibliothek der Universität Konstanz

Author: Kasprzik Anna
Publication venue: Bibliothek aktuell
Publication date: 01/12/2013
Field of study

Universität Konstanz: Bibliothek aktuell (E-Journal)

Recommended from our members

Projektvorhaben TEASER - Entwicklung einer Plattform für die kollaborative Thesauruspflege und -verknüpfung am Beispiel „Industrie 4.0“

Author: Kasprzik Anna
Publication venue: Hannover : Technische Informationsbibliothek
Publication date: 01/01/2017
Field of study

[no abstract available

Repositorium für Naturwissenschaften und Technik

ÖBs und WBs in Turku und Helsinki

Author: Kasprzik Anna
Publication venue: Bibliothek aktuell
Publication date: 01/12/2013
Field of study

Universität Konstanz: Bibliothek aktuell (E-Journal)

Recommended from our members

The Research Core Dataset (KDSF) in the Linked Data context

Author: Hauschke Christian
Kasprzik Anna
Walther Tatiana
Publication venue: Amsterdam [u.a.] : Elsevier
Publication date: 01/01/2019
Field of study

This paper describes our efforts to implement the Research Core Dataset (“Kerndatensatz Forschung”; KDSF) as an ontology in VIVO. KDSF is used in VIVO to record the required metadata on incoming data and to produce reports as an output. While both processes need an elaborate adaptation of the KDSF specification, this paper focusses on the adaptation of the KDSF basic data model for recording data in VIVO. In this context, the VIVO and KDSF ontologies were compared with respect to domain, syntax, structure, and granularity in order to identify correspondences and mismatches. To produce an alignment, different matching approaches have been applied. Furthermore, we made necessary modifications and extensions on KDSF classes and properties

Repositorium für Naturwissenschaften und Technik

Aufbau eines produktiven Dienstes für die automatisierte Inhaltserschließung an der ZBW: Ein Status- und Erfahrungsbericht

Author: Kasprzik Anna
Publication venue: o-bib. Das offene Bibliotheksjournal / Herausgeber VDB
Publication date: 28/02/2023
Field of study

Die ZBW – Leibniz-Informationszentrum Wirtschaft betreibt seit 2016 eigene angewandte Forschung im Bereich Machine Learning mit dem Zweck, praktikable Lösungen für eine automatisierte oder maschinell unterstützte Inhaltserschließung zu entwickeln. 2020 begann ein Team an der ZBW die Konzeption und Implementierung einer Softwarearchitektur, die es ermöglichte, diese prototypischen Lösungen in einen produktiven Dienst zu überführen und mit den bestehenden Nachweis- und Informationssystemen zu verzahnen. Sowohl die angewandte Forschung als auch die für dieses Vorhaben („AutoSE“) notwendige Softwareentwicklung sind direkt im Bibliotheksbereich der ZBW angesiedelt, werden kontinuierlich anhand des State of the Art vorangetrieben und profitieren von einem engen Austausch mit den Verantwortlichen für die intellektuelle Inhaltserschließung. Dieser Beitrag zeigt die Meilensteine auf, die das AutoSE-Team in zwei Jahren in Bezug auf den Aufbau und die Integration der Software erreicht hat, und skizziert, welche bis zum Ende der Pilotphase (2024) noch ausstehen. Die Architektur basiert auf Open-Source-Software und die eingesetzten Machine-Learning-Komponenten werden im Rahmen einer internationalen Zusammenarbeit im engen Austausch mit der Finnischen Nationalbibliothek (NLF) weiterentwickelt und zur Nachnutzung in dem von der NLF entwickelten Open-Source-Werkzeugkasten Annif aufbereitet. Das Betriebsmodell des AutoSE-Dienstes sieht regelmäßige Überprüfungen sowohl einzelner Komponenten als auch des Produktionsworkflows als Ganzes vor und erlaubt eine fortlaufende Weiterentwicklung der Architektur. Eines der Ergebnisse, das bis zum Ende der Pilotphase vorliegen soll, ist die Dokumentation der Anforderungen an einen dauerhaften produktiven Betrieb des Dienstes, damit die Ressourcen dafür im Rahmen eines tragfähigen Modells langfristig gesichert werden können. Aus diesem Praxisbeispiel lässt sich ableiten, welche Bedingungen gegeben sein müssen, um Machine-Learning-Lösungen wie die in Annif enthaltenen erfolgreich an einer Institution für die Inhaltserschließung einsetzen zu können.Since 2016, ZBW – Leibniz Information Centre for Economics has been conducting their own research in the area of machine learning with the goal to develop viable solutions for automated or machine assisted subject indexing in-house. In 2020, a team at ZBW started designing and implementing a suitable software architecture in order to transfer these prototypical solutions into a productive service and to integrate it into the existing metadata systems and workflows. Both the applied research and the software development necessary for this endeavour (dubbed “AutoSE”) are executed by an organizational unit of the library department of ZBW, are continually pushed forward following the state of the art and benefit from a close communication with the staff responsible for intellectual subject indexing. This article reports on the milestones that the AutoSE team has reached over the last two years with respect to the implementation and the integration of the software and outlines those that are yet to be delivered until the end of the pilot phase (2024). The architecture is based on open source software and its machine-learning-based components are developed in close communication with the National Library of Finland (NLF) and, where possible, adapted to be integrated into NLF’s open source toolkit Annif. The operating model of the AutoSE service includes periodical reviews of individual components and of the productive workflow in its entirety and allows continuous improvements of the architecture. One of the results to be delivered by the end of the pilot phase is a documentation of the requirements for running the productive service on a permanent basis so that the necessary resources can be secured. This practical example shows which conditions have to be met by an institution in order to successfully use machine learning solutions such as the ones offered in Annif for subject indexing

o-bib - Das offene Bibliotheksjournal (VDB - Verein Deutscher Bibliothekare e.V.)