70 research outputs found

    Automated Slovene crosswords solving with the help of the internet

    Get PDF
    The goal of the thesis was to create a program, that automatically solves Slovene crosswords with the help of puzzle dictionary and web search. For the purpose of creating the program, we used a database of 367 crosswords and puzzle dictionary with 104.123 clue answers. We also created a list of web answers for all the crosswords in the database. Because of the enormous amount of the web answers searching the whole solution space would not be feasible. That is why the program solves each crossword in two separate phases. In the first phase, a crossword gets partially solved with the help of the puzzle dictionary, which creates a list of most probable solution candidates for each clue. In the second phase, a crossword is solved with puzzle dictionary and web answers combined. For the purpose of building the program, we used a set of 37 training crosswords. The final results were obtained from a set of 330 testing crosswords

    Logic Programming Applications: What Are the Abstractions and Implementations?

    Full text link
    This article presents an overview of applications of logic programming, classifying them based on the abstractions and implementations of logic languages that support the applications. The three key abstractions are join, recursion, and constraint. Their essential implementations are for-loops, fixed points, and backtracking, respectively. The corresponding kinds of applications are database queries, inductive analysis, and combinatorial search, respectively. We also discuss language extensions and programming paradigms, summarize example application problems by application areas, and touch on example systems that support variants of the abstractions with different implementations

    Incremental inference on higher-order probabilistic graphical models applied to constraint satisfaction problems

    Get PDF
    Thesis (PhD)--Stellenbosch University, 2022.ENGLISH ABSTRACT: Probabilistic graphical models (PGMs) are used extensively in the probabilistic reasoning domain. They are powerful tools for solving systems of complex relationships over a variety of probability distributions, such as medical and fault diagnosis, predictive modelling, object recognition, localisation and mapping, speech recognition, and language processing [5, 6, 7, 8, 9, 10, 11]. Furthermore, constraint satisfaction problems (CSPs) can be formulated as PGMs and solved with PGM inference techniques. However, the prevalent literature on PGMs shows that suboptimal PGM structures are primarily used in practice and a suboptimal formulation for constraint satisfaction PGMs. This dissertation aimed to improve the PGM literature through accessible algorithms and tools for improved PGM structures and inference procedures, specifically focusing on constraint satisfaction. To this end, this dissertation presents three published contributions to the current literature: a comparative study to compare cluster graph topologies to the prevalent factor graphs [1], an application of cluster graphs in land cover classification in the field of cartography [2], and a comprehensive integration of various aspects required to formulate CSPs as PGMs and an algorithm to solve this formulation for problems too complex for traditional PGM tools [3]. First, we present a means of formulating and solving graph colouring problems with probabilistic graphical models. In contrast to the prevailing literature that mostly uses factor graph configurations, we approach it from a cluster graph perspective, using the general-purpose cluster graph construction algorithm, LTRIP. Our experiments indicate a significant advantage for preferring cluster graphs over factor graphs, both in terms of accuracy as well as computational efficiency. Secondly, we use these tools to solve a practical problem: land cover classification. This process is complex due to measuring errors, inefficient algorithms, and low-quality data. We proposed a PGM approach to boost geospatial classifications from different sources and consider the effects of spatial distribution and inter-class dependencies (similarly to graph colouring). Our PGM tools were shown to be robust and were able to produce a diverse, feasible, and spatially-consistent land cover classification even in areas of incomplete and conflicting evidence. Lastly, in our third publication, we investigated and improved the PGM structures used for constraint satisfaction. It is known that tree-structured PGMs always result in an exact solution [12, p355], but is usually impractical for interesting problems due to exponential blow-up. We, therefore, developed the ā€œpurge-and mergeā€ algorithm to incrementally approximate a tree-structured PGM. This algorithm iteratively nudges a malleable graph structure towards a tree structure by selectively merging factors. The merging process is designed to avoid exponential blow-up through sparse data structures from which redundancy is purged as the algorithm progresses. This algorithm is tested on constraint satisfaction puzzles such as Sudoku, Fill-a-pix, and Kakuro and manages to outperform other PGM-based approaches reported in the literature [13, 14, 15]. Overall, the research reported in this dissertation contributed to developing a more optimised approach for higher order probabilistic graphical models. Further studies should concentrate on applying purge-and-merge on problems closer to probabilistic reasoning than constraint satisfaction and report its effectiveness in that domain.AFRIKAANSE OPSOMMING: Grafiese waarskynlikheidsmodelle (PGM) word wyd gebruik vir komplekse waarskynlikheidsprobleme. Dit is kragtige gereedskap om sisteme van komplekse verhoudings oor ā€˜n versameling waarskynlikheidsverspreidings op te los, soos die mediese en foutdiagnoses, voorspellingsmodelle, objekherkenning, lokalisering en kartering, spraakherkenning en taalprosessering [5, 6, 7, 8, 9, 10, 11]. Voorts kan beperkingvoldoeningsprobleme (CSP) as PGMā€™s geformuleer word en met PGM gevolgtrekkingtegnieke opgelos word. Die heersende literatuur oor PGMā€™s toon egter dat sub-optimale PGM-strukture hoofsaaklik in die praktyk gebruik word en ā€˜n sub-optimale PGM-formulering vir CSPā€™s. Die doel met die verhandeling is om die PGM-literatuur deur toeganklike algoritmes en gereedskap vir verbeterde PGM-strukture en gevolgtrekking-prosedures te verbeter deur op CSP toepassings te fokus. Na aanleiding hiervan voeg die verhandeling drie gepubliseerde bydraes by die huidige literatuur: ā€˜n vergelykende studie om bundelgrafieke tot die heersende faktorgrafieke te vergelyk [1], ā€˜n praktiese toepassing vir die gebruik van bundelgrafieke in ā€œland-coverā€- klassifikasie in die kartografieveld [2] en ā€˜n omvattende integrasie van verskeie aspekte om CSPā€™s as PGMā€™s te formuleer en ā€˜n algoritme vir die formulering van probleme te kompleks vir tradisionele PGM-gereedskap [3] Eerstens bied ons ā€˜n wyse van formulering en die oplos van grafiekkleurprobleme met PGMā€™s. In teenstelling met die huidige literatuur wat meestal faktorgrafieke gebruik, benader ons dit van ā€˜n bundelgrafiek-perspektief deur die gebruik van die automatiese bundelgrafiekkonstruksie-algoritme, LTRIP. Ons eksperimente toon ā€˜n beduidende voorkeur vir bundelgrafieke teenoor faktorgrafieke, wat akku raatheid asook berekende doeltreffendheid betref. Tweedens gebruik ons die gereedskap om ā€˜n praktiese probleem op te los: ā€œlandcoverā€-klassifikasie. Die proses is kompleks weens metingsfoute, ondoeltreffende algoritmes en lae-gehalte data. Ons stel ā€˜n PGM-benadering voor om die georuimtelike klassifikasies van verskillende bronne te versterk, asook die uitwerking van ruimtelike verspreiding en interklas-afhanklikhede (soortgelyk aan grafiekkleurprobleme). Ons PGM-gereedskap is robuus en kon ā€˜n diverse, uitvoerbare en ruimtelik-konsekwente ā€œland-coverā€-klassifikasie selfs in gebiede van onvoltooide en konflikterende inligting bewys. Ten slotte het ons in ons derde publikasie die PGM-strukture vir CSPā€™s ondersoek en verbeter. Dit is bekend dat boomstrukture altyd tot ā€˜n eksakte oplossing lei [12, p355], maar is weens eksponensiĆ«le uitbreiding gewoonlik onprakties vir interessante probleme. Ons het gevolglik die algoritme, purge-and-merge, ontwikkel om inkrementeel ā€˜n boomstruktuur na te doen. Die algoritme hervorm ā€˜n bundelgrafiek stapsgewys in ā€˜n boomstruktuur deur faktore selektief te ā€œmergeā€. Die saamsmeltproses is ontwerp om eksponensiĆ«le uitbreiding te vermy deur van yl datastrukture gebruik te maak waarvan die waarskeinlikheidsruimte ge-ā€œpurgeā€ word namate die algoritme vorder. Die algoritme is getoets op CSP-speletjies soos Sudoku, Fill-a-pix en Kakuro en oortref ander PGM-gegronde benaderings waaroor in die literatuur verslag gedoen word [13, 14, 15]. In die geheel gesien, het die navorsing bygedra tot die ontwikkeling van ā€˜n meer geoptimaliseerde benadering vir hoĆ«r-orde PGMā€™s. Verdere studies behoort te fokus op die toepassing van purge-and-merge op probleme nader aan waarskynlikheidsredenasie-probleme as aan CSPā€™s en moet sy effektiwiteit in daar die domein rapporteer.Doctora
    • ā€¦
    corecore