146 research outputs found

    Implications of Computational Cognitive Models for Information Retrieval

    Get PDF
    This dissertation explores the implications of computational cognitive modeling for information retrieval. The parallel between information retrieval and human memory is that the goal of an information retrieval system is to find the set of documents most relevant to the query whereas the goal for the human memory system is to access the relevance of items stored in memory given a memory probe (Steyvers & Griffiths, 2010). The two major topics of this dissertation are desirability and information scent. Desirability is the context independent probability of an item receiving attention (Recker & Pitkow, 1996). Desirability has been widely utilized in numerous experiments to model the probability that a given memory item would be retrieved (Anderson, 2007). Information scent is a context dependent measure defined as the utility of an information item (Pirolli & Card, 1996b). Information scent has been widely utilized to predict the memory item that would be retrieved given a probe (Anderson, 2007) and to predict the browsing behavior of humans (Pirolli & Card, 1996b). In this dissertation, I proposed the theory that desirability observed in human memory is caused by preferential attachment in networks. Additionally, I showed that documents accessed in large repositories mirror the observed statistical properties in human memory and that these properties can be used to improve document ranking. Finally, I showed that the combination of information scent and desirability improves document ranking over existing well-established approaches

    Knowledge Modelling and Learning through Cognitive Networks

    Get PDF
    One of the most promising developments in modelling knowledge is cognitive network science, which aims to investigate cognitive phenomena driven by the networked, associative organization of knowledge. For example, investigating the structure of semantic memory via semantic networks has illuminated how memory recall patterns influence phenomena such as creativity, memory search, learning, and more generally, knowledge acquisition, exploration, and exploitation. In parallel, neural network models for artificial intelligence (AI) are also becoming more widespread as inferential models for understanding which features drive language-related phenomena such as meaning reconstruction, stance detection, and emotional profiling. Whereas cognitive networks map explicitly which entities engage in associative relationships, neural networks perform an implicit mapping of correlations in cognitive data as weights, obtained after training over labelled data and whose interpretation is not immediately evident to the experimenter. This book aims to bring together quantitative, innovative research that focuses on modelling knowledge through cognitive and neural networks to gain insight into mechanisms driving cognitive processes related to knowledge structuring, exploration, and learning. The book comprises a variety of publication types, including reviews and theoretical papers, empirical research, computational modelling, and big data analysis. All papers here share a commonality: they demonstrate how the application of network science and AI can extend and broaden cognitive science in ways that traditional approaches cannot

    The UPS Prototype An Experimental End-User Service across E-Print Archives

    Full text link

    Communicating with your E-memory: finding and refinding in personal lifelogs

    Get PDF
    The rapid development of technology enables the digital capture and storage of our life experiences in an ā€œE-Memoryā€ (electronicā€“memory) or personal lifelog (PLL). This offers the potential for people to store the details of their life in a permanent archive, so that the information is still available even when its physical existence has vanished and when memory traces of it have faded away. A major challenge for PLLs is enabling people to access information when it is needed. Many people may also want to share or transfer some of their memory to their friends and descendants, so that their experiences can be appreciated and their knowledge can be kept even after they have passed away. This thesis further explores peopleā€™s potential needs from their own PLLs, discuss the possible methods people may use and potential problems that they may encounter while accessing their PLLs, and hypothesize that better support of usersā€™ own memory can provide better user experience and improved efficiency for accessing their E-memories (or PLLs). As part of a larger project, three lifeloggers collected their own prototype lifelog collection for about 20 monthsā€™ time. To complete this study, the author developed a prototype PLL system, called the iCLIPS Lifelog Archive Browser (LAB), based on the authorā€™s theoretical exploration and empirical studies, and evaluated it using our prototype lifelog collections through a user study with the three lifeloggers. The results of this study provide promising evidence which support the hypothesis. The end of this thesis also discusses the issues that the lifeloggers encountered in using their lifelogs and future technologies that are desirable based the studies in this thesis

    Word-sense disambiguation in biomedical ontologies

    Get PDF
    With the ever increase in biomedical literature, text-mining has emerged as an important technology to support bio-curation and search. Word sense disambiguation (WSD), the correct identification of terms in text in the light of ambiguity, is an important problem in text-mining. Since the late 1940s many approaches based on supervised (decision trees, naive Bayes, neural networks, support vector machines) and unsupervised machine learning (context-clustering, word-clustering, co-occurrence graphs) have been developed. Knowledge-based methods that make use of the WordNet computational lexicon have also been developed. But only few make use of ontologies, i.e. hierarchical controlled vocabularies, to solve the problem and none exploit inference over ontologies and the use of metadata from publications. This thesis addresses the WSD problem in biomedical ontologies by suggesting diļ¬€erent approaches for word sense disambiguation that use ontologies and metadata. The "Closest Sense" method assumes that the ontology deļ¬nes multiple senses of the term; it computes the shortest path of co-occurring terms in the document to one of these senses. The "Term Cooc" method deļ¬nes a log-odds ratio for co-occurring terms including inferred co-occurrences. The "MetaData" approach trains a classiļ¬er on metadata; it does not require any ontology, but requires training data, which the other methods do not. These approaches are compared to each other when applied to a manually curated training corpus of 2600 documents for seven ambiguous terms from the Gene Ontology and MeSH. All approaches over all conditions achieve 80% success rate on average. The MetaData approach performs best with 96%, when trained on high-quality data. Its performance deteriorates as quality of the training data decreases. The Term Cooc approach performs better on Gene Ontology (92% success) than on MeSH (73% success) as MeSH is not a strict is-a/part-of, but rather a loose is-related-to hierarchy. The Closest Sense approach achieves on average 80% success rate. Furthermore, the thesis showcases applications ranging from ontology design to semantic search where WSD is important
    • ā€¦
    corecore