7 research outputs found

    Usability Guidelines for Desktop Search Engines

    Get PDF
    In this article we describe a usability evaluation of eight desktop search engines (DSEs). We used the heuristic walkthrough method to gather usability problems as well as individual strengths and weaknesses of the tested search engines. The results of the evaluation are integrated into a set of 30 design guidelines for user-friendly DSEs

    Developing a Process Model for the Forensic Extraction of Information from Desktop Search

    Get PDF
    Desktop search applications can contain cached copies of files that were deleted from the file system. Forensic investigators see this as a potential source of evidence, as documents deleted by suspects may still exist in the cache. Whilst there have been attempts at recovering data collected by desktop search applications, there is no methodology governing the process, nor discussion on the most appropriate means to do so. This article seeks to address this issue by developing a process model that can be applied when developing an information extraction application for desktop search applications, discussing preferred methods and the limitations of each. This work represents a more structured approach than other forms of current research

    Desktop Search Engine for Linux

    Get PDF
    Desktop Search Engine become more popular for personal and enterprise after some difficulties occurs when dealing with the huge amount of files and in a multi-user environment. Desktop Search Engine for Linux is a desktop search tool, integrated between Namazu with web-based interface that can search text format files inthe hard drive ofpersonal computer. As increasingly demand for aneffective and efficient desktop search tools especially for the Linux environment where there were just a few tools have been developed for Linux compared for Windows although the usage of Linux operating system are increased from days to days. This system is just for Linux (Debian platform) operating system and just search for a text format files. This system index the entire words of files in the hard drive and create one index files that contains all details about the files in the disk. The system just refers to this index files when processing the searching process for a fast and effective results. From the studies and analysis that has been done during the development of this system, there have a benchmark criterion for desktop search tools that can be use as a reference and also a lot of indexer that can be used to index the files. Only the best indexer was be taken to integrate with this system. This system still can be improved with the support, effort and deep knowledge about desktop searchtools and technical skills

    Integrando motores de indexação de dados para a construção de sistemas de recuperação de informação em ambientes heterogêneos

    Get PDF
    The different proposals for information recovery put forward a set of limitations about the environments which they recover information from, they usually restrict the information recovery to a very specific data source or knowledge area. In this context, this paper proposes a model and a framework in order to integrate different index engines to enable the development of Information Retrieval Systems capable of retrieving information in heterogeneous data sources, and, therefore, showing up as a solution to fulfill the demand to access information in corporate environments.As diversas propostas existentes para a recuperação de informação apresentam limitações referentes ao ambiente em que atuam, restringindo a recuperação de informações a fontes de dados ou a áreas do conhecimento especificas. Neste contexto, este artigo propõe um modelo e um arcabouço para a integração de motores de indexação a fim de possibilitar o desenvolvimento de Sistemas de Recuperação de Informação (SRIs) capacitados a recuperar informações em ambientes de fontes de dados heterogêneas, apresentando-se assim, como uma abordagem para atender a demanda de acesso às informações em ambientes corporativos

    Ein Vergleich ausgewählter Desktop-Suchmaschinen

    Get PDF
    Ilmenauer Beiträge zur Wirtschaftsinformatik Nr. 2011-02 / Technische Universität Ilmenau, Fakultät für Wirtschaftswissenschaften, Institut für Wirtschaftsinformatik ISSN 1861-9223 ISBN 978-3-938940-33-

    Search in glintt HS solutions

    Get PDF
    Documento confidencial. Não pode ser disponibilizado para consultaTese de mestrado integrado. Engenharia Informática e Computação. Faculdade de Engenharia. Universidade do Porto. 201

    Técnicas de mineração incrementais em recuperação de informação

    Get PDF
    [EN] A desirable property of learning algorithms is the ability of incorporating new data in an incremental way. Incremental algorithms have received attention on the last few years. Particulary Bayesian networks, this is due to the hardness of the task. In Bayesian networks one example can change the whole structure of the Bayesian network. In this theses we focus on incremental induction of Tree Augmented Naive Bayes (TAN) algorithm. A incremental version of TAN saves computing time, is more suite to data mining and concept drift. But, as usual in Bayesian learning TAN is restricted to discrete attributes. Complementary to the incremental TAN, we propose an incremental discretization algorithm, necessary to evaluate TAN in domains with continuous attribute. Discretization is a fundamental pre-processing step for some well- known algorithms, the topic of incremental discretization has received few attention from the community. This theses has two major contributions, the benefict of both proposals is incremental learning, one for TAN and the other for discretization.We present and test a algorithm that rebuilds the network structure of tree augmented naive Bayes (TAN) based on the weighted sum of vectors containing the mutual information. We also present a new discretization method, this works in two layers. This two-stage architecture is very fexible. It can be used as supervised or unsupervised. For the second layer any base discretization method can be used: equal width, equal frequency, recursive entropy discretization, chi-merge, etc. The most relevant aspect is that the boundaries of the intervals of the second layer can change when new data is available. We tested experimentally the incremental approach to discretization with batch and incremental learners. The experimental evaluation of incremental TAN shows a perfor mance similar to the batch version. Similar remarks apply to incremental discretization. This is a relevant aspect, because few works in machine learning address the fundamental aspect of incremental discretization. We believe that with Incremental discretization, the evaluation of the incremental algorithms can become more realistic and accurate. We evaluated two versions of incremental discretization: supervised and unsupervised. We have seen that this feature can improve accuracy for the incremental learners and that the preview of future algorithm performance can be more precise. This method of discretization has another advantages, like, can be used with large data set's or can be used in dynamic environments with concept drift, areas where a batch discretization can be difficult or is not adequate.[ES] Esta tesis tenía como objetivo el estudio de una red Bayesiana (TAN) incremental. Durante el transcurso de esta se verificó la laguna en el área de una discretización incremental para la evaluación de un algoritmo incremental. Así se procuró dar como contribución para el área no solo un clasificador Bayesiano incremental sino también un modo de evaluación correcto del clasificador. Los Sistemas de Recuperación de Información tienen como objetivo la realización de las tareas de indexación, búsqueda y clasificación de documentos (expresos en la forma textual), con el fin de satisfacer la necesidad de información del individuo, generalmente expresa a través de consultas. La necesidad de información puede ser entendida como la búsqueda de respuestas para determinadas cuestiones que tienen que ser resueltas, la recuperación de documentos que tratan sobre un determinado asunto o incluso la relación entre asuntos
    corecore