7 research outputs found

    Focused image search in the social Web.

    Get PDF
    Recently, social multimedia-sharing websites, which allow users to upload, annotate, and share online photo or video collections, have become increasingly popular. The user tags or annotations constitute the new multimedia meta-data . We present an image search system that exploits both image textual and visual information. First, we use focused crawling and DOM Tree based web data extraction methods to extract image textual features from social networking image collections. Second, we propose the concept of visual words to handle the image\u27s visual content for fast indexing and searching. We also develop several user friendly search options to allow users to query the index using words and image feature descriptions (visual words). The developed image search system tries to bridge the gap between the scalable industrial image search engines, which are based on keyword search, and the slower content based image retrieval systems developed mostly in the academic field and designed to search based on image content only. We have implemented a working prototype by crawling and indexing over 16,056 images from flickr.com, one of the most popular image sharing websites. Our experimental results on a working prototype confirm the efficiency and effectiveness of the methods, that we proposed

    Data-Driven Technology Management Supported by Artificial Intelligence Solutions

    Get PDF
    Technology Management is an important part of a company’s business strategy. Procuring, evaluating and processing information is crucial for this process' success. This paper describes the way of dealing with today’s challenge of information overload. We introduce technology databases based on the patented TCB/SETS system, concepts of Artificial Intelligence-based information processing and present exemplary existing solutions. Afterwards, we discuss a concept which combines TCB/SETS and information retrieval solutions to a new smart approach. With our paper we address application oriented and theoretical working interest groups such as engineers and technology managers as well as scientists in research and teaching

    Website summarization: a topic hierarchy based approach.

    Get PDF
    Liu Nan.Thesis (M.Phil.)--Chinese University of Hong Kong, 2006.Includes bibliographical references (leaves 84-88).Abstracts in English and Chinese.Abstract --- p.1Acknowledgements --- p.3Contents --- p.4List of Figures --- p.6List of Tables --- p.7Chapter Chapter 1 --- Introduction --- p.8Chapter Chapter 2 --- Related Work --- p.12Chapter 2.1 --- Web Structure Mining --- p.12Chapter 2.1.1 --- HITS Algorithm --- p.13Chapter 2.1.2 --- PageRank Algorithm --- p.13Chapter 2.2 --- Website Mining --- p.14Chapter 2.2.1 --- Website Classification --- p.14Chapter 2.2.2 --- Web Unit Mining --- p.16Chapter 2.2.3 --- Logical Domain Extraction --- p.16Chapter 2.2.4 --- Web Thesaurus Construction --- p.17Chapter Chapter 3 --- Website Topic Hierarchy Generation --- p.19Chapter 3.1 --- Problem Definition --- p.19Chapter 3.2 --- Graph Based Algorithms --- p.21Chapter 3.2.1 --- Breadth First Search --- p.21Chapter 3.2.2 --- Shortest Path Search --- p.23Chapter 3.2.3 --- Minimum Directed Spanning Tree --- p.24Chapter 3.2.4 --- Discussion --- p.27Chapter 3.3 --- Edge Weight Function --- p.28Chapter 3.3.1 --- Relevance Method --- p.29Chapter 3.3.2 --- Machine Learning Method --- p.32Chapter 3.4 --- Experiments --- p.47Chapter 3.4.1 --- Data Preparation --- p.47Chapter 3.4.2 --- Performances of Breadth-first Search --- p.50Chapter 3.4.3 --- Performances of Shortest-path Search --- p.50Chapter 3.4.4 --- Performances of Directed Minimum Spanning Tree --- p.54Chapter 3.4.5 --- Comparison of Different Algorithms --- p.55Chapter Chapter 4 --- Website Summarization Through Keyphrase Extraction --- p.58Chapter 4.1 --- Introduction --- p.58Chapter 4.2 --- Background --- p.60Chapter 4.3 --- Keyphrase Extraction --- p.69Chapter 4.3.1 --- Candidate Phrases Idenfication --- p.69Chapter 4.3.2 --- Feature Calculation without Topic Hierarchy --- p.70Chapter 4.3.3 --- Feature Calculation with Topic Hierarchy --- p.72Chapter 4.3.4 --- Extraction of Keyphrases --- p.75Chapter 4.4 --- Experiments --- p.76Chapter Chapter 5 --- Conclusion and Future Work --- p.82References: --- p.8

    Web modelling for web warehouse design

    Get PDF
    Tese de doutoramento em Informática (Engenharia Informática), apresentada à Universidade de Lisboa através da Faculdade de Ciências, 2007Users require applications to help them obtaining knowledge from the web. However, the specific characteristics of web data make it difficult to create these applications. One possible solution to facilitate this task is to extract information from the web, transform and load it to a Web Warehouse, which provides uniform access methods for automatic processing of the data. Web Warehousing is conceptually similar to Data Warehousing approaches used to integrate relational information from databases. However, the structure of the web is very dynamic and cannot be controlled by the Warehouse designers. Web models frequently do not reflect the current state of the web. Thus, Web Warehouses must be redesigned at a late stage of development. These changes have high costs and may jeopardize entire projects. This thesis addresses the problem of modelling the web and its influence in the design of Web Warehouses. A model of a web portion was derived and based on it, a Web Warehouse prototype was designed. The prototype was validated in several real-usage scenarios. The obtained results show that web modelling is a fundamental step of the web data integration process.Os utilizadores da web recorrem a ferramentas que os ajudem a satisfazer as suas necessidades de informação. Contudo, as características específicas dos conteúdos provenientes da web dificultam o desenvolvimento destas aplicações. Uma aproximação possível para a resolução deste problema é a integração de dados provenientes da web num Armazém de Dados Web que, por sua vez, disponibilize métodos de acesso uniformes e facilitem o processamento automático. Um Armazém de Dados Web é conceptualmente semelhante a um Armazém de Dados de negócio. No entanto, a estrutura da informação a carregar, a web, não pode ser controlada ou facilmente modelada pelos analistas. Os modelos da web existentes não são tipicamente representativos do seu estado presente. Como consequência, os Armazéns de Dados Web sofrem frequentemente alterações profundas no seu desenho quando já se encontram numa fase avançada de desenvolvimento. Estas mudanças têm custos elevados e podem pôr em causa a viabilidade de todo um projecto. Esta tese estuda o problema da modelação da web e a sua influência no desenho de Armazéns de Dados Web. Para este efeito, foi extraído um modelo de uma porção da web, e com base nele, desenhado um protótipo de um Armazém de Dados Web. Este protótipo foi validado através da sua utilização em vários contextos distintos. Os resultados obtidos mostram que a modelação da web deve ser considerada no processo de integração de dados da web.Fundação para Computação Científica Nacional (FCCN); LaSIGE-Laboratório de Sistemas Informáticos de Grande Escala; Fundação para a Ciência e Tecnologia (FCT), (SFRH/BD/11062/2002

    Untangling the Web: A Guide To Internet Research

    Get PDF
    [Excerpt] Untangling the Web for 2007 is the twelfth edition of a book that started as a small handout. After more than a decade of researching, reading about, using, and trying to understand the Internet, I have come to accept that it is indeed a Sisyphean task. Sometimes I feel that all I can do is to push the rock up to the top of that virtual hill, then stand back and watch as it rolls down again. The Internet—in all its glory of information and misinformation—is for all practical purposes limitless, which of course means we can never know it all, see it all, understand it all, or even imagine all it is and will be. The more we know about the Internet, the more acute is our awareness of what we do not know. The Internet emphasizes the depth of our ignorance because our knowledge can only be finite, while our ignorance must necessarily be infinite. My hope is that Untangling the Web will add to our knowledge of the Internet and the world while recognizing that the rock will always roll back down the hill at the end of the day

    2019, UMaine News Press Releases

    Get PDF
    This is a catalog of press releases put out by the University of Maine Division of Marketing and Communications between January 23, 2019 and December 31, 2019
    corecore