855 research outputs found

    Creating sparks: comparing search results using discriminatory search term word co-occurrence to facilitate serendipity in the enterprise.

    Get PDF
    Categories or tags that appear in faceted search interfaces which are representative of an information item, rarely convey unexpected or non-obvious associated concepts buried within search results. No prior research has been identified which assesses the usefulness of discriminative search term word co-occurrence to generate facets to act as catalysts to facilitate insightful and serendipitous encounters during exploratory search. In this study, 53 scientists from two organisations interacted with semi-interactive stimuli, 74% expressing a large/moderate desire to use such techniques within their workplace. Preferences were shown for certain algorithms and colour coding. Insightful and serendipitous encounters were identified. These techniques appear to offer a significant improvement over existing approaches used within the study organisations, providing further evidence that insightful and serendipitous encounters can be facilitated in the search user interface. This research has implications for organisational learning, knowledge discovery and exploratory search interface design

    When things matter: A survey on data-centric Internet of Things

    Get PDF
    With the recent advances in radio-frequency identification (RFID), low-cost wireless sensor devices, and Web technologies, the Internet of Things (IoT) approach has gained momentum in connecting everyday objects to the Internet and facilitating machine-to-human and machine-to-machine communication with the physical world. IoT offers the capability to connect and integrate both digital and physical entities, enabling a whole new class of applications and services, but several significant challenges need to be addressed before these applications and services can be fully realized. A fundamental challenge centers around managing IoT data, typically produced in dynamic and volatile environments, which is not only extremely large in scale and volume, but also noisy and continuous. This paper reviews the main techniques and state-of-the-art research efforts in IoT from data-centric perspectives, including data stream processing, data storage models, complex event processing, and searching in IoT. Open research issues for IoT data management are also discussed

    Extracting News Events from Microblogs

    Full text link
    Twitter stream has become a large source of information for many people, but the magnitude of tweets and the noisy nature of its content have made harvesting the knowledge from Twitter a challenging task for researchers for a long time. Aiming at overcoming some of the main challenges of extracting the hidden information from tweet streams, this work proposes a new approach for real-time detection of news events from the Twitter stream. We divide our approach into three steps. The first step is to use a neural network or deep learning to detect news-relevant tweets from the stream. The second step is to apply a novel streaming data clustering algorithm to the detected news tweets to form news events. The third and final step is to rank the detected events based on the size of the event clusters and growth speed of the tweet frequencies. We evaluate the proposed system on a large, publicly available corpus of annotated news events from Twitter. As part of the evaluation, we compare our approach with a related state-of-the-art solution. Overall, our experiments and user-based evaluation show that our approach on detecting current (real) news events delivers a state-of-the-art performance

    Counting equivalence classes of Boolean functions

    Get PDF
    У овој дисертацији разматран јe проблем израчунавања броја класа еквиваленције Булових функција. Тежина одређивања броја класа еквивален- ције нагло расте са бројем променљивих n. Мотивација за избор ове теме лежи у чињеници да су конкретни бројеви до сада били познати само за релативно мале вредности n, иако је сам проблем теоријски одавно решен...In this dissertation, the problem of calculating the number of equiva- lence classes of Boolean functions is discussed. The difficulty of determining the number of equivalence classes increases sharply with the number of variables n. The motivation for choosing this topic lies in the fact that concrete numbers have been known so far only for relatively small values of n, although the problem itself was theoretically solved a long time ago..

    Discovering visiting behaviors and city perceptions by mining semantic trajectory

    Get PDF
    Tourism is a crucial industry for many cities, necessitating the development of unique attractions to draw in more visitors. Understanding the visiting behaviors and perceptions of visitors helps to uncover the city’s distinctive characteristics, thereby aiding in the further growth of its tourism industry. It’s important to note that different population groups may exhibit varying visiting behaviors depending on the time of their visit, which in turn can shape their impressions of the city. This study explores the dynamic visiting behaviors and city perceptions of locals and tourists throughout different times of the day and week. The study area is London, one of the world’s most famous tourist cities. To conduct this study, User-Generated Content (UGC) is utilized, specifically data from Foursquare check-ins and Flickr tags from April 3, 2012, to September 16, 2013. The study first identifies the spatiotemporal distribution of hotspots for each population group based on their Foursquare check-ins. The relative concentration of locals and tourists is then examined through the difference ratio to understand their unique visiting preferences. Next, the spatiotemporal movements of locals and tourists and their city descriptions during their trips are analyzed by constructing semantic trajectories. The place is the fundamental element of a semantic trajectory, and places are constructed by clustering Foursquare check-ins. The property of the place is defined by three dimensions: location (represented by borough names), locale (represented by place categories), and sense of place (represented by topics generated in topic modeling based on Flickr tags). Semantic trajectories are then clustered based on their semantic dimensions, and typical trajectories are mined for each cluster. The distribution of trajectories and their semantic dimensions are compared between locals and tourists at different time spans to explore how London’s impressions evolve over time. The results suggest distinct visiting behaviors and city perceptions over time for locals and tourists. Both groups primarily concentrate in the city center, with small hotspots around the airport. However, locals tend to visit more suburban areas than tourists. Locals show higher preferences for business districts during the daytime and on weekdays, while tourists consistently show interest in shopping in the city center. In terms of city perceptions, the city center is associated with descriptions of cityscapes and transport during the daytime. At night, people tend to associate the same area with nightlife activities. Furthermore, locals are interested in leisure activities and fitness, while tourists tend to focus on tourist attractions and the Olympics

    European Handbook of Crowdsourced Geographic Information

    Get PDF
    This book focuses on the study of the remarkable new source of geographic information that has become available in the form of user-generated content accessible over the Internet through mobile and Web applications. The exploitation, integration and application of these sources, termed volunteered geographic information (VGI) or crowdsourced geographic information (CGI), offer scientists an unprecedented opportunity to conduct research on a variety of topics at multiple scales and for diversified objectives. The Handbook is organized in five parts, addressing the fundamental questions: What motivates citizens to provide such information in the public domain, and what factors govern/predict its validity?What methods might be used to validate such information? Can VGI be framed within the larger domain of sensor networks, in which inert and static sensors are replaced or combined by intelligent and mobile humans equipped with sensing devices? What limitations are imposed on VGI by differential access to broadband Internet, mobile phones, and other communication technologies, and by concerns over privacy? How do VGI and crowdsourcing enable innovation applications to benefit human society? Chapters examine how crowdsourcing techniques and methods, and the VGI phenomenon, have motivated a multidisciplinary research community to identify both fields of applications and quality criteria depending on the use of VGI. Besides harvesting tools and storage of these data, research has paid remarkable attention to these information resources, in an age when information and participation is one of the most important drivers of development. The collection opens questions and points to new research directions in addition to the findings that each of the authors demonstrates. Despite rapid progress in VGI research, this Handbook also shows that there are technical, social, political and methodological challenges that require further studies and research

    Spatial and Temporal Sentiment Analysis of Twitter data

    Get PDF
    The public have used Twitter world wide for expressing opinions. This study focuses on spatio-temporal variation of georeferenced Tweets’ sentiment polarity, with a view to understanding how opinions evolve on Twitter over space and time and across communities of users. More specifically, the question this study tested is whether sentiment polarity on Twitter exhibits specific time-location patterns. The aim of the study is to investigate the spatial and temporal distribution of georeferenced Twitter sentiment polarity within the area of 1 km buffer around the Curtin Bentley campus boundary in Perth, Western Australia. Tweets posted in campus were assigned into six spatial zones and four time zones. A sentiment analysis was then conducted for each zone using the sentiment analyser tool in the Starlight Visual Information System software. The Feature Manipulation Engine was employed to convert non-spatial files into spatial and temporal feature class. The spatial and temporal distribution of Twitter sentiment polarity patterns over space and time was mapped using Geographic Information Systems (GIS). Some interesting results were identified. For example, the highest percentage of positive Tweets occurred in the social science area, while science and engineering and dormitory areas had the highest percentage of negative postings. The number of negative Tweets increases in the library and science and engineering areas as the end of the semester approaches, reaching a peak around an exam period, while the percentage of negative Tweets drops at the end of the semester in the entertainment and sport and dormitory area. This study will provide some insights into understanding students and staff ’s sentiment variation on Twitter, which could be useful for university teaching and learning management
    corecore