2,306 research outputs found

    The HyperBagGraph DataEdron: An Enriched Browsing Experience of Multimedia Datasets

    Full text link
    Traditional verbatim browsers give back information in a linear way according to a ranking performed by a search engine that may not be optimal for the surfer. The latter may need to assess the pertinence of the information retrieved, particularly when s\cdothe wants to explore other facets of a multi-facetted information space. For instance, in a multimedia dataset different facets such as keywords, authors, publication category, organisations and figures can be of interest. The facet simultaneous visualisation can help to gain insights on the information retrieved and call for further searches. Facets are co-occurence networks, modeled by HyperBag-Graphs -- families of multisets -- and are in fact linked not only to the publication itself, but to any chosen reference. These references allow to navigate inside the dataset and perform visual queries. We explore here the case of scientific publications based on Arxiv searches.Comment: Extension of the hypergraph framework shortly presented in arXiv:1809.00164 (possible small overlaps); use the theoretical framework of hb-graphs presented in arXiv:1809.0019

    Living with Data: Aligning Data Studies and Data Activism Through a Focus on Everyday Experiences of Datafication

    Get PDF
    It is now widely accepted that data are oiling the twenty-first century (Toonders 2014). Data gathering and tracking are practically universal, and datafication (the quantification of aspects of life previously experienced in qualitative, non-numeric form, such as communication, relationships, health and fitness, transport and mobility, democratic participation, leisure and consumption) is a transformation disrupting the social world in all its forms (Couldry 2016). Statistics confirm the assertion that the datafication of almost everything is growing relentlessly: in 2012 it was claimed that 90% of the world’s data had been created in the previous two years (IBM 2012), and a future 40% annual rise in data generation has been estimated (Manyika et al. 2011). Less commonly noted is the place of everyday experience in the machine of datafication. The Berliner Gazette (nd) has claimed that 75% of these newly available data are by-products of people’s everyday activities, and Michael and Lupton also note the centrality of the everyday in the current Big Data moment: Human actors contribute to big datasets when they engage in activities such as making calls and using apps on mobile phones, using online search engines such as Google, purchasing goods or services online or taking part in customer loyalty programmes, uploading contributions to social media platforms, using wearable self-tracking devices or moving around in spaces that are equipped with digital sensing or recording devices (Michael and Lupton 2015, 104). Despite the significance of such everyday practices in the production of large-scale data, little attention has been paid to people’s thoughts and feelings about these data-producing processes. These issues have not, on the whole, been the focus of the emerging field of data studies, which seeks to understand the new roles played by data in times of datafication. This is a problem for a number of reasons. First, if we do not understand whether data condition everyday experiences as it is claimed, and our thinking about these matters is not informed by the perspectives of the people upon whose data datafication is built, scholarship about data-in-society will be incomplete. Second, and importantly for this special issue, in the absence of such knowledge, data activism, which seeks to challenge existing data power relations and to mobilise data in order to enhance social justice, will rely upon the judgments of elite technical actors and activists about what would constitute more just data practices. In contrast, I argue that to build a picture of what just data arrangements (that is, the practices of organisations that handle and produce data, the policies that govern these practices, and provisions for the development of skills that people need in order to engage with data) might look like, it is important to take account of what non-expert citizens themselves say would enable them to live better with data, based on their everyday experiences of datafication. Greater understanding of everyday living with data can contribute significantly to the knowledge base on which data activism is built. A third problem, then, is that by not focusing on these issues, the field of data studies is not currently as well aligned to the aims of data activism as it might be. This paper explores how we might address this gap

    MyEvents: a personal visual analytics approach for mining key events and knowledge discovery in support of personal reminiscence

    Get PDF
    Reminiscence is an important aspect in our life. It preserves precious memories, allows us to form our own identities and encourages us to accept the past. Our work takes advantage of modern sensor technologies to support reminiscence, enabling self-monitoring of personal activities and individual movement in space and time on a daily basis. This paper presents MyEvents, a web-based personal visual analytics platform designed for non-computing experts, that allows for the collection of long-term location and movement data and the generation of event mementos. Our research is focused on two prominent goals in event reminiscence: 1) selection subjectivity and human involvement in the process of self knowledge discovery and memento creation; and 2) the enhancement of event familiarity by presenting target events and their related information for optimal memory recall and reminiscence. A novel multi-significance event ranking model is proposed to determine significant events in the personal history according to user preferences for event category, frequency and regularity. The evaluation results show that MyEvents effectively fulfils the reminiscence goals and tasks.

    From Big Data To Knowledge – Good Practices From Industry

    Get PDF
    Recent advancements in data gathering technologies have led to the rise of a large amount of data through which useful insights and ideas can be derived. These data sets are typically too large to process using traditional data processing tools and applications and thus known in the popular press as ‘big data’. It is essential to extract the hidden meanings in the available data sets by aggregating big data into knowledge, which may then positively contribute to decision making. One way to engage in data-driven strategy is to gather contextual relevant data on specific customers, products, and situations, and determine optimised offerings that are most appealing to the target customers based on sound analytics. Corporations around the world have been increasingly applying analytics, tools and technologies to capture, manage and process such data, and derive value out of the huge volumes of data generated by individuals. The detailed intelligence on consumer behaviour, user patterns and other hidden knowledge that was not possible to derive via traditional means could now be used to facilitate important business processes such as real-time control, and demand forecasting. The aim of our research is to understand and analyse the significance and impact of big data in today’s industrial environment and identify the good practices that can help us derive useful knowledge out of this wealth of information based on content analysis of 34 firms that have initiated big data analytical projects. Our descriptive and network analysis shows that the goals of a big data initiative are extensible and highlighted the importance of data representation. We also find the data analytical techniques adopted are heavily dependent on the project goals

    The role of visualisations in social media monitoring systems

    Get PDF
    Social-Media streams are constantly supplying vast volumes of real-time User Generated Content through platforms such as Twitter, Facebook, and Instagram, which makes it a challenge to monitor and understand. Understanding social conversations has now become a major interest for businesses, PR and advertising agencies, as well as law enforcement and government bodies. Monitoring of social-media allows us to observe large numbers of spontaneous, real-time interactions and varied expression of opinion, often fleeting and private. However, human, expert monitoring is generally unfeasible due to the high volumes of data. This has been a major reason for recent research and development work looking at automated social-media monitoring systems. Such systems often keep the human "out of the loop" as an NLP (Natural Language Processing) pipeline and other data-mining algorithms deal with analysing and extracting features and meaning from the data. This is plagued by a variety of problems, mostly due to the heterogenic, inconsistent and context-poor nature of social-media data, where as a result the accuracy and efficacy of such systems suffers. Nevertheless, automated social-media monitoring systems provide for a scalable, streamlined and often efficient way of dealing with big-data streams. The integration of processing outputs from automated systems and feedback to human experts is a challenge and deserves to be addressed in research literature. This paper will establish the role of the human in the social-media monitoring loop, based on prior systems work in this area. The focus of our investigation will be on use of visualisations for effective feedback to human experts. A specific, custom built system’s case-study in a social-media monitoring scenario will be considered and suggestions on how to bring back the human “into the loop” will be provided. Also some related ethical questions will be briefly considered. It is hoped that this work will inform and provide valuable insight to help improve development of automated social-media monitoring systems

    Using big data for customer centric marketing

    Get PDF
    This chapter deliberates on “big data” and provides a short overview of business intelligence and emerging analytics. It underlines the importance of data for customer-centricity in marketing. This contribution contends that businesses ought to engage in marketing automation tools and apply them to create relevant, targeted customer experiences. Today’s business increasingly rely on digital media and mobile technologies as on-demand, real-time marketing has become more personalised than ever. Therefore, companies and brands are striving to nurture fruitful and long lasting relationships with customers. In a nutshell, this chapter explains why companies should recognise the value of data analysis and mobile applications as tools that drive consumer insights and engagement. It suggests that a strategic approach to big data could drive consumer preferences and may also help to improve the organisational performance.peer-reviewe

    A visual analytics approach for visualisation and knowledge discovery from time-varying personal life data

    Get PDF
    A thesis submitted to the University of Bedfordshire, in ful filment of the requirements for the degree of Doctor of PhilosophyToday, the importance of big data from lifestyles and work activities has been the focus of much research. At the same time, advances in modern sensor technologies have enabled self-logging of a signi cant number of daily activities and movements. Lifestyle logging produces a wide variety of personal data along the lifespan of individuals, including locations, movements, travel distance, step counts and the like, and can be useful in many areas such as healthcare, personal life management, memory recall, and socialisation. However, the amount of obtainable personal life logging data has enormously increased and stands in need of effective processing, analysis, and visualisation to provide hidden insights owing to the lack of semantic information (particularly in spatiotemporal data), complexity, large volume of trivial records, and absence of effective information visualisation on a large scale. Meanwhile, new technologies such as visual analytics have emerged with great potential in data mining and visualisation to overcome the challenges in handling such data and to support individuals in many aspects of their life. Thus, this thesis contemplates the importance of scalability and conducts a comprehensive investigation into visual analytics and its impact on the process of knowledge discovery from the European Commission project MyHealthAvatar at the Centre for Visualisation and Data Analytics by actively involving individuals in order to establish a credible reasoning and effectual interactive visualisation of such multivariate data with particular focus on lifestyle and personal events. To this end, this work widely reviews the foremost existing work on data mining (with the particular focus on semantic enrichment and ranking), data visualisation (of time-oriented, personal, and spatiotemporal data), and methodical evaluations of such approaches. Subsequently, a novel automated place annotation is introduced with multilevel probabilistic latent semantic analysis to automatically attach relevant information to the collected personal spatiotemporal data with low or no semantic information in order to address the inadequate information, which is essential for the process of knowledge discovery. Correspondingly, a multi-signi ficance event ranking model is introduced by involving a number of factors as well as individuals' preferences, which can influence the result within the process of analysis towards credible and high-quality knowledge discovery. The data mining models are assessed in terms of accurateness and performance. The results showed that both models are highly capable of enriching the raw data and providing significant events based on user preferences. An interactive visualisation is also designed and implemented including a set of novel visual components signifi cantly based upon human perception and attentiveness to visualise the extracted knowledge. Each visual component is evaluated iteratively based on usability and perceptibility in order to enhance the visualisation towards reaching the goal of this thesis. Lastly, three integrated visual analytics tools (platforms) are designed and implemented in order to demonstrate how the data mining models and interactive visualisation can be exploited to support different aspects of personal life, such as lifestyle, life pattern, and memory recall (reminiscence). The result of the evaluation for the three integrated visual analytics tools showed that this visual analytics approach can deliver a remarkable experience in gaining knowledge and supporting the users' life in certain aspects

    Using machine learning to support better and intelligent visualisation for genomic data

    Get PDF
    Massive amounts of genomic data are created for the advent of Next Generation Sequencing technologies. Great technological advances in methods of characterising the human diseases, including genetic and environmental factors, make it a great opportunity to understand the diseases and to find new diagnoses and treatments. Translating medical data becomes more and more rich and challenging. Visualisation can greatly aid the processing and integration of complex data. Genomic data visual analytics is rapidly evolving alongside with advances in high-throughput technologies such as Artificial Intelligence (AI), and Virtual Reality (VR). Personalised medicine requires new genomic visualisation tools, which can efficiently extract knowledge from the genomic data effectively and speed up expert decisions about the best treatment of an individual patient’s needs. However, meaningful visual analysis of such large genomic data remains a serious challenge. Visualising these complex genomic data requires not only simply plotting of data but should also lead to better decisions. Machine learning has the ability to make prediction and aid in decision-making. Machine learning and visualisation are both effective ways to deal with big data, but they focus on different purposes. Machine learning applies statistical learning techniques to automatically identify patterns in data to make highly accurate prediction, while visualisation can leverage the human perceptual system to interpret and uncover hidden patterns in big data. Clinicians, experts and researchers intend to use both visualisation and machine learning to analyse their complex genomic data, but it is a serious challenge for them to understand and trust machine learning models in the serious medical industry. The main goal of this thesis is to study the feasibility of intelligent and interactive visualisation which combined with machine learning algorithms for medical data analysis. A prototype has also been developed to illustrate the concept that visualising genomics data from childhood cancers in meaningful and dynamic ways could lead to better decisions. Machine learning algorithms are used and illustrated during visualising the cancer genomic data in order to provide highly accurate predictions. This research could open a new and exciting path to discovery for disease diagnostics and therapies

    Emoto - visualising the online response to London 2012.

    Get PDF
    In recent years we have moved from data scarcity to data abundance. As a response, a variety of methods have been adopted in art, design, business, science and government to understand and communicate meaning in data through visual form. emoto (emoto2012.org) is one such project, it visualised the online audience response to a major global event, the London 2012 Olympic and Paralympic Games. emoto set out to both give expression to and augment online social phenomena, that are emergent and only recently made possible by access to huge real-time data streams. This report charts the development and release of the project, and positions it in relation to current debates on data and visualisation, for example, around the bias and accessibility of the data, and how knowledge practices are changing in an era of so-called 'big data.
    corecore