947 research outputs found

    A review of data visualization: opportunities in manufacturing sequence management.

    No full text
    Data visualization now benefits from developments in technologies that offer innovative ways of presenting complex data. Potentially these have widespread application in communicating the complex information domains typical of manufacturing sequence management environments for global enterprises. In this paper the authors review the visualization functionalities, techniques and applications reported in literature, map these to manufacturing sequence information presentation requirements and identify the opportunities available and likely development paths. Current leading-edge practice in dynamic updating and communication with suppliers is not being exploited in manufacturing sequence management; it could provide significant benefits to manufacturing business. In the context of global manufacturing operations and broad-based user communities with differing needs served by common data sets, tool functionality is generally ahead of user application

    Persistent Homology Guided Force-Directed Graph Layouts

    Full text link
    Graphs are commonly used to encode relationships among entities, yet their abstractness makes them difficult to analyze. Node-link diagrams are popular for drawing graphs, and force-directed layouts provide a flexible method for node arrangements that use local relationships in an attempt to reveal the global shape of the graph. However, clutter and overlap of unrelated structures can lead to confusing graph visualizations. This paper leverages the persistent homology features of an undirected graph as derived information for interactive manipulation of force-directed layouts. We first discuss how to efficiently extract 0-dimensional persistent homology features from both weighted and unweighted undirected graphs. We then introduce the interactive persistence barcode used to manipulate the force-directed graph layout. In particular, the user adds and removes contracting and repulsing forces generated by the persistent homology features, eventually selecting the set of persistent homology features that most improve the layout. Finally, we demonstrate the utility of our approach across a variety of synthetic and real datasets

    A task-based evaluation methodology for visual representation of dynamic networks

    Get PDF
    Current evaluation approaches for visualization strategies of dynamic networks are focused on maintaining the mental map ofthe network over the time or keeping a certain shape to make iteasy to navigate, however the available tools for analyzing temporal network have not been evaluated in terms of how easy to usethey are to perform exploratory data analysis tasks with dynamicnetworks. In this work we present an evaluation methodologythat guides the usability assessment of software tools used to analyze dynamic networks by using the standard ISO 9241-11. Thismethodology has been applied successfully with two popularopen source tools used to analyze temporal networks.ITESO, A.C

    Visual analysis of anatomy ontologies and related genomic information

    Get PDF
    Challenges in scientific research include the difficulty in obtaining overviews of the large amount of data required for analysis, and in resolving the differences in terminology used to store and interpret information in multiple, independently created data sets. Ontologies provide one solution for analysis involving multiple data sources, improving cross-referencing and data integration. This thesis looks at harnessing advanced human perception to reduce the cognitive load in the analysis of the multiple, complex data sets the bioinformatics user group studied use in research, taking advantage also of users’ domain knowledge, to build mental models of data that map to its underlying structure. Guided by a user-centred approach, prototypes were developed to provide a visual method for exploring users’ information requirements and to identify solutions for these requirements. 2D and 3D node-link graphs were built to visualise the hierarchically structured ontology data, to improve analysis of individual and comparison of multiple data sets, by providing overviews of the data, followed by techniques for detailed analysis of regions of interest. Iterative, heuristic and structured user evaluations were used to assess and refine the options developed for the presentation and analysis of the ontology data. The evaluation results confirmed the advantages that visualisation provides over text-based analysis, and also highlighted the advantages of each of 2D and 3D for visual data analysis.Overseas Research Students Awards SchemeJames Watt Scholarshi

    Document-level sentiment analysis of email data

    Get PDF
    Sisi Liu investigated machine learning methods for Email document sentiment analysis. She developed a systematic framework that has been qualitatively and quantitatively proved to be effective and efficient in identifying sentiment from massive amount of Email data. Analytical results obtained from the document-level Email sentiment analysis framework are beneficial for better decision making in various business settings

    Visualization of large temporal social network datasets

    Get PDF
    Social network datasets consist of what sociologists call ‘social structures’, accumulation of all communication channels that social actors share ideas and information between each other. Social network analysis reveals characteristics and properties of social networks by applying specific metrics. Although, size of a real-life social network dataset can reach millions of relations belong to millions of social actors with large temporal dimension, existing information visualization tools can represent at most several thousands of these actors. This thesis presents a conceptual design study focused on visualization of large temporal social network datasets with a novel visualization method. Proposed technique combines Ideal Gas Law (IGL) with Jacob Moreno’s theory of The Cannon of Creativity to layout social network datasets in 3D hyperbolic space and can render 50,000 social actors at interactive speed. A proof-of-the-concept program is developed around this technique allowing users to perform several analysis tasks on temporal social network datasets. Users can explore the network, control the amount of visual clutter, and identify communication anomalies in run time. Moreover, they can search a specific actor and visually follow her communication pattern. The effectiveness of proposed technique is presented with case and usability studies performed using generated and real-life datasets. In particular the Enron email dataset (323,073 emails, 19,898 email addresses over four years) and 20 Newsgroups (44,797 postings, 20 news groups and 5417 email addresses over one month) datasets are analyzed

    Extracting temporal patterns from smart city data

    Get PDF
    Mestrado de dupla diplomação com a DULAY UNIVERSITYIn the modern world data and information become a powerful instrument of management, business, safety, medicine and others. The most fashionable sciences are the sciences which allow us to extract valuable knowledge from big volumes of information. Novel data processing techniques remains a trend for the last five years, in a way that continues to provide interesting results. This paper investigates the algorithms and approaches for processing smart city data, in particular, water consumption data for the city of Bragança, Portugal. Data from the last seven years was processed according to a rigorous methodology, that includes five stages: cleaning, preparation, exploratory analysis, identification of patterns and critical interpretation of the results. After understanding the data and choosing the best algorithms, a web-based data visualizing tools was developed, providing dashboards to geospatial data representation, useful in the decision making of municipalities.В современном мире данные и информация стали одним из самых мощных инстру- ментов в управлении, бизнесе, безопасности, медицине, науке и социальной сфере. Са- мыми модными и востребованными науками в настоящий момент являются науки, поз- воляющие извлекать полезные знания из больших объемов информации. Новые методы обработки данных остаются тенденцией последних пяти лет и продолжают генерировать интересные результаты. В данной работе исследуются алгоритмы и подходы для обработ-ки данных "умного города", в частности, данных о потреблении воды в городе Браганса, Португалия. Данные за последние семь лет обрабатывались в соответствии со строгой методологией, включающей пять этапов: очистка, подготовка, исследовательский анализ, выявление закономерностей и критическая интерпретация результатов. Цель исследова-ниия - определение шаблонов поведения в потрблении воды связанных с определенными событиями и построение модели прогнозова на основе найденных закономерностей. В результате исчерпывающего анализа с помощью множества методов было установлено отсутствие систематических зависимостей в рассматриваемом типе данных. На заключи-тельном этапе был создан инструмент визуализации данных, обеспечивающий динами-ческие панели для представления аналитических данных о распределении потребления. Разработанный инструмент управления аналитикой полезен для принятия решений му-ниципалитетом

    Deep learning with knowledge graphs for fine-grained emotion classification in text

    Get PDF
    This PhD thesis investigates two key challenges in the area of fine-grained emotion detection in textual data. More specifically, this work focuses on (i) the accurate classification of emotion in tweets and (ii) improving the learning of representations from knowledge graphs using graph convolutional neural networks.The first part of this work outlines the task of emotion keyword detection in tweets and introduces a new resource called the EEK dataset. Tweets have previously been categorised as short sequences or sentence-level sentiment analysis, and it could be argued that this should no longer be the case, especially since Twitter increased its allowed character limit. Recurrent Neural Networks have become a well-established method to classify tweets over recent years, but have struggled with accurately classifying longer sequences due to the vanishing and exploding gradient descent problem. A common technique to overcome this problem has been to prune tweets to a shorter sequence length. However, this also meant that often potentially important emotion carrying information, which is often found towards the end of a tweet, was lost (e.g., emojis and hashtags). As such, tweets mostly face also problems with classifying long sequences, similar to other natural language processing tasks. To overcome these challenges, a multi-scale hierarchical recurrent neural network is proposed and benchmarked against other existing methods. The proposed learning model outperforms existing methods on the same task by up to 10.52%. Another key component for the accurate classification of tweets has been the use of language models, where more recent techniques such as BERT and ELMO have achieved great success in a range of different tasks. However, in Sentiment Analysis, a key challenge has always been to use language models that do not only take advantage of the context a word is used in but also the sentiment it carries. Therefore the second part of this work looks at improving representation learning for emotion classification by introducing both linguistic and emotion knowledge to language models. A new linguistically inspired knowledge graph called RELATE is introduced. Then a new language model is trained on a Graph Convolutional Neural Network and compared against several other existing language models, where it is found that the proposed embedding representations achieve competitive results to other LMs, whilst requiring less pre-training time and data. Finally, it is investigated how the proposed methods can be applied to document-level classification tasks. More specifically, this work focuses on the accurate classification of suicide notes and analyses whether sentiment and linguistic features are important for accurate classification

    Visualisation of Long in Time Dynamic Networks on Large Touch Displays

    Get PDF
    Any dataset containing information about relationships between entities can be modelled as a network. This network can be static, where the entities/relationships do not change over time, or dynamic, where the entities/relationships change over time. Network data that changes over time, dynamic network data, is a powerful resource when studying many important phenomena, across wide-ranging fields from travel networks to epidemiology.However, it is very difficult to analyse this data, especially if it covers a long period of time (e.g, one month) with respect to its temporal resolution (e.g. seconds). In this thesis, we address the problem of visualising long in time dynamic networks: networks that may not be particularly large in terms of the number of entities or relationships, but are long in terms of the length of time they cover when compared to their temporal resolution.We first introduce Dynamic Network Plaid, a system for the visualisation and analysis of long in time dynamic networks. We design and build for an 84" touch-screen vertically-mounted display as existing work reports positive results for the use of these in a visualisation context, and that they are useful for collaboration. The Plaid integrates multiple views and we prioritise the visualisation of interaction provenance. In this system we also introduce a novel method of time exploration called ‘interactive timeslicing’. This allows the selection and comparison of points that are far apart in time, a feature not offered by existing visualisation systems. The Plaid is validated through an expert user evaluation with three public health researchers.To confirm observations of the expert user evaluation, we then carry out a formal laboratory study with a large touch-screen display to verify our novel method of time navigation against existing animation and small multiples approaches. From this study, we find that interactive timeslicing outperforms animation and small multiples for complex tasks requiring a compari-son between multiple points that are far apart in time. We also find that small multiples is best suited to comparisons of multiple sequential points in time across a time interval.To generalise the results of this experiment, we later run a second formal laboratory study in the same format as the first, but this time using standard-sized displays with indirect mouse input. The second study reaffirms the results of the first, showing that our novel method of time navigation can facilitate the visual comparison of points that are distant in time in a way that existing approaches, small multiples and animation, cannot. The study demonstrates that our previous results generalise across display size and interaction type (touch vs mouse).In this thesis we introduce novel representations and time interaction techniques to improve the visualisation of long in time dynamic networks, and experimentally show that our novel method of time interaction outperforms other popular methods for some task types
    corecore