245,171 research outputs found

    Big Data Analytics for Smart Cities: The H2020 CLASS Project

    Get PDF
    Applying big-data technologies to field applications has resulted in several new needs. First, processing data across a compute continuum spanning from cloud to edge to devices, with varying capacity, architecture etc. Second, some computations need to be made predictable (real-time response), thus supporting both data-in-motion processing and larger-scale data-at-rest processing. Last, employing an event-driven programming model that supports mixing different APIs and models, such as Map/Reduce, CEP, sequential code, etc.The research leading to these results has received funding from the European Union’s Horizon 2020 Programme under the CLASS Project (www.class-project.eu), grant agreement No. 780622.Peer ReviewedPostprint (author's final draft

    SPL: An extensible language for distributed stream processing

    Get PDF
    Big data is revolutionizing how all sectors of our economy do business, including telecommunication, transportation, medical, and finance. Big data comes in two flavors: data at rest and data in motion. Processing data in motion is stream processing. Stream processing for big data analytics often requires scale that can only be delivered by a distributed system, exploiting parallelism on many hosts and many cores. One such distributed stream processing system is IBM Streams. Early customer experience with IBM Streams uncovered that another core requirement is extensibility, since customers want to build high-performance domain-specific operators for use in their streaming applications. Based on these two core requirements of distribution and extensibility, we designed and implemented the Streams Processing Language (SPL). This article describes SPL with an emphasis on the language design, distributed runtime, and extensibility mechanism. SPL is now the gateway for the IBM Streams platform, used by our customers for stream processing in a broad range of application domains. © 2017 ACM

    Cyber-Physical Systems Technologies: Applications in Industry and Education

    Get PDF
    Industry 4.0 concept development forms new trends as cloud computing,  big data analysis, the industrial internet of things, machine-to-machine technologies. Cyber-physical systems (CPS) paradigm is based on these trends and integrates of computation, networking and physical processes. Synergy Center at Peter the Great St. Petersburg Polytechnic University works in the areas of intelligent systems for data processing and control, motion control systems for robotics, complex automation and mechatronics as components of CPS. Keywords: Industry 4.0, Cyber-physical systems, Digital twin; intelligent control system, automation, Global digitalisation, Practical-oriented online courses, Skills training, Joint international educational programmes

    Development of HU Cloud-based Spark Applications for Streaming Data Analytics

    Get PDF
    Nowadays, streaming data overflows from various sources and technologies such as Internet of Things (IoT), making conventional data analytics methods unsuitable to manage the latency of data processing relative to the growing demand for high processing speed and algorithmically scalability [1]. Real-time streaming data analytics, which processes data while it is in motion, is required to allow many organizations to analyze streaming data effectively and efficiently for being more active in their strategies. To analyze real time “Big” streaming data, parallel and distributed computing over a cloud of computers has become a mainstream solution to allow scalability, resiliency to failure, and fast processing of massive data sets. Several open source data analytics frameworks have been proposed and developed for streaming data analytics successfully. Apache Spark is one such framework being developed at the University of California, Berkley and gains lots of attentions due to reducing IO by storing data in a memory and a unique data executing model. In Computer & Information Sciences (CISC) at Harrisburg University (HU), we have been working on building a private Cloud Computing for future research and planning to involve industry collaboration where high volumes of real time streaming data are used to develop solutions to practical problems in industry. By developing a HU Cloud based environment for Apache Spark applications for streaming data analytics with batch processing on Hadoop Distributed File System (HDFS), we can prepare future big data era that can turn big data into beneficial actions for industry needs. This research aims to develop Spark applications supporting an entire streaming data analytics workflow, which consists of data ingestion, data analytics, data visualization and data storing. In particular, we will focus on a real time stock recommender system based on state-of-the-art Machine Learning (ML)/Deep Learning (DL) frameworks such as mllib, TensorFlow, Apache mxnet and pytorch. The plan is to gather real time stock market data from Google/Yahoo finance data streams to build a model to predict a future stock market trend. The proposed Spark applications on the HU cloud-based architecture will give emphasis to finding time-series forcating module for a specific period, typically based on selected attributes. In addition, we will test scale-out architecture, efficient parallel processing and fault tolerance of Spark applications on the HU Cloud based HDFS. We believe that this research will bring the CISC program at HU significant competitive advantages globally

    Deformation analysis of a metropolis from C- to X-band PSI: proof-of-concept with Cosmo-Skymed over Rome, Italy

    Get PDF
    Stability of monuments and subsidence of residential quarters in Rome (Italy) are depicted based on geospatial analysis of more than 310,000 Persistent Scatterers (PS) obtained from Stanford Method for Persistent Scatterers (StaMPS) processing of 32 COSMO-SkyMed 3m-resolution HH StripMap ascending mode scenes acquired between 21 March 2011 and 10 June 2013. COSMO-SkyMed PS densities and associated displacement velocities are compared with almost 20 years of historical C-band ERS- 1/2, ENVISAT and RADARSAT-1/2 imagery. Accounting for differences in image processing algorithms and satellite acquisition geometries, we assess the feasibility of ground motion monitoring in big cities and metropolitan areas by coupling newly acquired and legacy SAR in full time series. Limitations and operational benefits of the transition from medium resolution C-band to high resolution X-band PS data are discussed, alongside the potential impact on the management of expanding urban environments

    Investigating How Speech And Animation Realism Influence The Perceived Personality Of Virtual Characters And Agents

    Get PDF
    The portrayed personality of virtual characters and agents is understood to influence how we perceive and engage with digital applications. Understanding how the features of speech and animation drive portrayed personality allows us to intentionally design characters to be more personalized and engaging. In this study, we use performance capture data of unscripted conversations from a variety of actors to explore the perceptual outcomes associated with the modalities of speech and motion. Specifically, we contrast full performance-driven characters to those portrayed by generated gestures and synthesized speech, analysing how the features of each influence portrayed personality according to the Big Five personality traits. We find that processing speech and motion can have mixed effects on such traits, with our results highlighting motion as the dominant modality for portraying extraversion and speech as dominant for communicating agreeableness and emotional stability. Our results can support the Extended Reality (XR) community in development of virtual characters, social agents and 3D User Interface (3DUI) agents portraying a range of targeted personalities

    The Issue Of Internet Polling

    Get PDF
    Surveys, polls, and focus groups are common phenomena in our daily lives. We live in a world where big data is big business. Large decisions hinge on the accuracy and predicative power of these numbers. Therefore, it should not be surprising that there is a market for the malicious manipu-1ation of data. Extreme care must be taken in the collection, checking, and processing of data to prevent decisions from being made on incorrect as­sumptions. In order to demonstrate the full potential and possible impact of these attacks, I shall provide the following example: John Doe is a member of the United States Senate. In recent years, the political pressure to make a preemptive strike against a potential nu­clear threat has grown exponentially. In some of the more extreme cases, several senators have begun asking for support to make a motion to the President for military intervention. Eventually, Senator Doe is asked to sign a petition for their cause. Senator Doe decides that he must take the concerns, priorities, and beliefs of the voters in his state into account be­fore he can make a decision as their representative

    Deformation analysis of a metropolis from C- to X-band PSI: proof-of-concept with Cosmo-Skymed over Rome, Italy

    Get PDF
    Stability of monuments and subsidence of residential quarters in Rome (Italy) are depicted based on geospatial analysis of more than 310,000 Persistent Scatterers (PS) obtained from Stanford Method for Persistent Scatterers (StaMPS) processing of 32 COSMO-SkyMed 3m-resolution HH StripMap ascending mode scenes acquired between 21 March 2011 and 10 June 2013. COSMO-SkyMed PS densities and associated displacement velocities are compared with almost 20 years of historical C-band ERS- 1/2, ENVISAT and RADARSAT-1/2 imagery. Accounting for differences in image processing algorithms and satellite acquisition geometries, we assess the feasibility of ground motion monitoring in big cities and metropolitan areas by coupling newly acquired and legacy SAR in full time series. Limitations and operational benefits of the transition from medium resolution C-band to high resolution X-band PS data are discussed, alongside the potential impact on the management of expanding urban environments

    Implementação de um laboratório de Big Data para processamento de dados em batch e streaming

    Get PDF
    Trabalho apresentado em XXX Jornadas Luso-Espanholas de Gestão Científica, 5-8 fevereiro 2020, Bragança, PortugalBig Data é uma área que pretende proporcionar capacidade de processamento dos dados, face ao crescimento exponencial de informação gerada de dia para dia, através de novas tecnologias para recolha, transformação, processamento e análise de dados provenientes de diversas fontes e em diversos formatos. Os desafios do Big Data são significativos, daí terem surgido diversas tecnologias num curto espaço de tempo, o que torna também desafiante a entrada nesta área de estudo/investigação. Este artigo apresenta um projeto de implementação de um laboratório de Big Data, para processamento de dados históricos e em movimento (streaming), cujo propósito é permitir a utilização/exploração das tecnologias associadas em atividades de ensino e investigação. São apresentadas as tecnologias, a arquitetura implementada e testes de processamento de dados realizados para validação da correta configuração e funcionamento do laboratório.Big Data is a field that aims to provide data processing capacity, facing the exponential growth of information generated daily, through new technologies for collecting, transforming, processing and analysing data from various sources and in various formats. The challenges of Big Data are significant, so many technologies have emerged in a short time, making the entry into this area of study / research challenging as well. This paper presents a project for the implementation of a big data laboratory for processing historical and data in motion (streaming), whose purpose is to allow the use / exploitation of associated technologies in teaching and research activities. The technologies, the implemented architecture and data processing tests performed to validate the correct configuration and operation of the laboratory are presented.info:eu-repo/semantics/publishedVersio
    corecore