1 research outputs found

    Building a scientific workflow framework to enable realā€time machine learning and visualization

    Get PDF
    Nowadays, we have entered the era of big data. In the area of high performance computing, largeā€scale simulations can generate huge amounts of data with potentially critical information. However, these data are usually saved in intermediate files and are not instantly visible until advanced data analytics techniques are applied after reading all simulation data from persistent storages (eg, local disks or a parallel file system). This approach puts users in a situation where they spend long time on waiting for running simulations while not knowing the status of the running job. In this paper, we build a new computational framework to couple scientific simulations with multiā€step machine learning processes and inā€situ data visualizations. We also design a new scalable simulationā€time clustering algorithm to automatically detect fluid flow anomalies. This computational framework is built upon different software components and provides plugā€in data analysis and visualization functions over complex scientific workflows. With this advanced framework, users can monitor and get realā€time notifications of special patterns or anomalies from ongoing extremeā€scale turbulent flow simulations
    corecore