10 research outputs found

    Semi-automatic Maintenance of Regression Models: an Application in the Steel Industry

    Get PDF
    Software applications used in the controlling and planning of production processes commonly make use of predictive statistical models. Changes in the process involve a more or less regular need for updating the prediction models on which the operational software applications are based. The objective of this article is • to provide information which helps to design semiautomatic systems for the maintenance of statistical prediction models and • to describe a proof-of-concept implementation in an industrial application. The system developed processes the production data and provides an easy-to-use interface to construct updated models and introduce them into a software application. The article presents the architecture of the maintenance system, with a description of the algorithms that cause the system’s functionality. The system developed was implemented for keeping up-to-date prediction models which are in everyday use in a steel plate mill in the planning of the mechanical properties of steel products. The conclusion of the results is that the semi-automatic approach proposed is competitive with fully automatic and manual approaches. The benefits include good prediction accuracy and decreased workload of the deployment of updated model versions

    A top-down approach for creating and implementing data mining solutions

    No full text
    Abstract The information age is characterized by ever-growing amounts of data surrounding us. By reproducing this data into usable knowledge we can start moving toward the knowledge age. Data mining is the science of transforming measurable information into usable knowledge. During the data mining process, the measurements pass through a chain of sophisticated transformations in order to acquire knowledge. Furthermore, in some applications the results are implemented as software solutions so that they can be continuously utilized. It is evident that the quality and amount of the knowledge formed is highly dependent on the transformations and the process applied. This thesis presents an application independent concept that can be used for managing the data mining process and implementing the acquired results as software applications. The developed concept is divided into two parts – solution formation and solution implementation. The first part presents a systematic way for finding a data mining solution from a set of measurement data. The developed approach allows for easier application of a variety of algorithms to the data, manages the work chain, and differentiates between the data mining tasks. The method is based on storage of the data between the main stages of the data mining process, where the different stages of the process are defined on the basis of the type of algorithms applied to the data. The efficiency of the process is demonstrated with a case study presenting new solutions for resistance spot welding quality control. The second part of the concept presents a component-based data mining application framework, called Smart Archive, designed for implementing the solution. The framework provides functionality that is common to most data mining applications and is especially suitable for implementing applications that process continuously acquired measurements. The work also proposes an efficient algorithm for utilizing cumulative measurement data in the history component of the framework. Using the framework, it is possible to build high-quality data mining applications with shorter development times by configuring the framework to process application-specific data. The efficiency of the framework is illustrated using a case study presenting the results and implementation principles of an application developed for predicting steel slab temperatures in a hot strip mill. In conclusion, this thesis presents a concept that proposes solutions for two fundamental issues of data mining, the creation of a working data mining solution from a set of measurement data and the implementation of it as a stand-alone application

    Smart archive: a component-based data mining application framework

    No full text
    Implementation of data mining applications is a challenging and complicated task, and the applications are often built from scratch. In this paper, a component-based application framework, called Smart Archive (SA) designed for implementing data mining applications, is presented. SA provides functionality common to most data mining applications and components for utilizing history information. Using SA, it is possible to build high-quality applications with shorter development times by configuring the framework to process application-specific data. The architecture, the components, the implementation and the design principles of the framework are presented. The advantages of a framework-based implementation are demonstrated by presenting a case study which compares the framework approach to implementing a real-world application with the option of building an equivalent application from scratch. In conclusion, the paper presents a lucid framework for creating data mining applications and illustrates the importance and advantages of using the presented approach

    Exercise energy expenditure estimation based on acceleration data using the linear mixed model

    No full text
    This paper introduces a novel algorithm for estimating energy expenditure during physical activity. The estimation is based on acceleration data measured from a wrist-worn accelerometer. Simultaneous measurements of acceleration and oxygen consumption using a biaxial accelerometer and a breath gas analyzer were made during four different ac-tivities: walking, running, Nordic walking and bicycling. A variance feature is used to compress the original accel-eration signals. A linear mixed model is fitted to the data to estimate oxygen consumption based on the acceleration data. Lagged values of acceleration are used to take the de-layed effect of physical activity on oxygen consumption into consideration. The algorithm also uses information on the height of the subjects. Oxygen consumption is estimated at 15-second intervals and energy expenditure is directly cal-culated from the oxygen consumption. Based on the experi-mental data gathered from 10 subjects, a new algorithm for estimating energy expenditure is suggested. It is shown that the method estimates energy expenditure very accurately. In walking, running and Nordic walking the model underesti-mates energy expenditure by 13, 2 and 9 percent, respec-tively, and in bicycling energy expenditure is overestimated by 7 percent. Thus, the new approach is a very promising method for estimating energy expenditure.
    corecore