Search CORE

22,980 research outputs found

Predictive User Modeling with Actionable Attributes

Author: Pechenizkiy Mykola
Zliobaite Indre
Publication venue
Publication date: 01/01/2013
Field of study

Different machine learning techniques have been proposed and used for modeling individual and group user needs, interests and preferences. In the traditional predictive modeling instances are described by observable variables, called attributes. The goal is to learn a model for predicting the target variable for unseen instances. For example, for marketing purposes a company consider profiling a new user based on her observed web browsing behavior, referral keywords or other relevant information. In many real world applications the values of some attributes are not only observable, but can be actively decided by a decision maker. Furthermore, in some of such applications the decision maker is interested not only to generate accurate predictions, but to maximize the probability of the desired outcome. For example, a direct marketing manager can choose which type of a special offer to send to a client (actionable attribute), hoping that the right choice will result in a positive response with a higher probability. We study how to learn to choose the value of an actionable attribute in order to maximize the probability of a desired outcome in predictive modeling. We emphasize that not all instances are equally sensitive to changes in actions. Accurate choice of an action is critical for those instances, which are on the borderline (e.g. users who do not have a strong opinion one way or the other). We formulate three supervised learning approaches for learning to select the value of an actionable attribute at an instance level. We also introduce a focused training procedure which puts more emphasis on the situations where varying the action is the most likely to take the effect. The proof of concept experimental validation on two real-world case studies in web analytics and e-learning domains highlights the potential of the proposed approaches

arXiv.org e-Print Archive

Repository TU/e

Pure OAI Repository

Data analytics 2016: proceedings of the fifth international conference on data analytics

Author: Bhulai Sandjai
Semanjski Ivana
Publication venue: The International Academy, Research and Industry Association
Publication date: 01/01/2016
Field of study

VU Research Portal

Ghent University Academic Bibliography

Proactive Assessment of Accident Risk to Improve Safety on a System of Freeways, Research Report 11-15

Author: Nuworsoo Cornelius
Pande Anurag
Shew Cameron
Publication venue: SJSU ScholarWorks
Publication date: 01/05/2012
Field of study

This report describes the development and evaluation of real-time crash risk-assessment models for four freeway corridors: U.S. Route 101 NB (northbound) and SB (southbound) and Interstate 880 NB and SB. Crash data for these freeway segments for the 16-month period from January 2010 through April 2011 are used to link historical crash occurrences with real-time traffic patterns observed through loop-detector data. \u27The crash risk-assessment models are based on a binary classification approach (crash and non-crash outcomes), with traffic parameters measured at surrounding vehicle detection station (VDS) locations as the independent variables. The analysis techniques used in this study are logistic regression and classification trees. Prior to developing the models, some data-related issues such as data cleaning and aggregation were addressed. The modeling efforts revealed that the turbulence resulting from speed variation is significantly associated with crash risk on the U.S. 101 NB corridor. The models estimated with data from U.S. 101 NB were evaluated on the basis of their classification performance, not only on U.S. 101 NB, but also on the other three freeway segments for transferability assessment. It was found that the predictive model derived from one freeway can be readily applied to other freeways, although the classification performance decreases. The models that transfer best to other roadways were determined to be those that use the least number of VDSs–that is, those that use one upstream or downstream station rather than two or three.\ The classification accuracy of the models is discussed in terms of how the models can be used for real-time crash risk assessment. The models can be applied to developing and testing variable speed limits (VSLs) and ramp-metering strategies that proactively attempt to reduce crash risk

SJSU ScholarWorks

Data mining as a tool for environmental scientists

Author: Athanasiadis Ioannis
Comas Joaquim
Frank Eibe
Gibert Karina
Letcher Rebecca
Spate Jessica
Sànchez-Marrè Miquel
Publication venue: International Environmental Modelling and Software Society
Publication date: 01/01/2006
Field of study

Over recent years a huge library of data mining algorithms has been developed to tackle a variety of problems in fields such as medical imaging and network traffic analysis. Many of these techniques are far more flexible than more classical modelling approaches and could be usefully applied to data-rich environmental problems. Certain techniques such as Artificial Neural Networks, Clustering, Case-Based Reasoning and more recently Bayesian Decision Networks have found application in environmental modelling while other methods, for example classification and association rule extraction, have not yet been taken up on any wide scale. We propose that these and other data mining techniques could be usefully applied to difficult problems in the field. This paper introduces several data mining concepts and briefly discusses their application to environmental modelling, where data may be sparse, incomplete, or heterogenous

Research Commons@Waikato

Warranty Data Analysis: A Review

Author: Ahn
Alam
Attardi
Baik
Blischke
Blischke
Brennan
Buddhakulsomsiri
Buddhakulsomsiri
Chen
Chukova
Davis
Djamaludin
Duchesne
Elkins
Escobar
Fredette
Gertsbakh
Grabert
Honari
Hrycej
Hu
Hu
Hu
Hu
Ion
Iskandar
Jung
Kalbfleisch
Kalbfleisch
Kalbfleisch
Kalbfleisch
Kaminskiy
Karim
Karim
Karim
Karim
Kijima
Kleyner
Kleyner
Kleyner
Krivtsov
Lawless
Lawless
Lawless
Lawless
Lawless
Lawless
Majeske
Majeske
Majeske
Marcorin
Marshall
Meeker
Moskowitz
Murthy
Murthy
Murthy
Murthy
Oh
Pal
Phillips
Phillips
Phillips
Rahman
Rai
Rai
Rai
Robinson
Sahin
Singpurwalla
Singpurwalla
Sureka
Suzuki
Suzuki
Suzuki
Suzuki
Suzuki
Suzuki
Thomas
Thomas
Vinta
Vintr
Vittal
Wang
Wasserman
Wasserman
Wasserman
Wilson
Wu
Wu
Wu
Wu
Wu
Wu
Yang
Yang
Zuo
Publication venue: 'Wiley'
Publication date: 10/01/2012
Field of study

Warranty claims and supplementary data contain useful information about product quality and reliability. Analysing such data can therefore be of benefit to manufacturers in identifying early warnings of abnormalities in their products, providing useful information about failure modes to aid design modification, estimating product reliability for deciding on warranty policy and forecasting future warranty claims needed for preparing fiscal plans. In the last two decades, considerable research has been conducted in warranty data analysis (WDA) from several different perspectives. This article attempts to summarise and review the research and developments in WDA with emphasis on models, methods and applications. It concludes with a brief discussion on current practices and possible future trends in WDA

Crossref

Kent Academic Repository