Making Data Analysis Expertise Broadly Accessible through Workflows

Authors: A Gil, Hyunjoon Jo, Matheus Hauder, Ricky Sethi, Yan Liu
Publication date: 08/05/2012

The demand for advanced skills in data analysis spans many areas of science, computing, and business analytics. This paper discusses how non-expert users can reuse workflows that were created by experts and that represent complex data mining processes for text analytics. These include workflows for document classification, document clustering, and topic detection, all assembled from components available in well-known text analytics software libraries. The workflows expose to non-experts the expert-level knowledge of how these individual components need to be combined with data preparation and feature selection steps to make the underlying statistical learning algorithms most effective. The framework allows non-experts to experiment with different combinations of data analysis processes, represented as workflows of computations that they can easily reconfigure. We report on our experiences to date with having users who have limited knowledge of data analysis, and even of basic programming, apply workflows to their data.

CiteSeerX