1 research outputs found

    Data Quality Checking for Machine Learning with MeSQuaL

    No full text
    International audienceThis demo proposes MeSQuaL, a system for profiling and checking data quality before further tasks, such as data analytics and machine learning. MeSQuaL extends SQL for querying relational data with constraints on data quality and facilitates the verification of statistical tests. The system includes: (1) a query interpreter for SQuaL, the SQL-extended language we propose for declaring and querying data with data quality checks and statistical tests; (2) an extensible library of user-defined functions for profiling the data and computing various data quality indicators;and (3) a user interface for declaring data quality constraints,profiling data, monitoring data quality with SQuaL queries, and visualizing the results via data quality dashboards. We showcase our system in action with various scenarios on real-world datasets and show its usability for monitoring data quality over time and checking the quality of data on-deman
    corecore