the power of how-to queries joint work with dan suciu (university of washington) alexandra meliou

14
The Power of How-to Queries joint work with Dan Suciu (University of Washington) Alexandra Meliou

Upload: juniper-mclaughlin

Post on 16-Dec-2015

214 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: The Power of How-to Queries joint work with Dan Suciu (University of Washington) Alexandra Meliou

The Power of How-to Queries

joint work with Dan Suciu (University of Washington)

Alexandra Meliou

Page 2: The Power of How-to Queries joint work with Dan Suciu (University of Washington) Alexandra Meliou

Hypothetical (What-if) Queries

Brokerage companyDB

Key Performance Indicators (KPI)

Example from [Balmin et al. VLDB’00]:“An analyst of a brokerage company wants to know what would be the effect on the return of customers’ portfolios if during the last 3 years they had suggested Intel stocks instead of Motorola.”

change something in the source (hypothesis)

observe the effect in the target

forward

Page 3: The Power of How-to Queries joint work with Dan Suciu (University of Washington) Alexandra Meliou

How-To Queries

Brokerage companyDB

Key Performance Indicators (KPI)

Modified example:“An analyst wants to ask how to achieve a 10% return in customer portfolios, with the least number of changes.”

find changes to the source that achieve the desired effect

declare a desired effect in the target

reverse

Page 4: The Power of How-to Queries joint work with Dan Suciu (University of Washington) Alexandra Meliou

TPC-H example A manufacturing company keeps records of

inventory orders in a LineItem table. KPI: Cannot order more than 8% of the inventory

from any single country

Can reassign orders to new suppliers as long as the supplier can supply the part

Minimize the number of changes

(constraints)

(variables)

(optimization objective) constraint optimization

Page 5: The Power of How-to Queries joint work with Dan Suciu (University of Washington) Alexandra Meliou

extract data

Constraint Optimization on Big Data

DB

construct optimization model

this is for a set of 10 lineitems and 40

suppliers

Mixed Integer Programming (MIP)

solver

transform into data updates

MathProg

Impractical!

Page 6: The Power of How-to Queries joint work with Dan Suciu (University of Washington) Alexandra Meliou

Demo: Tiresias

a tool that makes how-to queries practical

Page 7: The Power of How-to Queries joint work with Dan Suciu (University of Washington) Alexandra Meliou

Tiresias: How-To Query Engine

DBMS

MIPsolver

Tiresias

TiQL (Tiresias Query Language)

Declarative interface, extension to Datalog

Page 8: The Power of How-to Queries joint work with Dan Suciu (University of Washington) Alexandra Meliou

Overview

MathProgor AMPL

TiQL

Visualizations

Page 9: The Power of How-to Queries joint work with Dan Suciu (University of Washington) Alexandra Meliou

MathProg or AMPL

TiQL

Visualizations

Overview

Demo

Page 10: The Power of How-to Queries joint work with Dan Suciu (University of Washington) Alexandra Meliou

Demo

MathProgor AMPL

TiQL

Visualizations

Overview

Language semantics

Evaluation of a TiQL program: Translation from TiQL to linear constraints

Performance optimizations

Page 11: The Power of How-to Queries joint work with Dan Suciu (University of Washington) Alexandra Meliou

Optimizing Performance Model optimizer

eliminates variables, constraints, and parameters uses key constraints, functional dependencies, and

provenance

Partitioning optimizerSignificantly faster than letting the MIP solver do it

Page 12: The Power of How-to Queries joint work with Dan Suciu (University of Washington) Alexandra Meliou

Evaluation of the Model Optimizer

baseline

with optimization

Page 13: The Power of How-to Queries joint work with Dan Suciu (University of Washington) Alexandra Meliou

Evaluation of Tiresias Partitioning

10k tuples 1M tuples

granularity of partitioning

complex dependency on the granularity of partitioning

Page 14: The Power of How-to Queries joint work with Dan Suciu (University of Washington) Alexandra Meliou

Next steps Non-partitionable problems and approximations Soft constraints, diversification of results Interactive visualizations, feedback-based problem

generation Applications

Business intelligence, strategy planning View updates Data cleaning

Take-aways Databases should support how-to queries Data-driven optimizations could benefit the

performance of external tools