Spatio-temporal Load Curve Data Cleansing and Imputation via Sparsity and Low Rank

Gonzalo Mateos and Georgios B. Giannakis

Dept. of ECE and Digital Technology Center, University of Minnesota

November 5, 2012

Workshop on Architectures and Models for the Smart Grid

Context

Robust imputation of network data

Goal: Given few rows per agent, perform distributed cleansing and imputation by leveraging low-rank of the nominal data matrix and sparsity of the outliers.

Network health cartography

Smart metering

Wind farm monitoring

Load curve data cleansing

Load curve: electric power consumption recorded periodically

Reliable data: key to realize smart grid vision [Hauser’09]

Uruguay’s aggregate power consumption (MW)

Missing data: Faulty meters, communication errors, few PMUs

Outliers: Unscheduled maintenance, strikes, sports events [Chen et al’10]

Spatio-temporal load profiles

Power measured at each bus, at each time instant

Spatio-temporal model:

Low-rank nominal load profiles

Sparse outliers across buses and time
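The low-rank-plus-sparse structure can be illustrated with synthetic data. This is a minimal sketch, not the authors' data generator: the dimensions, the rank `r`, the outlier rate and magnitude, and the small dense noise term `E` are all assumptions chosen for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
N, T, r = 20, 600, 3          # buses, time instants, rank (illustrative values)

# Low-rank nominal load profiles: product of two thin factors
X = rng.standard_normal((N, r)) @ rng.standard_normal((r, T))

# Sparse outliers: a few large spikes scattered across buses and time
O = np.zeros((N, T))
spikes = rng.random((N, T)) < 0.02
O[spikes] = 10.0 * rng.choice([-1.0, 1.0], size=spikes.sum())

# Small dense measurement noise (an assumption of this sketch)
E = 0.01 * rng.standard_normal((N, T))

Y = X + O + E
print(np.linalg.matrix_rank(X), np.count_nonzero(O) / O.size)
```

The printout reports the rank of the nominal component and the fraction of entries hit by outliers.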

Principal Component Pursuit

Principal component pursuit [Chandrasekaran et al’11], [Candes et al’11]

Assumptions: the nominal component has low rank; the outlier component is sparse

Goal: Given Y, recover the low-rank and sparse components

Data model

Missing data: set of observed entries

Sampling operator
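A sampling operator of this kind is usually implemented as an entrywise mask. The sketch below assumes a boolean `mask` that is `True` on observed entries; the function name `sample` and the toy matrix are hypothetical.

```python
import numpy as np

def sample(M, mask):
    """Keep entries where mask is True and set the rest to zero."""
    return np.where(mask, M, 0.0)

Y = np.arange(12, dtype=float).reshape(3, 4)
mask = np.array([[1, 0, 1, 1],
                 [0, 1, 1, 0],
                 [1, 1, 0, 1]], dtype=bool)
Y_obs = sample(Y, mask)
```

Applying the operator twice changes nothing, which is why a data-fit term can be evaluated on the observed entries only.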

Distributed processing paradigms

Limitations of FC-based architectures

Lack of robustness (isolated point of failure, non-ideal links)
High Tx power (as geographical area grows)
Less suitable for tracking applications

Incremental

Limitations of incremental processing

Non-robust to node failures
(Re-) routing? Hamiltonian routes NP-hard to establish

Fusion Center (FC)

In-network

Problem statement

Network of smart meters: undirected, connected graph

Challenges

Nuclear norm is not separable
Global optimization variable

Goal: Given local data per node and single-hop exchanges, find the solution of (P1)

Separable regularization

Key property; e.g., [Recht et al’11]
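Assuming the key property meant here is the well-known factorization form of the nuclear norm, namely that the nuclear norm of X equals the minimum of (‖P‖_F² + ‖Q‖_F²)/2 over all factorizations X = PQᵀ, it can be checked numerically. The matrix sizes and the names `P`, `Q`, `A` below are assumptions of this sketch.

```python
import numpy as np

rng = np.random.default_rng(1)
X = rng.standard_normal((8, 3)) @ rng.standard_normal((3, 12))   # rank-3 matrix

U, s, Vt = np.linalg.svd(X, full_matrices=False)
nuclear = s.sum()                       # nuclear norm = sum of singular values

# Balanced factors built from the SVD attain the minimum
P = U * np.sqrt(s)
Q = Vt.T * np.sqrt(s)
balanced = 0.5 * (np.linalg.norm(P) ** 2 + np.linalg.norm(Q) ** 2)

# Any other factorization of the same X costs at least as much
k = len(s)
A = np.eye(k) + 0.1 * rng.standard_normal((k, k))   # well-conditioned mixing matrix
P2, Q2 = P @ A, Q @ np.linalg.inv(A).T
other = 0.5 * (np.linalg.norm(P2) ** 2 + np.linalg.norm(Q2) ** 2)

print(nuclear, balanced, other)
```

The Frobenius-norm surrogate is a sum over the rows of the factors, which is what makes the regularizer separable across nodes that each hold a few rows.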

New formulation equivalent to (P1)

Nonconvex; reduces complexity:

L × ρ, ρ ≥ rank[X]

Proposition 1. If a stationary point of (P2) satisfies the accompanying condition, then it yields a global optimum of (P1).
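For orientation, problems of the kind labeled (P1) and (P2) are commonly written as follows in the principal component pursuit literature with missing data. This is a sketch under assumed notation rather than a transcription of the slides: the sampling operator symbol, the weights \lambda_* and \lambda_1, the factor names P and Q, and the outlier matrix O are assumptions; only Y, X, ρ and the labels (P1), (P2) appear in the text.

```latex
\begin{align*}
\text{(P1)}\quad & \min_{X,\,O}\ \tfrac{1}{2}\,\bigl\|\mathcal{P}_{\Omega}(Y - X - O)\bigr\|_F^2
   + \lambda_*\|X\|_* + \lambda_1\|O\|_1 \\
\text{(P2)}\quad & \min_{P,\,Q,\,O}\ \tfrac{1}{2}\,\bigl\|\mathcal{P}_{\Omega}(Y - PQ^{\top} - O)\bigr\|_F^2
   + \tfrac{\lambda_*}{2}\bigl(\|P\|_F^2 + \|Q\|_F^2\bigr) + \lambda_1\|O\|_1
\end{align*}
```

Under this notation, (P2) follows from (P1) by substituting X = PQ^{\top} with factors of ρ columns each and replacing the nuclear norm by its Frobenius-norm characterization.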

Distributed estimator

Network connectivity: (P2) and (P3) are equivalent

Consensus with neighboring nodes
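Equality constraints between single-hop neighbors force network-wide agreement only when the graph is connected, which is why connectivity is assumed. A minimal sketch of a connectivity check by breadth-first search follows; the adjacency format and the example graphs `ring` and `split` are hypothetical.

```python
from collections import deque

def is_connected(neighbors):
    """Breadth-first search over an undirected graph given as {node: set of neighbors}."""
    if not neighbors:
        return True
    start = next(iter(neighbors))
    seen = {start}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for nb in neighbors[node]:
            if nb not in seen:
                seen.add(nb)
                queue.append(nb)
    return len(seen) == len(neighbors)

ring = {0: {1, 3}, 1: {0, 2}, 2: {1, 3}, 3: {2, 0}}    # four meters in a ring
split = {0: {1}, 1: {0}, 2: {3}, 3: {2}}               # two isolated pairs
print(is_connected(ring), is_connected(split))
```

On the `split` graph, neighbor-wise consensus would let the two pairs settle on different values, so the consensus reformulation would no longer match the centralized problem.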

Alternating-directions method of multipliers (ADMM) solver

Method [Glowinski-Marrocco’75], [Gabay-Mercier’76]
Learning over networks [Schizas et al’07]

Primal variables per agent:

Message passing:

Distributed iterations

Highly parallelizable with simple recursions

Unconstrained QPs per agent
No SVD per iteration [O(Tρ^3) complexity]
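To make the "unconstrained QPs, no SVD" point concrete, here is a centralized alternating-minimization sketch for a factorized cost of the (P2) type. It is not the distributed ADMM recursion of the slides: the cost function, the weights `lam_star` and `lam_1`, the mask convention (1 = observed), and all function names are assumptions of this sketch. Each factor update is a set of small ridge regressions, and the outlier update is a soft-threshold.

```python
import numpy as np

def soft(Z, tau):
    """Entrywise soft-thresholding, the proximal operator of tau * |.|_1."""
    return np.sign(Z) * np.maximum(np.abs(Z) - tau, 0.0)

def cost(Y, M, P, Q, O, lam_star, lam_1):
    R = M * (Y - P @ Q.T - O)                      # residual on observed entries
    return (0.5 * np.sum(R ** 2)
            + 0.5 * lam_star * (np.sum(P ** 2) + np.sum(Q ** 2))
            + lam_1 * np.sum(np.abs(O)))

def ridge_rows(B, F, W, lam):
    """Row i solves a ridge regression of B[i] on F over entries with W[i] == 1."""
    out = np.empty((B.shape[0], F.shape[1]))
    eye = lam * np.eye(F.shape[1])
    for i in range(B.shape[0]):
        Fw = F * W[i][:, None]                     # zero out rows for missing entries
        out[i] = np.linalg.solve(Fw.T @ F + eye, Fw.T @ B[i])
    return out

def alt_min(Y, M, rho, lam_star, lam_1, iters=50, seed=0):
    rng = np.random.default_rng(seed)
    N, T = Y.shape
    P = rng.standard_normal((N, rho))
    Q = rng.standard_normal((T, rho))
    O = np.zeros((N, T))
    history = [cost(Y, M, P, Q, O, lam_star, lam_1)]
    for _ in range(iters):
        P = ridge_rows(Y - O, Q, M, lam_star)          # one rho-by-rho solve per row
        Q = ridge_rows((Y - O).T, P, M.T, lam_star)    # one rho-by-rho solve per column of Y
        O = M * soft(Y - P @ Q.T, lam_1)               # outliers only on observed entries
        history.append(cost(Y, M, P, Q, O, lam_star, lam_1))
    return P, Q, O, history

# Toy usage on synthetic data (all values illustrative)
rng = np.random.default_rng(0)
N, T, r = 15, 100, 2
X_true = rng.standard_normal((N, r)) @ rng.standard_normal((r, T))
O_true = np.zeros((N, T))
O_true[rng.random((N, T)) < 0.03] = 8.0
M = (rng.random((N, T)) < 0.7).astype(float)           # about 70% of entries observed
Y = M * (X_true + O_true + 0.01 * rng.standard_normal((N, T)))
P, Q, O_hat, history = alt_min(Y, M, rho=4, lam_star=0.5, lam_1=0.5)
```

Because every block update is an exact minimization, the recorded cost never increases. The problem is nonconvex, so the sketch offers no guarantee about which stationary point is reached.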

Low overhead for message exchanges: the exchanged quantities are small

Communication cost independent of network size

Recap:

(P1): Centralized, convex
(P2): Separable regularization, nonconvex
(P3): Consensus, nonconvex

Stationary (P3) => Stationary (P2) => Global (P1)

Attractive features

Optimality

Proposition 2. If the distributed iterates converge and the limit satisfies the accompanying condition, then:

ii) the resulting estimate is the global optimum of (P1).

ADMM can converge even for non-convex problems, e.g., [Boyd et al’11]

Simple distributed algorithm for principal component pursuit

Centralized performance guarantees carry over

Synthetic data

Random network, N={15,20,25}, T=600


NorthWrite data

Power consumption of schools, government building, grocery store (’05-’10)

Data: courtesy of NorthWrite Energy Group, provided by Prof. V. Cherkassky (UofM)

Cleansing. Outliers: “Building operational transition shoulder periods”

Imputation. Prediction error: 6% for 30% missing data (8% for 50%)

Concluding summary

Load curve data cleansing and imputation

Leveraging sparsity and low rank

Principal component pursuit for smart grid monitoring

Estimate cleansed nominal load profiles

Identify when and where ‘bad data’ occur

Distributed algorithm with guaranteed performance

Ongoing research:

Convergence of ADMM for bi-convex costs
Real-time (adaptive) algorithms

Thank You!
