power grid reduction by sparse convex optimization · 2018-03-27 · convex optimization. on-chip...

22
UT DA Wei Ye 1 , Meng Li 1 , Kai Zhong 2 , Bei Yu 3 , David Z. Pan 1 1 ECE Department, University of Texas at Austin 2 ICES, University of Texas at Austin 3 CSE Department, Chinese University of Hong Kong Power Grid Reduction by Sparse Convex Optimization

Upload: others

Post on 03-Jul-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Power Grid Reduction by Sparse Convex Optimization · 2018-03-27 · Convex Optimization. On-chip Power Delivery Network ... L E for large linear systems 3. Previous Work Power grid

UT DA

Wei Ye1, Meng Li1, Kai Zhong2, Bei Yu3, David Z. Pan1

1ECE Department, University of Texas at Austin2ICES, University of Texas at Austin

3CSE Department, Chinese University of Hong Kong

Power Grid Reduction by Sparse Convex Optimization

Page 2: Power Grid Reduction by Sparse Convex Optimization · 2018-03-27 · Convex Optimization. On-chip Power Delivery Network ... L E for large linear systems 3. Previous Work Power grid

On-chip Power Delivery Network Power grid

› Multi-layer mesh structure› Supply power for on-chip devices

Power grid verification› Verify current density in metal wires (EM)› Verify voltage drop on the grids› More expensive due to increasing sizes of grids

» e.g., 10M nodes, >3 days

1[Yassine+, ICCAD’16]

Page 3: Power Grid Reduction by Sparse Convex Optimization · 2018-03-27 · Convex Optimization. On-chip Power Delivery Network ... L E for large linear systems 3. Previous Work Power grid

Modeling Power Grid Circuit modeling

› Resistors to represent metal wires/vias› Current sources to represent current drawn by underlying devices› Voltage sources to represent external power supply› Transient: capacitors are attached from each node to ground

Port node: node attached current/voltage sources Non-port node: only has internal connection

2

Port node

Non-port node

Current source

Voltage source

Page 4: Power Grid Reduction by Sparse Convex Optimization · 2018-03-27 · Convex Optimization. On-chip Power Delivery Network ... L E for large linear systems 3. Previous Work Power grid

Linear System of Power Grid Resistive grid model:

› is Laplacian matrix (symmetric and diagonally-dominant):

› , denotes a physical conductance between two nodes and

A power grid is safe, if ∀ :

Long runtime to solve for large linear systems

3

Page 5: Power Grid Reduction by Sparse Convex Optimization · 2018-03-27 · Convex Optimization. On-chip Power Delivery Network ... L E for large linear systems 3. Previous Work Power grid

Previous Work

Power grid reduction› Reduce the size of power grid while preserving input-

output behavior› Trade-off between accuracy and reduction size

Topological methods› TICER [Sheehan+, ICCAD’99]

› Multigrid [Su+, DAC’03]

› Effective resistance [Yassine+, ICCAD’16]

Numerical methods› PRIMA [Odabasioglu+, ICCAD’97]

› Random sampling [Zhao+, ICCAD’14]

› Convex optimization [Wang+, DAC’15]4

Page 6: Power Grid Reduction by Sparse Convex Optimization · 2018-03-27 · Convex Optimization. On-chip Power Delivery Network ... L E for large linear systems 3. Previous Work Power grid

Input: › Large power grid› Current source values

Output: reduced power grid› Small› Sparse (as input grid)› Keep all the port nodes› Preserve the accuracy in terms of voltage drop error

Problem Definition

5

Page 7: Power Grid Reduction by Sparse Convex Optimization · 2018-03-27 · Convex Optimization. On-chip Power Delivery Network ... L E for large linear systems 3. Previous Work Power grid

Overall Flow

6

Node and edge set generation

Store reduced nodes and edges

Node elimination by Schur complement

Edge sparsification by GCD

Large graph partition

For each subgraph:

Page 8: Power Grid Reduction by Sparse Convex Optimization · 2018-03-27 · Convex Optimization. On-chip Power Delivery Network ... L E for large linear systems 3. Previous Work Power grid

Node Elimination Linear system: can be represented as a 2 2 block-matrix:

and can be represented as follows:

and 0 Applying Schur complement on the DC system:

which satisfies:

7

Page 9: Power Grid Reduction by Sparse Convex Optimization · 2018-03-27 · Convex Optimization. On-chip Power Delivery Network ... L E for large linear systems 3. Previous Work Power grid

Node Elimination (cont’d)

Output graph keeps all the nodes of interest Output graph is dense Edge sparsification: sparsify the reduced Laplacian without losing

accuracy

8

Node Elimination

Edge Sparsification

Page 10: Power Grid Reduction by Sparse Convex Optimization · 2018-03-27 · Convex Optimization. On-chip Power Delivery Network ... L E for large linear systems 3. Previous Work Power grid

Edge Sparsification Goal of edge sparsification

› Accuracy› Sparsity reduce the nonzero elements off-the-diagonal in L

Formulation (1):

Formulation (2): [Wang+, DAC2014]

9

L2 norm L1 norm

Page 11: Power Grid Reduction by Sparse Convex Optimization · 2018-03-27 · Convex Optimization. On-chip Power Delivery Network ... L E for large linear systems 3. Previous Work Power grid

Edge Sparsification Formulation (2): [DAC2014 Wang+]

Formulation (3):

› Weight vector:› Strongly convex and coordinate-wise Lipschitz smooth

10

Problem: accuracy on the Vdd node does not guarantee accuracy on the current source nodes

Page 12: Power Grid Reduction by Sparse Convex Optimization · 2018-03-27 · Convex Optimization. On-chip Power Delivery Network ... L E for large linear systems 3. Previous Work Power grid

Coordinate Descent (CD) Method Update one coordinate at each iteration Coordinate descent:

Set 1 and 0For a fixed number of iterations (or convergence is reached):

Choose a coordinate ,Compute the step size ∗ by minimizing

argmin f ,

Update , ← ,∗

How to decide the coordinate?› Cyclic (CCD)› Random sampling (RCD)› Greedy coordinate descent (GCD)

11

Page 13: Power Grid Reduction by Sparse Convex Optimization · 2018-03-27 · Convex Optimization. On-chip Power Delivery Network ... L E for large linear systems 3. Previous Work Power grid

CD vs Gradient Descent Gradient descent (GD) algorithm:

← GD/SGD update elements in and gradient matrix

at each iteration CD updates 1 elements in (Laplacian property) CD proves to update elements in for Formulation

(2) and (3).

12

Page 14: Power Grid Reduction by Sparse Convex Optimization · 2018-03-27 · Convex Optimization. On-chip Power Delivery Network ... L E for large linear systems 3. Previous Work Power grid

Greedy Coordinate Descent (GCD)

13

Max-heap

Input L

Output X

Page 15: Power Grid Reduction by Sparse Convex Optimization · 2018-03-27 · Convex Optimization. On-chip Power Delivery Network ... L E for large linear systems 3. Previous Work Power grid

GCD vs CCD

GCD produces sparser results› CCD (RCD) goes through all coordinates repeatedly› GCD selects the most significant coordinates to update

Iteration 1 Iteration 2 Iteration 3 Iteration 4 Iteration T

GCD:

CCD:

Input graph

Add an edgeUpdate an edge

0 5 10 15 20

100

101

102

103

104

105

Edge Weight

Edge

Cou

nt

CCDGCD

14

Page 16: Power Grid Reduction by Sparse Convex Optimization · 2018-03-27 · Convex Optimization. On-chip Power Delivery Network ... L E for large linear systems 3. Previous Work Power grid

GCD Coordinate Selection General Gauss-Southwell Rule:

Observation: the objective function is quadratic w.r.t. the chosen coordinate

GCD is stuck for some corner cases:

A new coordinate selection rule:

15

Page 17: Power Grid Reduction by Sparse Convex Optimization · 2018-03-27 · Convex Optimization. On-chip Power Delivery Network ... L E for large linear systems 3. Previous Work Power grid

GCD Speedup Time complexity is per iteration

› traverse elements to get the best index› As expensive as gradient descent

Observation: each node has at most neighbors → heap

Heap to store elements in : › Pick the largest gradient, 1› Update elements, log

Lookup table› space; 1 for each update

Improved time complexity log

16

Page 18: Power Grid Reduction by Sparse Convex Optimization · 2018-03-27 · Convex Optimization. On-chip Power Delivery Network ... L E for large linear systems 3. Previous Work Power grid

Experimental Results

Sparsity and accuracy trade-off Accuracy and runtime trade-off

17

Page 19: Power Grid Reduction by Sparse Convex Optimization · 2018-03-27 · Convex Optimization. On-chip Power Delivery Network ... L E for large linear systems 3. Previous Work Power grid

Gradient Descent Comparison

18

Sparsity Accuracy

Runtime

Page 20: Power Grid Reduction by Sparse Convex Optimization · 2018-03-27 · Convex Optimization. On-chip Power Delivery Network ... L E for large linear systems 3. Previous Work Power grid

Experimental Results

19

CKT ibmpg2 ibmpg3 ibmpg4 ibmpg5 ibmpg6

#Port Nodes

Before 19,173 100,988 133,622 270,577 380,991

After 19,173 100,988 133,622 270,577 380,991

#Non-port Nodes

Before 46,265 340,088 345,122 311,072 481,675

After 0 0 0 0 0

#Edges Before 106,607 724,184 779,946 871,182 1283,371

After 48,367 243,011 284,187 717,026 935,322

Error 1.2% 0.7% 4.8% 2.2% 2.0%

Runtime 38s 106s 132s 123s 281s

Page 21: Power Grid Reduction by Sparse Convex Optimization · 2018-03-27 · Convex Optimization. On-chip Power Delivery Network ... L E for large linear systems 3. Previous Work Power grid

Conclusion

Main Contributions:› An iterative power grid reduction framework› Weighted convex optimization-based formulation› A GCD algorithm with optimality guarantee and

runtime efficiency for edge sparsification

Future Work:› Extension to RC grid reduction

20

Page 22: Power Grid Reduction by Sparse Convex Optimization · 2018-03-27 · Convex Optimization. On-chip Power Delivery Network ... L E for large linear systems 3. Previous Work Power grid

Thanks