peeling algorithm in financial risk analysis · peeling algorithm in financial risk analysis aijun...

21
Peeling Methodology Radar-chart Visualization Peeling Algorithm in Financial Risk Analysis Aijun Zhang University of Michigan and Bank of America October, 2008 A joint work with Agus Sudjianto, Bank of America Aijun Zhang, October 2008 Peeling Algorithm in Financial Risk Page. 1

Upload: others

Post on 09-Mar-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Peeling Algorithm in Financial Risk Analysis · Peeling Algorithm in Financial Risk Analysis Aijun Zhang University of Michigan and Bank of America October, 2008 ... Radar-chart Visualization

Peeling MethodologyRadar-chart Visualization

Peeling Algorithm in Financial Risk Analysis

Aijun Zhang

University of Michigan and Bank of America

October, 2008

A joint work with Agus Sudjianto, Bank of America

Aijun Zhang, October 2008 Peeling Algorithm in Financial Risk Page. 1

Page 2: Peeling Algorithm in Financial Risk Analysis · Peeling Algorithm in Financial Risk Analysis Aijun Zhang University of Michigan and Bank of America October, 2008 ... Radar-chart Visualization

Peeling MethodologyRadar-chart Visualization

Outline of the talk

Introduction

Peeling MethodologyPrincipal Direction of AnomalyMD-based Peeling Algorithm

Radar-chart VisualizationCBSA GeoRiskTracking Financial Storm

Discussion

Aijun Zhang, October 2008 Peeling Algorithm in Financial Risk Page. 2

Page 3: Peeling Algorithm in Financial Risk Analysis · Peeling Algorithm in Financial Risk Analysis Aijun Zhang University of Michigan and Bank of America October, 2008 ... Radar-chart Visualization

Peeling MethodologyRadar-chart Visualization

Introduction

I Univariate outlier detection: e.g. Box-plot, QQ-plot

>> boxplot(zscore(X), ’PARAM’, val,...);

>> qqplot(zscore(X(:,j)));

I Multivariate outlier detection: Mahalanobis distance

MDi(µ,Σ) = (xi − µ)TΣ−1(xi − µ)

1. Often used are the sample mean x̄ and covariance Σ̂2. MD ∼ χ2

p asymptotically, for x ∼ Np(µ,Σ)3. However, problem with masking and swamping effects ...

Aijun Zhang, October 2008 Peeling Algorithm in Financial Risk Page. 3

Page 4: Peeling Algorithm in Financial Risk Analysis · Peeling Algorithm in Financial Risk Analysis Aijun Zhang University of Michigan and Bank of America October, 2008 ... Radar-chart Visualization

Peeling MethodologyRadar-chart Visualization

Univariate case: Box-plot and QQ-plot

Aijun Zhang, October 2008 Peeling Algorithm in Financial Risk Page. 4

Page 5: Peeling Algorithm in Financial Risk Analysis · Peeling Algorithm in Financial Risk Analysis Aijun Zhang University of Michigan and Bank of America October, 2008 ... Radar-chart Visualization

Peeling MethodologyRadar-chart Visualization

Multivariate case: Masking and Swamping

Aijun Zhang, October 2008 Peeling Algorithm in Financial Risk Page. 5

Page 6: Peeling Algorithm in Financial Risk Analysis · Peeling Algorithm in Financial Risk Analysis Aijun Zhang University of Michigan and Bank of America October, 2008 ... Radar-chart Visualization

Peeling MethodologyRadar-chart Visualization

Our development

I We propose a sequential method, called the peeling algorithm

1. Reasoning from projection pursuit2. MD-based peeling algorithm3. Visualization by polar coordinates

I Ideas mostly originated from real applications in financial risk

a. Anti-Money Laundering projectb. CBSA GeoRisk visualizationc. Recent storm from Wall Street

I Examples will be provided throughout the talk ...

Aijun Zhang, October 2008 Peeling Algorithm in Financial Risk Page. 6

Page 7: Peeling Algorithm in Financial Risk Analysis · Peeling Algorithm in Financial Risk Analysis Aijun Zhang University of Michigan and Bank of America October, 2008 ... Radar-chart Visualization

Peeling MethodologyRadar-chart Visualization

Principal Direction of AnomalyMD-based Peeling Algorithm

Introduction

Peeling MethodologyPrincipal Direction of AnomalyMD-based Peeling Algorithm

Radar-chart VisualizationCBSA GeoRiskTracking Financial Storm

Discussion

Aijun Zhang, October 2008 Peeling Algorithm in Financial Risk Page. 7

Page 8: Peeling Algorithm in Financial Risk Analysis · Peeling Algorithm in Financial Risk Analysis Aijun Zhang University of Michigan and Bank of America October, 2008 ... Radar-chart Visualization

Peeling MethodologyRadar-chart Visualization

Principal Direction of AnomalyMD-based Peeling Algorithm

Problem Setup

Sphering: For {xi}ni=1 in Rp, consider the “sphered” data,

zi = Σ−1/2(xi − x̄), i = 1, . . . , n

given any Σ > 0 (positive definite).

Projection: For w ∈ Rp with ||w|| = 1, the “projected“ data

{wT z1, . . . ,wT zn}, in 1-D.

Question: What is the best direction w∗ that would separateclearly the outlying observations in 1-D space?

Aijun Zhang, October 2008 Peeling Algorithm in Financial Risk Page. 8

Page 9: Peeling Algorithm in Financial Risk Analysis · Peeling Algorithm in Financial Risk Analysis Aijun Zhang University of Michigan and Bank of America October, 2008 ... Radar-chart Visualization

Peeling MethodologyRadar-chart Visualization

Principal Direction of AnomalyMD-based Peeling Algorithm

Principal Direction of Anomaly

Suppose ∃100α% outliers, define the separation Score(w, α) by

n∑i=1

{1

(wT zi − qw,α

)+

+1

n(1− α)

(wT zi − qw,α

)−

}where qw,α = (1− α)-th quantile of projected data. Then,

PDA: w∗α = arg max||w||=1

Score(w, α), α ∈ (0, 0.5]

Aijun Zhang, October 2008 Peeling Algorithm in Financial Risk Page. 9

Page 10: Peeling Algorithm in Financial Risk Analysis · Peeling Algorithm in Financial Risk Analysis Aijun Zhang University of Michigan and Bank of America October, 2008 ... Radar-chart Visualization

Peeling MethodologyRadar-chart Visualization

Principal Direction of AnomalyMD-based Peeling Algorithm

TheoremGiven the projected data D0 = {z1, . . . , zn}, let D ⊂ D0, then

Score(w, α) =1

nα(1− α)

max|D|=dnαe

∑z∈D

wT z− {nα}qw,α

.

For α = j/n with j = 1, . . . , bn/2c, the score is bounded from above byScore(w, j/n) ≤ n

n−j∥∥z̄∗

(1:j)

∥∥, wherez̄∗(1:j) =

1

j

∑z∈D∗

z, D∗ = arg max|D|=j

∥∥∥∥ ∑z∈D

z

∥∥∥∥.The maximum score is attained by the PDA w∗

j/n∝ z̄∗

(1:j).

Aijun Zhang, October 2008 Peeling Algorithm in Financial Risk Page. 10

Page 11: Peeling Algorithm in Financial Risk Analysis · Peeling Algorithm in Financial Risk Analysis Aijun Zhang University of Michigan and Bank of America October, 2008 ... Radar-chart Visualization

Peeling MethodologyRadar-chart Visualization

Principal Direction of AnomalyMD-based Peeling Algorithm

Corollary 1: Set α = 1/n and let z∗ ← maxi ‖zi‖ with maximal Euclidean distance.Then, the PDA is given by

w∗1/n = arg max‖w‖2=1

Score(w, 1/n) = z∗/‖z∗‖

Corollary 2: Based on the raw data D0 = {xi}ni=1, the separation score

Score(w, j/n; Σ) =n

j(n− j)max|D|=j

∑x∈D

wTΣ−1/2(x− x̄), D ⊂ D0

The PDA is given by w∗j/n∝ Σ−1/2(x̄∗

(1:j)− x̄), where x̄∗

(1:j)= 1

j

∑x∈D∗ x and

D∗ = arg max|D|=j

∥∥∥Σ−1/2∑x∈D(x− x̄)

∥∥∥.For α = 1/n and Σ = Σ̂, let x∗ attain the maximal Mahalanobis distance. Then

PDA: w∗1/n = Σ̂−1/2

(x∗ − x̄)/√

MD(x∗)

Aijun Zhang, October 2008 Peeling Algorithm in Financial Risk Page. 11

Page 12: Peeling Algorithm in Financial Risk Analysis · Peeling Algorithm in Financial Risk Analysis Aijun Zhang University of Michigan and Bank of America October, 2008 ... Radar-chart Visualization

Peeling MethodologyRadar-chart Visualization

Principal Direction of AnomalyMD-based Peeling Algorithm

Peeling Algorithm

I One-by-one procedure: detectone outlier every step, remove itbefore proceeding to next step

I Masking/swamping immunity:the “intermediate” observationsare likely affected by the extremeones, but not vice versa.

I The recursive MD-based algorithmis simple to understand, and easyto implement; see Corollary 2

Aijun Zhang, October 2008 Peeling Algorithm in Financial Risk Page. 12

Page 13: Peeling Algorithm in Financial Risk Analysis · Peeling Algorithm in Financial Risk Analysis Aijun Zhang University of Michigan and Bank of America October, 2008 ... Radar-chart Visualization

Peeling MethodologyRadar-chart Visualization

Principal Direction of AnomalyMD-based Peeling Algorithm

Peeling Algorithm

MD-based Peeling Algorithm: input α0 (0.5 by default)

1. Initialize D1 = D0 = {xi}ni=1 and k = 0

2. Compute Mahalanobis distance (MD) for sample D1; findone of the elements with max MD

MD = mahal(D1,D1); [maxMD, outId] = max(MD);

3. If k < nα0, flag D1outId as outlier, update k + 1→ k,D1\outId→ D1 and go back to Step 2.

{4} Determine the best α ∈ (0, α0] based on MD-histogram;output the indices of the corresponding nα outliers.

Aijun Zhang, October 2008 Peeling Algorithm in Financial Risk Page. 13

Page 14: Peeling Algorithm in Financial Risk Analysis · Peeling Algorithm in Financial Risk Analysis Aijun Zhang University of Michigan and Bank of America October, 2008 ... Radar-chart Visualization

Peeling MethodologyRadar-chart Visualization

Principal Direction of AnomalyMD-based Peeling Algorithm

>> alpha0 = 0.2;

>> idx = peel(zscore(X), alpha0);

>> Ticker(idx)’

ans = ’WB’ ’SOV’ ’NCC’ ’SFI’ ’GNW’

Example: 2-D stock returns (financial sector) on Sep 29-30

Aijun Zhang, October 2008 Peeling Algorithm in Financial Risk Page. 14

Page 15: Peeling Algorithm in Financial Risk Analysis · Peeling Algorithm in Financial Risk Analysis Aijun Zhang University of Michigan and Bank of America October, 2008 ... Radar-chart Visualization

Peeling MethodologyRadar-chart Visualization

CBSA GeoRiskTracking Financial Storm

Introduction

Peeling MethodologyPrincipal Direction of AnomalyMD-based Peeling Algorithm

Radar-chart VisualizationCBSA GeoRiskTracking Financial Storm

Discussion

Aijun Zhang, October 2008 Peeling Algorithm in Financial Risk Page. 15

Page 16: Peeling Algorithm in Financial Risk Analysis · Peeling Algorithm in Financial Risk Analysis Aijun Zhang University of Michigan and Bank of America October, 2008 ... Radar-chart Visualization

Peeling MethodologyRadar-chart Visualization

CBSA GeoRiskTracking Financial Storm

Radar-chart Visualization

I For each suspicious subject i, the peeling algorithm gives us

(a) Mahalanobis distance: Di =√

MDi (scalar)(b) Outlying direction: wi s.t. ‖w‖ = 1 (spherical)

I Radar-chart visualization is a natural choice, by converting

Di → Radius, wi → Radian (angle)

I Trivial case if w ∈ R2:>> theta = acos(W(:,1)).*sign(W(:,2));

I Nontrivial if w is high-dimensional. We need dimensionreduction techniques, e.g. MDS (multidimensional scaling)

Aijun Zhang, October 2008 Peeling Algorithm in Financial Risk Page. 16

Page 17: Peeling Algorithm in Financial Risk Analysis · Peeling Algorithm in Financial Risk Analysis Aijun Zhang University of Michigan and Bank of America October, 2008 ... Radar-chart Visualization

Peeling MethodologyRadar-chart Visualization

CBSA GeoRiskTracking Financial Storm

CBSA GeoRisk

I Robust PC1 and PC2 are used as reference coordinates

I Better choices are under development ⇒ to report later

Aijun Zhang, October 2008 Peeling Algorithm in Financial Risk Page. 17

Page 18: Peeling Algorithm in Financial Risk Analysis · Peeling Algorithm in Financial Risk Analysis Aijun Zhang University of Michigan and Bank of America October, 2008 ... Radar-chart Visualization

Peeling MethodologyRadar-chart Visualization

CBSA GeoRiskTracking Financial Storm

Tracking Financial Storm

For i = 1, . . . , 288 (Financial firms included in DJUSFN and KBW)

Ri(t) = αki + βkiR0(t) + εki(t), t ∈ [τk−1, τk]

Portfolio anomaly detection via idiosyncratic residuals ε̂ki . . .

Aijun Zhang, October 2008 Peeling Algorithm in Financial Risk Page. 18

Page 19: Peeling Algorithm in Financial Risk Analysis · Peeling Algorithm in Financial Risk Analysis Aijun Zhang University of Michigan and Bank of America October, 2008 ... Radar-chart Visualization

Peeling MethodologyRadar-chart Visualization

CBSA GeoRiskTracking Financial Storm

Black September - Radar Tracking

Time permitting, show the animated radar chart in Matlab ...

Aijun Zhang, October 2008 Peeling Algorithm in Financial Risk Page. 19

Page 20: Peeling Algorithm in Financial Risk Analysis · Peeling Algorithm in Financial Risk Analysis Aijun Zhang University of Michigan and Bank of America October, 2008 ... Radar-chart Visualization

Peeling MethodologyRadar-chart Visualization

Introduction

Peeling MethodologyPrincipal Direction of AnomalyMD-based Peeling Algorithm

Radar-chart VisualizationCBSA GeoRiskTracking Financial Storm

Discussion

Aijun Zhang, October 2008 Peeling Algorithm in Financial Risk Page. 20

Page 21: Peeling Algorithm in Financial Risk Analysis · Peeling Algorithm in Financial Risk Analysis Aijun Zhang University of Michigan and Bank of America October, 2008 ... Radar-chart Visualization

Peeling MethodologyRadar-chart Visualization

Discussion

I A whole family of interesting problems are being investigated:

1. Robust estimate of location and scale, e.g. MCD(minimum covariance determinant) estimator

2. Peeling-based projection pursuit, e.g. robust PCA3. Spherical clustering: Hierarchical linkage, K-means, the

mixture vMF (von Mises-Fisher) model4. Multidimesional scaling onto polar coordinates

I We look forwards to more applications in financial risk analysis

Aijun Zhang, October 2008 Peeling Algorithm in Financial Risk Page. 21