Transcript
Page 1: Tabular Data Protection Methods *

Manchester, United Kingdom 1

Comparative Evaluation of Four Different Sensitive Tabular Data Protection Methods

Using a Real Life Table Structure of Complex Hierarchies and Links Populated with Artificial Data

Ramesh A. Dandekar
Energy Information Administration

Washington DC

([email protected])

UNECE2007 – 17-19 December 2007

Page 2: Tabular Data Protection Methods *

Tabular Data Protection Methods*

• Classical LP Based Cell Suppression

• Network Flow Based Cell Suppression (USBC)

• LP Based Synthetic Tabular Data / CTA (Dandekar 2001)

• Micro Data Level Noise Addition (USBC)

P = 10 % Rule Used

* Uses Proprietary Research Tools
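
The P = 10 % rule named on this slide can be illustrated with a minimal sketch. It assumes the common formulation of the p-percent rule: a cell is sensitive when the cell total minus its two largest contributions is less than p percent of the largest contribution. The function name, the treatment of empty cells, and the sample values are hypothetical and are not taken from the presentation or its proprietary tools.

```python
def is_sensitive(contributions, p=10.0):
    """Return True if a cell fails the p-percent rule (assumed formulation).

    contributions: non-negative respondent values that add up to the cell total.
    """
    ordered = sorted(contributions, reverse=True)
    if not ordered:
        return False  # assumption: an empty cell is not flagged
    largest = ordered[0]
    second = ordered[1] if len(ordered) > 1 else 0.0
    # What remains after removing the two largest contributors
    remainder = sum(ordered) - largest - second
    return remainder < (p / 100.0) * largest


# Hypothetical cells
print(is_sensitive([100, 50, 5]))   # remainder 5 is below 10 % of 100
print(is_sensitive([100, 50, 20]))  # remainder 20 is not below 10 % of 100
```

Under this formulation, a cell with one or two contributors is always flagged, because nothing remains to mask the largest contributor's value.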

Page 3: Tabular Data Protection Methods *

Two Three-Dimensional HYPOTHETICAL Tables

Linked in Four-Dimensional Space

1st Table: “Volumes by Grade, Sales Type, PAD District, and State”, and

2nd Table: “Volumes by Formulation, Sales Type, PAD District, and State”
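
A minimal sketch of what "linked" means here, assuming both tables are tabulated from the same records: collapsing Grade in the 1st table and Formulation in the 2nd table must give identical Sales Type by State totals. The records, state codes, and helper names are hypothetical, and the PAD District level is omitted for brevity.

```python
from collections import defaultdict

# Hypothetical records: (state, grade, sales_type, formulation, volume)
records = [
    ("VA", "Regular", "Rack", "Conventional", 120.0),
    ("VA", "Premium", "DTW", "Reformulated", 40.0),
    ("MD", "Midgrade", "Bulk", "Oxygenated", 75.0),
    ("MD", "Regular", "Rack", "Reformulated", 60.0),
]


def tabulate(rows, key):
    """Sum volumes into cells identified by key(row)."""
    table = defaultdict(float)
    for row in rows:
        table[key(row)] += row[4]
    return dict(table)


def collapse_last(table):
    """Sum out the last dimension (grade or formulation) of a table."""
    out = defaultdict(float)
    for (state, sales_type, _), volume in table.items():
        out[(state, sales_type)] += volume
    return dict(out)


# 1st table: state x sales type x grade; 2nd table: state x sales type x formulation
by_grade = tabulate(records, lambda r: (r[0], r[2], r[1]))
by_formulation = tabulate(records, lambda r: (r[0], r[2], r[3]))

# The shared margin is the link between the two tables
shared_from_grade = collapse_last(by_grade)
shared_from_formulation = collapse_last(by_formulation)
print(shared_from_grade == shared_from_formulation)
```

Because of this shared margin, a protection method applied to one table has to stay consistent with the other.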

Page 4: Tabular Data Protection Methods *

Page 5: Tabular Data Protection Methods *

1st Table: Grades
• Regular
• Midgrade
• Premium
• Total All Grades

2nd Table: Formulations
• Conventional
• Oxygenated
• Reformulated
• Total All Formulations

Page 6: Tabular Data Protection Methods *

[Figure: 1st Table By Grades and 2nd Table By Formulations, with axes labeled Grades and Formulations, a Total, and a region marked MISSING PORTION]

Four Layers: 1) DTW 2) Rack 3) Bulk 4) Total

Corresponding to each PAD, State and US Level Cell

Page 7: Tabular Data Protection Methods *

1,000 Synthetic Micro Data Records Containing Six Variables

Four Categorical Variables

• 51 States

• 3 Grade Types

• 3 Sale Types

• 3 Formulation Types

One Magnitude Variable

One Sample Weight Variable
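
The record layout above can be sketched as follows. This is an illustration only: the presentation does not say how its artificial data were generated, so the state codes, the lognormal magnitude distribution, the weight range, and all names are assumptions. The grade, sale type, and formulation labels are the ones listed on the other slides.

```python
import random

GRADES = ["Regular", "Midgrade", "Premium"]
SALE_TYPES = ["DTW", "Rack", "Bulk"]
FORMULATIONS = ["Conventional", "Oxygenated", "Reformulated"]
STATES = [f"S{i:02d}" for i in range(1, 52)]  # 51 placeholder state codes


def make_records(n=1000, seed=0):
    """Generate n synthetic records with four categorical variables,
    one magnitude variable, and one sample weight variable."""
    rng = random.Random(seed)
    return [
        {
            "state": rng.choice(STATES),
            "grade": rng.choice(GRADES),
            "sale_type": rng.choice(SALE_TYPES),
            "formulation": rng.choice(FORMULATIONS),
            "magnitude": rng.lognormvariate(5.0, 1.0),  # assumed skewed distribution
            "weight": rng.uniform(1.0, 5.0),            # assumed weight range
        }
        for _ in range(n)
    ]


records = make_records()
print(len(records), sorted(records[0].keys()))
```

A skewed magnitude distribution is used here because dominant respondents are what make cells sensitive under a p-percent rule; uniform magnitudes would produce few sensitive cells.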

Page 8: Tabular Data Protection Methods *

Page 9: Tabular Data Protection Methods *

Page 10: Tabular Data Protection Methods *

Comparative Evaluation of Cell Suppression Methods

Classical Cell Suppression

• 294 Suppressions

• Sensitive Cells Fully Protected

Network Flow Method

• 479 Suppressions

• Sensitive Cells Fully Protected

• 3 Exact Disclosures of Non-Sensitive Cells
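
An exact disclosure means a suppressed value can be recomputed from the published cells and totals. The presentation does not describe how the disclosures were detected; the sketch below shows only the simplest case, a single additive row with a published total, and its function name and data are hypothetical. A full audit would consider all additive relations of the table jointly, typically with linear programming.

```python
def exactly_disclosed(row, total):
    """Recover a suppressed value from a single additive relation.

    row: cell values, with None marking suppressed cells.
    total: the published row total.
    Returns the recovered value if exactly one cell is suppressed, else None.
    """
    suppressed = [i for i, value in enumerate(row) if value is None]
    if len(suppressed) != 1:
        return None  # nothing suppressed, or not recoverable from this row alone
    return total - sum(value for value in row if value is not None)


# Hypothetical row: one suppressed cell and a published total of 100
print(exactly_disclosed([10, None, 30], 100))    # recovers 60
print(exactly_disclosed([None, None, 30], 100))  # None: two unknowns, one equation
```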

Page 11: Tabular Data Protection Methods *

CTA vs NOISE - TABULAR DATA QUALITY

1432 633

Page 12: Tabular Data Protection Methods *

Page 13: Tabular Data Protection Methods *

Page 14: Tabular Data Protection Methods *

THANK YOU!

ADDITIONAL INFORMATION FROM

http://mysite.verizon.net/vze7w8vk/

