using synthetic data to test downscaling methods
DESCRIPTION
Using Synthetic Data to Test Downscaling Methods. John Lanzante (GFDL/NOAA). CONCEPTS. Testing Downscaling: Like Product Testing. My Product . CONCEPTS. STEP1: Recruit Test Subjects ( G ather D ata). STEP2: F eed C ereal For Several Decades (Apply Downscaling Method). CONCEPTS. - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: Using Synthetic Data to Test Downscaling Methods](https://reader035.vdocuments.us/reader035/viewer/2022062315/56816245550346895dd27ea6/html5/thumbnails/1.jpg)
Using Synthetic Data to Test Downscaling Methods
John Lanzante (GFDL/NOAA)
![Page 2: Using Synthetic Data to Test Downscaling Methods](https://reader035.vdocuments.us/reader035/viewer/2022062315/56816245550346895dd27ea6/html5/thumbnails/2.jpg)
CONCEPTS
Testing Downscaling:
Like Product Testing
My Product
![Page 3: Using Synthetic Data to Test Downscaling Methods](https://reader035.vdocuments.us/reader035/viewer/2022062315/56816245550346895dd27ea6/html5/thumbnails/3.jpg)
STEP1: Recruit Test Subjects (Gather Data)CONCEPTS
STEP2: Feed Cereal For Several Decades (Apply Downscaling Method)
![Page 4: Using Synthetic Data to Test Downscaling Methods](https://reader035.vdocuments.us/reader035/viewer/2022062315/56816245550346895dd27ea6/html5/thumbnails/4.jpg)
CONCEPTS
STEP3: How are subjects affected? How well did downscaling do?
Not so clear – Need more subjects? Need more data?
Real-world data may be limited? Can we generate synthetic data to fill the void?
![Page 5: Using Synthetic Data to Test Downscaling Methods](https://reader035.vdocuments.us/reader035/viewer/2022062315/56816245550346895dd27ea6/html5/thumbnails/5.jpg)
CONCEPTS
STEP 4a: Snowmen most affected? Generate a new sample.
![Page 6: Using Synthetic Data to Test Downscaling Methods](https://reader035.vdocuments.us/reader035/viewer/2022062315/56816245550346895dd27ea6/html5/thumbnails/6.jpg)
CONCEPTS
STEP 4b: Snow-women affected differently? Generate a new sample.
![Page 7: Using Synthetic Data to Test Downscaling Methods](https://reader035.vdocuments.us/reader035/viewer/2022062315/56816245550346895dd27ea6/html5/thumbnails/7.jpg)
REALISTIC EXAMPLESCASE 1 – Linearity:
Simplest downscaling – linear regression
![Page 8: Using Synthetic Data to Test Downscaling Methods](https://reader035.vdocuments.us/reader035/viewer/2022062315/56816245550346895dd27ea6/html5/thumbnails/8.jpg)
REALISTIC EXAMPLESCASE 1 – Strong Nonlinearity:
Simplest downscaling – linear regression
![Page 9: Using Synthetic Data to Test Downscaling Methods](https://reader035.vdocuments.us/reader035/viewer/2022062315/56816245550346895dd27ea6/html5/thumbnails/9.jpg)
REALISTIC EXAMPLESSUMMARY CASE 1 – Nonlinearity:
Hard to test nonlinearity in real-world data ? (if we are just entering “non-linear regime”)Simulate various degrees of nonlinearityCompare linear & nonlinear downscaling methodsDetermine amount of degradationDetermine time in future when degradation becomes “too large”
![Page 10: Using Synthetic Data to Test Downscaling Methods](https://reader035.vdocuments.us/reader035/viewer/2022062315/56816245550346895dd27ea6/html5/thumbnails/10.jpg)
REALISTIC EXAMPLESCASE 2 – Coastal Error:
Downscaling error maximizes along coastline
![Page 11: Using Synthetic Data to Test Downscaling Methods](https://reader035.vdocuments.us/reader035/viewer/2022062315/56816245550346895dd27ea6/html5/thumbnails/11.jpg)
REALISTIC EXAMPLESCASE 2 – Coastal Error:
Obs gridpoint Entirely landModel gridpoint Partly land, partly ocean
![Page 12: Using Synthetic Data to Test Downscaling Methods](https://reader035.vdocuments.us/reader035/viewer/2022062315/56816245550346895dd27ea6/html5/thumbnails/12.jpg)
REALISTIC EXAMPLESCASE 2 – Coastal Error:
Land more detail (extremes) than Ocean (damped)Missing peaks & troughs unrecoverable
![Page 13: Using Synthetic Data to Test Downscaling Methods](https://reader035.vdocuments.us/reader035/viewer/2022062315/56816245550346895dd27ea6/html5/thumbnails/13.jpg)
REALISTIC EXAMPLES
SUMMARY CASE 2 – Costal Error:Simulate land & ocean pointsDownscale land from mixture (land + sea)Vary the proportions of the mixtureIs coastal effect due to mixture/mismatch?
![Page 14: Using Synthetic Data to Test Downscaling Methods](https://reader035.vdocuments.us/reader035/viewer/2022062315/56816245550346895dd27ea6/html5/thumbnails/14.jpg)
SYNTHETIC DATA MODELOne Particular Synthetic Data Model: O= Observations M= Model y= year d= day Red = free parameter (user selects the value)
Oy d = Ōy + O’y d Yearly mean + AR1
O’y d = rlag1 * O’
y d-1 + ay d AR1
fvar = varŌ / varO [ varO = varŌ + varO]
My d = Oy d + by d corr = correlation(O,M)
a ~ N(0,vara) Proper choice of a & b b ~ N(0,varb) yields desired rlag1 & corr
![Page 15: Using Synthetic Data to Test Downscaling Methods](https://reader035.vdocuments.us/reader035/viewer/2022062315/56816245550346895dd27ea6/html5/thumbnails/15.jpg)
SYNTHETIC DATA MODEL
STEP 1: Generate Base Time Series rlag1 day-to-day persistence fvar interannual vs. day-to-day variability corr strength of relation: model vs. obsSTEP 2: Historical Adjustment meanOBS characteristics of the distribution
meanMODEL
varOBS
varMODEL
STEP 3: Future Adjustment meanOBS characteristics of the distribution meanMODEL
varOBS
varMODEL
![Page 16: Using Synthetic Data to Test Downscaling Methods](https://reader035.vdocuments.us/reader035/viewer/2022062315/56816245550346895dd27ea6/html5/thumbnails/16.jpg)
SYNTHETIC DATA MODEL
OUR APPLICATIONS OF THIS MODEL:Downscaling (just getting started)
No results yetApplied successfully to several related
issues (cross-validation, exceedance statistics, testing two distributions)
![Page 17: Using Synthetic Data to Test Downscaling Methods](https://reader035.vdocuments.us/reader035/viewer/2022062315/56816245550346895dd27ea6/html5/thumbnails/17.jpg)
SUMMARYREAL-WORLD COMPLICATIONS:
Results may not be clear-cut: Sample size too small? Multiple factors may contribute?
Some conditions more interesting?
SOLUTION – GENERATE SYNTHETIC DATA: Advantages of Synthetic Data:
Unlimited sample size (enhance signal/noise)Change one factor at a time
Prescribe exact conditionsVary factor over a wide range (“turn the knob”)
Can extend outside the range of historical data Turn knob “all the way” for unambiguous results
![Page 18: Using Synthetic Data to Test Downscaling Methods](https://reader035.vdocuments.us/reader035/viewer/2022062315/56816245550346895dd27ea6/html5/thumbnails/18.jpg)
A CAUTIONARY NOTE
No “One Size Fits All”:No single “best” synthetic data modelMust possess appropriate real-world characteristicsAbility to vary the relevant factors
Possible Models For Future Development:Skewed data (transform Gaussian data nonlinearly?)Precipitation (discrete Markov + bounded distribution?)
Model occurrence & amount separately?Multivariate model?
![Page 19: Using Synthetic Data to Test Downscaling Methods](https://reader035.vdocuments.us/reader035/viewer/2022062315/56816245550346895dd27ea6/html5/thumbnails/19.jpg)
THE END
![Page 20: Using Synthetic Data to Test Downscaling Methods](https://reader035.vdocuments.us/reader035/viewer/2022062315/56816245550346895dd27ea6/html5/thumbnails/20.jpg)
REALISTIC EXAMPLESCASE 1 – Weak Nonlinearity:
Simplest downscaling – linear regression
![Page 21: Using Synthetic Data to Test Downscaling Methods](https://reader035.vdocuments.us/reader035/viewer/2022062315/56816245550346895dd27ea6/html5/thumbnails/21.jpg)
SUPPLEMENTAL
Causes of Nonlinearity?At highest T – model soil becomes excessively dry – T becomes excessiveOther possibilities: Water Vapor, Clouds, Sea-Ice, etc.
![Page 22: Using Synthetic Data to Test Downscaling Methods](https://reader035.vdocuments.us/reader035/viewer/2022062315/56816245550346895dd27ea6/html5/thumbnails/22.jpg)
REALISTIC EXAMPLESCASE 2 – Coastal Error:
Land More extremesOcean Damped
![Page 23: Using Synthetic Data to Test Downscaling Methods](https://reader035.vdocuments.us/reader035/viewer/2022062315/56816245550346895dd27ea6/html5/thumbnails/23.jpg)
REALISTIC EXAMPLESCASE 2 – Coastal Error:
X/Y Plot: Land (model) vs. Land (obs)
![Page 24: Using Synthetic Data to Test Downscaling Methods](https://reader035.vdocuments.us/reader035/viewer/2022062315/56816245550346895dd27ea6/html5/thumbnails/24.jpg)
REALISTIC EXAMPLESCASE 2 – Coastal Error:
X/Y Plot: Ocean (model) vs. Land (obs)
![Page 25: Using Synthetic Data to Test Downscaling Methods](https://reader035.vdocuments.us/reader035/viewer/2022062315/56816245550346895dd27ea6/html5/thumbnails/25.jpg)
SYNTHETIC DATA MODEL
STEP 4:Fit downscaling model to historical sample
STEP 5:Test downscaling in historical & future samples
OUR APPLICATIONS OF THIS MODEL:No results to show todayDownscaling (just getting started)Guidance in the use of cross-validationBiases in exceedance statisticsTesting difference between 2 distributions