data quality and uncertainty visualization
TRANSCRIPT
![Page 1: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/1.jpg)
Data Quality and Uncertainty Visualization
UC San DiegoCOGS 220
Winter Quarter 2006Barry Demchak
![Page 2: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/2.jpg)
Immediate Motivation: Wiisard
A joint project of Veterans Administration and UC San Diego, funded by the National Library of Medicine
Mass casualty triage and treatment Enter patient information via PDAs Patient information summarized on tablet PCs Command/control for supervisors and incident
comment personnel Tied together using 802.11b and store-and-
forward database access
![Page 3: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/3.jpg)
Wiisard – Explosion with Pesticides
![Page 4: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/4.jpg)
Wiisard – Network Deployment
![Page 5: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/5.jpg)
Wiisard – Tablet Display
![Page 6: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/6.jpg)
Wiisard – Command/Control
![Page 7: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/7.jpg)
Wiisard – The Problem
What if the network becomes partitioned? Tablet display shows out-of-date patient
information Summary displays are out of date, too
How does this lead to bad decisions? Supervisors may mis-deploy doctors Incident command may mis-deploy resources
People may die
![Page 8: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/8.jpg)
DOD Example Sensor-to-shooter (STS) Networks – Patrick
Driscoll (USMA), June 2002
![Page 9: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/9.jpg)
DOD Example
![Page 10: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/10.jpg)
DOD Example “… our first attempt to get the military
community to realize that there is a degree of uncertainty involved in (digital) information systems that cannot be engineered out of thesystem.”
“Ultimately, our concern was an awareness issue (for the decision maker) …”
“… woman at MITRE had proposed a system of tagging intelligence starting at the source in a way that would reflect the uncertainty of the data being put into the intel database.”
![Page 11: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/11.jpg)
The Problem
How to visualize the uncertainty in data so that humans can exercise judgment in making the best decision
Accounting for uncertainty is not the same thing as visualizing uncertainty
![Page 12: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/12.jpg)
What Labs are Involved MIT Sloan School of Management
Richard Wang (Data Quality) Penn State University
Alan MacEachren (GIS) University of Maine
Kate Beard-Tisdale (GIS) University of California, Santa Cruz
Alex Pang (Scientific Visualization) University of Arkansas, Little Rock
Master of Sciences in Information Quality
![Page 13: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/13.jpg)
What Conferences are There?
MIT Information Quality (IQatMIT) ACM SIGMOD Workshop on Information Qua
lity in Information Systems (IQIS) ACM SIGKDD (Knowledge Discovery and Dat
a Mining) MIT International Conference on Information
Quality (ICIQ)
![Page 14: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/14.jpg)
Semiotic Interpretation
Data Visualization
Normal Mapping
Mapping
Normal
Data Visualization
Normal Mapping
PoorData
Quality
DataMapping
Data UncertaintyVisualization
Uncertainty Mapping
Mapping
Poor DataQuality w/
Uncertainty
![Page 15: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/15.jpg)
Definition of Data Quality From Wand & Wang:
![Page 16: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/16.jpg)
Metrics Timeliness How up to date relative to intended purpose
Ballou et al: Timeliness = Max(0, 1-(currency/volatility) Currency = delivery_time – input_time Volatility = length of time data remains valid Apply sensitivity factor “s”: Timeliness ^ s
Tim
elin
ess
time
Tim
elin
ess
time
Pulse = 80 Pulse = 180
![Page 17: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/17.jpg)
Interplay with Uncertainty
Metrics are application dependent Metrics are data dependent Metrics are user dependent Question: If a metric describes an individual
data element, what is the effect of aggregating data elements having uncertainty??
![Page 18: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/18.jpg)
GIS Examples – NCGIA
Sample point locations as overlay
Sample points and corresponding contours using naïve shading
![Page 19: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/19.jpg)
GIS Examples – NCGIA
Gray shading uncertainty surface captures distance function used by interpolation method
Uncertainty encoded in contour line widths
![Page 20: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/20.jpg)
Fill Clarity
Resolution
GIS Techniques
Contour Crispness
Fog
![Page 21: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/21.jpg)
Merging Data and Uncertainty
Risk and uncertainty separately
Risk and uncertainty combined
![Page 22: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/22.jpg)
Basic Data Examples Errors
![Page 23: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/23.jpg)
Basic Data Examples Errors
![Page 24: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/24.jpg)
Basic Data Examples Ambiguation
![Page 25: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/25.jpg)
Basic Data Examples Ambiguation
![Page 26: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/26.jpg)
Photo Realistic
![Page 27: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/27.jpg)
Uncertainty Vector Glyphs
![Page 28: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/28.jpg)
Uncertainty Vector Glyphs
![Page 29: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/29.jpg)
Hue as Uncertainty With
out
With
![Page 30: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/30.jpg)
Texture as Uncertainty
Raw
Trans-parent Points
Cer-tain-ty
Opaque Lines
![Page 31: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/31.jpg)
Data Confidence
x is a device, is decay constant, R(x) is a weighting for device x in the calculation
Back to Wiisard
x
xpingtimexposttimecurtime
xRC
)()(1
1)(
![Page 32: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/32.jpg)
Back to Wiisard
Individual data (annotation)
Aggregate data (annotated/integrated)
![Page 33: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/33.jpg)
Back to Wiisard Annotated
![Page 34: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/34.jpg)
Back to Wiisard Integrated
![Page 35: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/35.jpg)
Research Questions
What are the dimensions of metrics relevant for determining data quality for medical providers in a mass casualty context?
What kind of visualization best conveys the use suitability for various kinds of data? Single data points Streaming bioinformation Aggregated information
![Page 36: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/36.jpg)
Research Questions What kinds of visualizations are best suited to
field personnel? Non-IS frenzied technicians High glare, small footprint screens Low processing power
What kinds of visualizations are best suited to incident command? Seasoned experts Large, high density displays Highly connected with high data processing
![Page 37: Data quality and uncertainty visualization](https://reader036.vdocuments.us/reader036/viewer/2022081605/58f2fa931a28abf4058b463f/html5/thumbnails/37.jpg)
Conclusion
Data Quality and Uncertainty Visualization are like the weather …
… everyone’s talks about it, but no one does anything about it