theme 2: data & models one of the central processes of science is the interplay between models...

6
Theme 2: Data & Models One of the central processes of science is the interplay between models and data Data informs model generation and selection Models inform data collection and interpretation from both observations and experimentation An iterative feedback loop exists between these two Computational tools currently aid this process and have great potential for further contributions

Upload: meghan-hodge

Post on 17-Jan-2016

222 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Theme 2: Data & Models One of the central processes of science is the interplay between models and data Data informs model generation and selection Models

Theme 2: Data & ModelsOne of the central processes of science is

the interplay between models and dataData informs model generation and selectionModels inform data collection and interpretation

from both observations and experimentationAn iterative feedback loop exists between these

two

Computational tools currently aid this process and have great potential for further contributions

Page 2: Theme 2: Data & Models One of the central processes of science is the interplay between models and data Data informs model generation and selection Models

Why is it important?

Improving this process will:Increase the speed and accuracy of scientific

researchSupport development of more comprehensive

models that cover larger datasetsAllow the effective study of more complex

phenomenaSystematically transfer knowledge and best

practices between scientific groups and fieldsBroaden participation in science

Page 3: Theme 2: Data & Models One of the central processes of science is the interplay between models and data Data informs model generation and selection Models

State of the Art

Some individual scientific projects have the tools to iterate between data and models effectively and automatically, but…

Few, if any, scientific fields have the model formalisms and algorithms to do this

Requires high degree of hand-holding and does not generalize

Page 4: Theme 2: Data & Models One of the central processes of science is the interplay between models and data Data informs model generation and selection Models

State of the Art

Representations of data and models vary widely across different sciences, but typically…

Scientists have far richer conceptions of data and models than currently expressed;they lack context, metadata

Researchers must choose between lack of expressiveness and onerous complexity

Page 5: Theme 2: Data & Models One of the central processes of science is the interplay between models and data Data informs model generation and selection Models

State of the Art

Methodologies vary widely across different sciences, but typically…

Not formalized in ways that support computation

Limited in scalability to data and model space

Tend to focus on going from data to models,not completing the feedback loop

Page 6: Theme 2: Data & Models One of the central processes of science is the interplay between models and data Data informs model generation and selection Models

Intellectual ChallengesIdentify equivalence classes of scientific modeling

domains (generality without compromising usefulness)Increase expressiveness of data and model

representationsDesign scalable methods (datasets, hypothesis spaces)Enable reproducibility and model reusabilityDefine principles of, design, and build interactive

environments that support scientific tasks, e.g., model construction, design of data collection, data analysis“TurboTax” for individual scientists“Wikipedia” for communities of users

Develop evaluation methods for discovery systems and scientific conclusions drawn from data and models