seasr overview

Pathways to SEASR. National Center for Supercomputing Applications, University of Illinois at Urbana-Champaign. The SEASR project and its Meandre infrastructure are sponsored by The Andrew W. Mellon Foundation.

Upload: loretta-auvil

Post on 28-Nov-2014


Page 1: SEASR Overview

Pathways to SEASR

National Center for Supercomputing Applications, University of Illinois at Urbana-Champaign

The SEASR project and its Meandre infrastructure are sponsored by The Andrew W. Mellon Foundation

Page 2: SEASR Overview

Outline

•  Welcome

•  Logistics

•  Agenda Review

•  Workshop Objective and Goals

•  Participants Project Planning Activity

•  SEASR + Exemplars Quick Glance

Page 3: SEASR Overview

Welcome

•  Dr. Robert Wilhelmson

–  Division Director – CyberApplications and Communities

–  National Center for Supercomputing Applications

•  Dr. Chris Mackie

–  Program Officer – Research in Information Technology

–  The Andrew W. Mellon Foundation

Page 4: SEASR Overview

Logistics

•  Network Access

•  Building Access

•  Packet - Reimbursements and Evaluation

•  Coffee/Soda/Water “Treat” Breaks

•  Lunch & Dinner

•  Restrooms

•  Staff Introductions

•  Questions?

Page 5: SEASR Overview

Agenda Day 1 Morning

•  Thursday, January 15, 2009

•  8:00am Registration and Breakfast

•  8:30am Introduction, Michael Welge, SEASR, NCSA

•  9:00am SEASR Overview and Workshop Goals, Michael Welge

•  10:00am Break

•  10:30am SEASR Application Examples and Demonstrations

–  Zotero and SEASR, Xavier Llorà, Boris Capitanu

–  UIMA and SEASR, Mike Haberman

–  Audio Analysis and SEASR (NEMA, NESTER), J. Stephen Downie

•  Noon Lunch

Page 6: SEASR Overview

Agenda Day 1 Afternoon

•  1:00pm SEASR Architecture, Xavier Llorà and Bernie Acs

–  Workshop Plan Presentation (5 min each) by J. Stephen Downie

•  2:30pm Break

•  3:00pm SEASR Tools with Hands On Demo

–  Community Hub, Loretta Auvil

–  Workbench, Boris Capitanu

–  Server Interface, Xavier Llorà

–  ZigZag, Xavier Llorà

–  Eclipse Plugin, Amit Kumar

•  4:00pm SEASR Adoptability and Sustainability, John Unsworth

•  4:30pm End of Day

•  6:00pm Dinner

Page 7: SEASR Overview

Agenda Day 2 Morning

•  Friday, January 16, 2009

•  8:00am Registration and Breakfast

•  8:30am SEASR Application Examples and Demonstrations (continued)

–  MONK and SEASR, Stéfan Sinclair

–  FutureLens and SEASR and MONK, Andrey Puretskiy

–  DISCUS and SEASR, Xavier Llorà

–  Visualization and SEASR, Loretta Auvil

–  Workshop Plan Presentation (5 min each) by Teams

•  10:00am Break

•  10:30am SEASR Installation, Xavier Llorà and Loretta Auvil

–  Workshop Plan Presentation (5 min each) by Teams

•  Noon Lunch

Page 8: SEASR Overview

Agenda Day 2 Afternoon

•  1:00pm SEASR Wrap Up and Summary, Michael Welge and Loretta Auvil

•  Workshop Plan Presentation (5 min each) by Teams

•  2:30pm Break

•  3:00pm SEASR Lab Session, Michael Welge and Loretta Auvil

–  Workshop Plan Presentation (5 min each) by Teams

•  4:30pm End of Day

Page 9: SEASR Overview

Workshop Objective

•  The objective of the workshop is:

•  To explain and demonstrate the utility of SEASR for digital humanities, and to bring you to a point where you can deploy, contribute to, and use the SEASR environment.

•  SEASR + TOOLS + EXEMPLARS + HANDS ON

Page 10: SEASR Overview

Workshop Goals

•  The goals of the workshop are:

•  LEARN: Provide a detailed understanding of the SEASR framework

•  LEARN: Provide a foundation and examples for participant teams to use SEASR in a study or inquiry

•  ADOPT: Share participant-generated research plans to utilize SEASR

•  INSTALL: Provide detailed instructions on how to install, build components, integrate existing applications, and maintain the SEASR environment

•  SUPPORT: Develop plans for resolution of issues raised by the user community in utilization of SEASR

•  SUSTAIN: Develop a plan for community-driven future development and dissemination of SEASR

•  Learn + Adopt + Sustain

Page 11: SEASR Overview

SEASR Synergy

[Diagram labels: Requirements Analysis & Specification; Development & System Integration; Prototype or Production Research; Research Applications; Scholarly Discoveries; University, International, and Government Partnerships; SESOA Execution Engine; Security; Interaction UIs; Data Mining; Visualization; Community Hub; Integrated Project Team; Pathfinders; Communities; Technology Roadmaps; Scholarly Roadmaps]

“Infrastructure is a set of practices, standards, technology and cultures that work together to provide a foundation for digital scholarship” Digital Commonwealth 2003

Page 12: SEASR Overview

SEASR: Reach + Relevance + Reuse + Repeatability

SEASR emphasizes flexibility, scalability, and modularity, and provides a community hub and access to heterogeneous data and computational systems

–  Semantic-driven environment for SOA interoperability

–  Encourages sharing and participation for building communities

–  Modular construction allows flows to be modified and configured to encourage reusability within and across domains

–  Enables a mashup and integration of tools

–  Data-intensive flows can be executed on a simple desktop or a large cluster(s) without modification

–  Computation can be created for distributed execution on servers where the content lives

–  User accessibility to control trust and compliance with required copyright license of content

–  Relies on standardized Resource Description Framework (RDF) to define components and flows

Page 13: SEASR Overview

SEASR Components

[Diagram labels: Virtualization Infrastructure; File Systems; Shared Stores; SOA Gateways; Meandre Infrastructure; Visualization; Metadata Stores; Component Repository; Component Discovery; Meandre Data-Intensive Flows; SEASR Apps; SEASR Services; SEASR Plugins; SEASR Web Apps; Analytics; Data; Gateway Connections; Data Persistence; Data Transformation; Natural Lang Processing; Descriptive Statistics; Predictive Modeling; Discovery; Graphing; Modeling Vis; Info Vis (small multiples); Developer Tools]

Page 14: SEASR Overview

Data Driven Models

Page 15: SEASR Overview

Knowledge Discovery in Data

Page 16: SEASR Overview

Participant Project Plan

•  Participant Project Plan Guide

–  Research Objective

–  Data Source(s)

–  Transformation(s)

–  Query, Descriptive, and/or Analysis

–  Evaluation – Table, Visualization

–  Interaction

–  Outcome

Page 17: SEASR Overview

Workbench

•  Web-based UI

•  Components and flows are retrieved from server

•  Additional locations of components and flows can be added to server

•  Create flow using a graphical drag and drop interface

•  Change property values

•  Execute the flow
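The Workbench steps above (assemble components into a flow, change property values, execute) can be illustrated with a minimal Python sketch. This is not the Meandre or Workbench API: the `Component` and `Flow` classes, the two example components, and the property names are all hypothetical, chosen only to show the idea of a flow whose behavior is changed through properties rather than code.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, List


@dataclass
class Component:
    """Hypothetical flow component: a named step with editable properties."""
    name: str
    run: Callable[[Any, Dict[str, Any]], Any]
    properties: Dict[str, Any] = field(default_factory=dict)


@dataclass
class Flow:
    """Hypothetical linear flow: each component's output feeds the next."""
    components: List[Component] = field(default_factory=list)

    def execute(self, data: Any) -> Any:
        for component in self.components:
            data = component.run(data, component.properties)
        return data


def tokenize(text, props):
    # Split on whitespace, optionally lowercasing first.
    if props.get("lowercase", True):
        text = text.lower()
    return text.split()


def keep_long_words(tokens, props):
    # Keep only tokens with at least `min_length` characters.
    return [t for t in tokens if len(t) >= props["min_length"]]


flow = Flow([
    Component("tokenize", tokenize, {"lowercase": True}),
    Component("filter", keep_long_words, {"min_length": 5}),
])

result = flow.execute("Pathways to SEASR and Meandre")

# Changing a property value alters the next run without touching the code.
flow.components[1].properties["min_length"] = 2
result_after_change = flow.execute("Pathways to SEASR and Meandre")
```

In this sketch the flow is a simple chain; a real dataflow engine would also allow branching connections between component ports.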

Page 18: SEASR Overview

Community Hub

Page 19: SEASR Overview

SEASR @ Work – Zotero

•  Plugin to Firefox

•  Zotero manages the collection

•  Launch SEASR Analytics

–  Citation Analysis uses the JUNG network importance algorithms to rank the authors in the citation network that is exported as RDF data from Zotero to SEASR

–  Zotero Export to Fedora through SEASR

–  Saves results from SEASR Analytics to a Collection

•  Launch MONK Processing

–  MONK DB Ingestion Workflow
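To make the idea of ranking authors by network importance concrete, here is a small Python sketch. JUNG is a Java library and the slide does not say which of its importance algorithms the flow uses, so PageRank is used here purely as one well-known example of such a measure. The `pagerank` function and the author names are illustrative assumptions, not data or code from SEASR.

```python
def pagerank(citations, damping=0.85, iterations=50):
    """Rank nodes of a directed graph given as {citing: [cited, ...]}."""
    nodes = set(citations)
    for cited in citations.values():
        nodes.update(cited)
    nodes = sorted(nodes)
    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        # Every node starts each round with the "random jump" share.
        new_rank = {node: (1.0 - damping) / n for node in nodes}
        for node in nodes:
            targets = citations.get(node, [])
            if targets:
                share = damping * rank[node] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # A node that cites no one spreads its rank evenly.
                share = damping * rank[node] / n
                for other in nodes:
                    new_rank[other] += share
        rank = new_rank
    return rank


# Hypothetical author-level citation network (not data from the workshop).
citations = {
    "Author A": ["Author C"],
    "Author B": ["Author C"],
    "Author C": ["Author D"],
    "Author D": [],
}
scores = pagerank(citations)
ranking = sorted(scores, key=scores.get, reverse=True)
```

Here `ranking` lists the authors from most to least important under this measure. Other importance measures, such as betweenness centrality, could order the same network differently.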

Page 20: SEASR Overview

SEASR @ Work – Fedora

•  Web Service

•  Interactive Web Application

Page 21: SEASR Overview

SEASR @ Work – Entity Mash-up

•  Entity Extraction with OpenNLP

•  Locations viewed on Google Map

•  Dates viewed on Simile Timeline

Page 22: SEASR Overview

SEASR @ Work – Audio Analysis

•  NEMA: Executes a SEASR flow for each run

–  Loads audio data

–  Extracts features for every 10 sec moving window of audio

–  Loads and applies the models

–  Sends results back to the WebUI

•  NESTER: Annotation of Audio via Spectral Analysis
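The feature-extraction step (one feature vector per 10-second moving window) can be sketched in Python as follows. The slide gives only the window length, so the 5-second hop, the RMS-energy feature, and the synthetic sine-wave input are assumptions made for illustration. NEMA's actual features and models are not described here.

```python
import math


def moving_windows(samples, sample_rate, window_seconds=10.0, hop_seconds=5.0):
    """Yield (start_time_in_seconds, window) pairs over a list of samples."""
    window_size = int(window_seconds * sample_rate)
    hop_size = int(hop_seconds * sample_rate)  # assumed hop, not from the slide
    for start in range(0, len(samples) - window_size + 1, hop_size):
        yield start / sample_rate, samples[start:start + window_size]


def rms_energy(window):
    """Root-mean-square energy, a simple stand-in for real audio features."""
    return math.sqrt(sum(x * x for x in window) / len(window))


# Synthetic 30-second, 5 Hz sine wave at a deliberately tiny sample rate.
sample_rate = 100
duration_seconds = 30
samples = [
    math.sin(2 * math.pi * 5 * t / sample_rate)
    for t in range(duration_seconds * sample_rate)
]

features = [
    (start, rms_energy(window))
    for start, window in moving_windows(samples, sample_rate)
]
```

Each entry of `features` pairs a window's start time with its feature value. In a flow like the one described, these per-window features would be passed on to the trained models.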

Page 23: SEASR Overview

SEASR @ Work – MONK

Executes flows for each analysis requested

–  Predictive modeling using Naïve Bayes

–  Predictive modeling using Support Vector Machines (SVM)
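As a rough illustration of predictive modeling with Naïve Bayes on text, the sketch below trains a multinomial Naïve Bayes classifier with add-one smoothing on a toy corpus. The function names, training texts, and labels are invented for the example, and this is not MONK's implementation.

```python
import math
from collections import Counter, defaultdict


def train_naive_bayes(documents):
    """Train on a list of (text, label) pairs; returns the model as a tuple."""
    label_counts = Counter()
    word_counts = defaultdict(Counter)
    vocabulary = set()
    for text, label in documents:
        words = text.lower().split()
        label_counts[label] += 1
        word_counts[label].update(words)
        vocabulary.update(words)
    return label_counts, word_counts, vocabulary


def classify(model, text):
    """Return the label with the highest log-probability for the text."""
    label_counts, word_counts, vocabulary = model
    total_docs = sum(label_counts.values())
    best_label, best_score = None, float("-inf")
    for label in label_counts:
        score = math.log(label_counts[label] / total_docs)
        total_words = sum(word_counts[label].values())
        for word in text.lower().split():
            if word not in vocabulary:
                continue  # ignore words never seen in training
            # Add-one (Laplace) smoothing avoids zero probabilities.
            count = word_counts[label][word] + 1
            score += math.log(count / (total_words + len(vocabulary)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical toy training set (labels and texts are made up).
training = [
    ("love heart tears sorrow", "sentimental"),
    ("dear tender tears weeping", "sentimental"),
    ("ship whale harpoon sea", "adventure"),
    ("sea storm captain voyage", "adventure"),
]
model = train_naive_bayes(training)
prediction_a = classify(model, "the captain sailed the sea")
prediction_b = classify(model, "tender sorrow")
```

A Support Vector Machine would replace the probability model with a learned separating boundary, while the surrounding train-then-classify workflow stays the same.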

Page 24: SEASR Overview

SEASR @ Work – DISCUS

•  On-demand usage of analytics while surfing

–  While navigating, request analytics to be performed on the page

–  Text extraction and cleaning

•  Summarization and key word extraction

–  List the important terms on the page being analyzed

–  Provide relevant short summaries

•  Visual maps

–  Provide a visual representation of the key concepts

–  Show the graph of relations between concepts
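Listing the important terms on a page can be approximated, in its simplest form, by counting non-stopword terms. The slides do not describe the DISCUS method, so the following Python sketch is a plain frequency-based stand-in, and its stopword list, `key_terms` function, and sample text are assumptions.

```python
import re
from collections import Counter

# A deliberately tiny stopword list, for illustration only.
STOPWORDS = {"a", "an", "and", "are", "from", "in", "of", "on", "the", "to"}


def key_terms(text, top_n=3):
    """Return the top_n most frequent non-stopword terms with their counts."""
    words = re.findall(r"[a-z]+", text.lower())
    counts = Counter(w for w in words if w not in STOPWORDS and len(w) > 2)
    return counts.most_common(top_n)


# Hypothetical page text after extraction and cleaning.
page_text = (
    "SEASR flows run on Meandre. Meandre executes flows built from "
    "components. Components and flows are shared."
)
top_terms = key_terms(page_text)
```

The resulting term counts could also drive a tag cloud, with each term's display size scaled by its count.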

Page 25: SEASR Overview

SEASR and UIMA: Emotion Tracking

The goal is to have this type of visualization to track emotions across a text document (leveraging flare.prefuse.org)

Page 26: SEASR Overview

Tag Cloud of the RegForm Tags

Page 27: SEASR Overview

Tag Cloud of All RegForm Text

Page 28: SEASR Overview

Agenda

•  Thursday, January 15, 2009

•  10:00am Break

•  10:30am SEASR Application Examples and Demonstrations

–  Zotero and SEASR, Xavier Llorà, Boris Capitanu

–  UIMA and SEASR, Mike Haberman

–  Audio Analysis and SEASR (NEMA, NESTER), J. Stephen Downie

•  12:00pm Lunch

Page 29: SEASR Overview

Pathways to SEASR

National Center for Supercomputing Applications, University of Illinois at Urbana-Champaign

The SEASR project and its Meandre infrastructure are sponsored by The Andrew W. Mellon Foundation