seasr overview
DESCRIPTION
TRANSCRIPT
Pathways to SEASR
National Center for Supercomputing Applications!University of Illinois at Urbana-Champaign
The SEASR project and its Meandre infrastructure!are sponsored by The Andrew W. Mellon Foundation
Outline
• Welcome
• Logistics
• Agenda Review
• Workshop Objective and Goals
• Participants Project Planning Activity
• SEASR + Exemplars Quick Glance "
Welcome
• Dr. Robert Wilhelmson,
– Division Director – CyberApplications and Communities
– National Center for Supercomputing Applications
• Dr. Chris Mackie
– Program Officer – Research in Information Technology
– The Andrew W. Mellon Foundation
Logistics
• Network Access
• Building Access
• Packet - Reimbursements and Evaluation
• Coffee/Soda/Water “Treat” Breaks
• Lunch & Dinner
• Restrooms
• Staff Introductions
• Questions?"
Agenda Day 1 Morning
• Thursday, January 15, 2009
• 8:00am Registration and Breakfast
• 8:30am Introduction, Michael Welge, SEASR, NCSA
• 9:00am SEASR Overview and Workshop Goals, Michael Welge
• 10:00am Break
• 10:30am SEASR Application Examples and Demonstrations
– Zotero and SEASR, Xavier Llorà, Boris Capitanu
– UIMA and SEASR, Mike Haberman
– Audio Analysis and SEASR (NEMA, NESTER), J. Stephen Downie
• Noon Lunch
Agenda Day 1 Afternoon
• 1:00pm SEASR Architecture, Xavier Llorà and Bernie Acs
– Workshop Plan Presentation (5 min each) by J. Stephen Downie
• 2:30pm Break
• 3:00pm SEASR Tools with Hands On Demo
– Community Hub, Loretta Auvil
– Workbench, Boris Capitanu
– Server Interface, Xavier Llorà
– ZigZag, Xavier Llorà
– Eclipse Plugin, Amit Kumar
• 4:00 SEASR Adoptability and Sustainability, John Unsworth
• 4:30pm End of Day
• 6:00pm Dinner
Agenda Day 2 Morning
• Friday, January 16, 2009
• 8:00am Registration and Breakfast
• 8:30am SEASR Application Examples and Demonstrations (continued)
– MONK and SEASR, Stéfan Sinclair
– FutureLens and SEASR and MONK, Andrey Puretskiy
– DISCUS and SEASR, Xavier Llorà
– Visualization and SEASR, Loretta Auvil
– Workshop Plan Presentation (5 min each) by Teams
• 10:00am Break
• 10:30am SEASR Installation, Xavier Llorà and Loretta Auvil
– Workshop Plan Presentation (5 min each) by Teams
• Noon Lunch
Agenda Day 2 Afternoon
• 1:00pm SEASR Wrap Up and Summary, Michael Welge and Loretta Auvil
• Workshop Plan Presentation (5 min each) by Teams
• 2:30pm Break
• 3:00pm SEASR Lab Session, Michael Welge and Loretta Auvil Workshop Plan Presentation (5 min each) by Teams
• 4:30pm End of Day
Workshop Objective
• The objective of the workshop is:
• To explain and demonstrate the utility of SEASR for digital humanities, and to bring you to a point where you could deploy, contribute and utilize the SEASR environment.
• SEASR + TOOLS + EXEMPLARS + HANDS ON
Workshop Goals
• The goals of the workshop are:
• LEARN: Provide a detailed understanding of the SEASR framework
• LEARN: Provide a foundation and examples for participant teams to use SEASR in a study or inquiry
• ADOPT: Share participant generated research plans to utilize SEASR
• INSTALL: Provide detailed instructions on how to install, build components, integrate existing applications, and maintain the SEASR environment
• SUPPORT: Develop plans for resolution of issues raised by the user community in utilization of SEASR
• SUSTAIN: Develop a plan for community driven future development and dissemination of SEASR
• Learn + Adopt + Sustain
SEASR Synergy
Requirements Analysis & Specification
Development & System Integration
Prototype or Production Research
Research Applications
Scholarly Discoveries
University International Government
Partnerships
SESOA Execution Engine
Security Interaction UIs
Data Mining Visualization
Community Hub
Integrated Project Team
Pathfinders Communities
Technology Roadmaps Scholarly
Roadmaps
“Infrastructure is a set of practices, standards, technology and cultures that work together to provide a foundation for digital scholarship” Digital Commonwealth 2003
SEASR: Reach + Relevance + Reuse + Repeatability
SEASR emphasizes flexibility, scalability, modularity, provides community hub and access to heterogeneous data and computational systems – Semantic driven environment for SOA interoperability – Encourages sharing and participation for building communities – Modular construction allows flows to be modified and configured
to encourage reusability within and across domains – Enables a mashup and integration of tools – Data-intensive flows can be executed on a simple desktop or a
large cluster(s) without modification – Computation can be created for distributed execution on servers
where the content lives – User accessibility to control trust and compliance with required
copyright license of content – Relies on standardized Resource Description Framework (RDF) to
define components and flow
SEASRComponents
Virtualiza5onInfrastructure
FileSystemsSharedStores SOAGateways
MeandreInfrastructure
Visualiza5on
MetadataStores
ComponentRepository ComponentDiscovery
MeandreData‐IntensiveFlows
SEASRApps SEASRServicesSEASRPlugins SEASRWebApps
Analy5csData
GatewayConnec5onsDataPersistence
DataTransforma5onNaturalLangProcessing
Descrip5veSta5s5csPredic5veModeling
Discovery
GraphingModelingVis
InfoVis(smallmul5ples)
Develop
erToo
ls
Data Driven Models
Knowledge Discovery in Data
Participant Project Plan
• Participant Project Plan Guide
– Research Objective
– Data Source(s)
– Transformation(s)
– Query, Descriptive, and/or Analysis
– Evaluation – Table, Visualization
– Interaction
– Outcome
Workbench
• Web-based UI
• Components and flows are retrieved from server
• Additional locations of components and flows can be added to server
• Create flow using a graphical drag and drop interface
• Change property values
• Execute the flow
Community Hub
SEASR @ Work – Zotero
• Plugin to Firefox • Zotero manages the
collection • Launch SEASR Analytics
– Citation Analysis uses the JUNG network importance algorithms to rank the authors in the citation network that is exported as RDF data from Zotero to SEASR
– Zotero Export to Fedora through SEASR
– Saves results from SEASR Analytics to a Collection
• Launch MONK Processing – MONK DB Ingestion Workflow
Web Service
Interactive Web Application
SEASR @ Work – Fedora
SEASR @ Work – Entity Mash-up
• Entity Extraction with OpenNLP
• Locations viewed on Google Map
• Dates viewed on Simile Timeline
SEASR @ Work – Audio Analysis
• NEMA: Executes a SEASR flow for each run – Loads audio data
– Extracts features for every 10 sec moving window of audio
– Loads and applies the models
– Sends results back to the WebUI
• NESTER: Annotation of Audio via Spectral Analysis
SEASR @ Work – MONK
Executes flows for each analysis requested
– Predictive modeling using Naïve Bayes
– Predictive modeling using Support Vector Machines (SVM)
SEASR @ Work – DISCUS • On-demand usage of
analytics while surfing – While navigating
request analytics to be performed on page
– Text extraction and cleaning
• Summarization and key work extraction
– List the important terms on the page being analyzed
– Provide relevant short summaries
• Visual maps – Provide a visual
representation of the key concepts
– Show the graph of relations between concepts
SEASR and UIMA : Emotion Tracking Goal is to have this type of Visualization to track emotions across a text document (Leveraging flare.prefuse.org)
Tag Cloud of the RegForm Tags
Tag Cloud of All RegForm Text
Agenda
• Thursday, January 15, 2009
• 10:00am Break
• 10:30am SEASR Application Examples and Demonstrations
– Zotero and SEASR, Xavier Llorà, Boris Capitanu
– UIMA and SEASR, Mike Haberman
– Audio Analysis and SEASR (NEMA, NESTER), J. Stephen Downie
• 12:00pm Lunch
Pathways to SEASR
National Center for Supercomputing Applications!University of Illinois at Urbana-Champaign
The SEASR project and its Meandre infrastructure!are sponsored by The Andrew W. Mellon Foundation