sca2013 presentation: a web-based content analysis tool

A Web-Based Tool for Collaborative Social Media Data Analysis

Xin Chen, Mihaela Vorvoreanu, Krishna Madhavan {chen654, mihaela, cm}@purdue.edu

Upload: xin-chen

Post on 22-Apr-2015

DESCRIPTION

This presentation, given at SCA2013 in Karlsruhe, Germany, shows the design (architecture, database, sketch, wireframes, prototype) of a simple web-based tool that supports asynchronous collaboration among researchers conducting content analysis of qualitative social media data.

TRANSCRIPT

Page 1: SCA2013 Presentation: A Web-Based Content Analysis Tool

A Web-Based Tool for Collaborative Social Media Data Analysis

Xin Chen, Mihaela Vorvoreanu, Krishna Madhavan {chen654, mihaela, cm}@purdue.edu

Page 2: SCA2013 Presentation: A Web-Based Content Analysis Tool

Motivation

Social Media Discourse

Page 3: SCA2013 Presentation: A Web-Based Content Analysis Tool

Motivation

Hidden Insights On Human Behaviors & Social Phenomena

Page 6: SCA2013 Presentation: A Web-Based Content Analysis Tool

Motivation

Human-generated textual data on social media are:

Qualitative Data: requires qualitative interpretation

Large-scale Data: requires large-scale data mining techniques

Page 10: SCA2013 Presentation: A Web-Based Content Analysis Tool

Goal

To build a tool that:

Acquires social media data.

Integrates qualitative content analysis and data mining techniques to analyze textual data on social media.

Supports asynchronous collaboration among researchers.

Page 11: SCA2013 Presentation: A Web-Based Content Analysis Tool

Existing Tools

Social Media Analytics & Monitoring Tools

Focus on marketing

Do not usually incorporate human input

Page 12: SCA2013 Presentation: A Web-Based Content Analysis Tool

Existing Tools

Qualitative Analysis Tools

Complicated to use

Expensive

Do not acquire social media data

Page 13: SCA2013 Presentation: A Web-Based Content Analysis Tool

SWAB (Social Web Analytics Buddy)

Architecture diagram with steps numbered 1 to 5, showing these components: Social Media Content, API or Web Crawler, Data Server, Computation Server, Web Server, Web UI, Researchers

Page 15: SCA2013 Presentation: A Web-Based Content Analysis Tool

SWAB (Social Web Analytics Buddy)

Architecture diagram (same components and numbered steps as on Page 13), annotated with:

Twitter search API

Page 16: SCA2013 Presentation: A Web-Based Content Analysis Tool

SWAB (Social Web Analytics Buddy)

Architecture diagram (same components and numbered steps as on Page 13), annotated with:

MySQL
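
The slides name MySQL for the data server but do not show a schema. As a rough illustration only, the sketch below guesses at two tables, one for collected tweets and one for researchers' annotations. Every table and column name here is hypothetical, and Python's built-in `sqlite3` stands in for MySQL so the example runs without a database server.

```python
import sqlite3

# In-memory SQLite database used as a stand-in for the MySQL data server.
conn = sqlite3.connect(":memory:")

# Hypothetical schema: none of these names come from the presentation.
conn.executescript("""
CREATE TABLE tweets (
    tweet_id INTEGER PRIMARY KEY,
    dataset  TEXT NOT NULL,
    author   TEXT NOT NULL,
    body     TEXT NOT NULL
);
CREATE TABLE annotations (
    tweet_id   INTEGER NOT NULL REFERENCES tweets(tweet_id),
    researcher TEXT NOT NULL,
    theme      TEXT NOT NULL,
    comment    TEXT,
    PRIMARY KEY (tweet_id, researcher, theme)
);
""")

# One tweet, coded independently by two researchers.
conn.execute(
    "INSERT INTO tweets VALUES (?, ?, ?, ?)",
    (1, "dataset1", "user_a", "too much homework"),
)
conn.executemany(
    "INSERT INTO annotations VALUES (?, ?, ?, ?)",
    [
        (1, "alice", "study_problem", None),
        (1, "bob", "study_problem", "clear case"),
    ],
)

# Collect every researcher's code for a given tweet.
rows = conn.execute(
    "SELECT researcher, theme FROM annotations "
    "WHERE tweet_id = ? ORDER BY researcher",
    (1,),
).fetchall()
```

Storing one row per (tweet, researcher, theme) lets each collaborator code the same tweet at a different time, which is one plausible way to support the asynchronous workflow the slides describe.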

Page 19: SCA2013 Presentation: A Web-Based Content Analysis Tool

SWAB (Social Web Analytics Buddy)

Architecture diagram (same components and numbered steps as on Page 13), annotated with:

MySQL

Classification & Detection Modeling

Inter-rater Agreement Computation

Page 21: SCA2013 Presentation: A Web-Based Content Analysis Tool

SWAB (Social Web Analytics Buddy)

Architecture diagram (same components and numbered steps as on Page 13), annotated with:

MySQL

Sample tweets for researchers to analyze

Send results back to data server

Communicate with computation server

Page 23: SCA2013 Presentation: A Web-Based Content Analysis Tool

Web UI Design

Sketch

Page 24: SCA2013 Presentation: A Web-Based Content Analysis Tool

Web UI Design

Wireframes: the “Overview” tab

Project title

Page 25: SCA2013 Presentation: A Web-Based Content Analysis Tool

Web UI Design

Wireframes: the “Overview” tab

User account

Page 26: SCA2013 Presentation: A Web-Based Content Analysis Tool

Web UI Design

Wireframes: the “Overview” tab

Collaborator List

Page 27: SCA2013 Presentation: A Web-Based Content Analysis Tool

Web UI Design

Wireframes: the “Overview” tab

Datasource

Page 28: SCA2013 Presentation: A Web-Based Content Analysis Tool

Web UI Design

Wireframes: the “Overview” tab

Multiple datasets streamed using different criteria.

Page 29: SCA2013 Presentation: A Web-Based Content Analysis Tool

Web UI Design

Wireframes: the “Overview” tab

Export data and graphs.

Page 30: SCA2013 Presentation: A Web-Based Content Analysis Tool

Web UI Design

Wireframes: the “Overview” tab

Multiple visualizations and charts to provide data overview.

Page 31: SCA2013 Presentation: A Web-Based Content Analysis Tool

Web UI Design

Wireframes: the “Analyze” tab

Themes that emerged from exploring the data.

Page 32: SCA2013 Presentation: A Web-Based Content Analysis Tool

Web UI Design

Wireframes: the “Analyze” tab

Choose sample size.
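
The slides do not say how the sample is drawn once a size is chosen. A minimal sketch, assuming simple random sampling without replacement and a shared seed so that every collaborator codes the same tweets; the function name `sample_tweets` and the seed parameter are hypothetical:

```python
import random


def sample_tweets(tweet_ids, sample_size, seed=None):
    """Draw a simple random sample of tweet IDs without replacement.

    Hypothetical helper: passing the same seed yields the same sample,
    so all researchers on a project can be shown identical tweets.
    """
    tweet_ids = list(tweet_ids)
    if sample_size > len(tweet_ids):
        raise ValueError("sample size exceeds dataset size")
    rng = random.Random(seed)  # private generator, leaves global state alone
    return rng.sample(tweet_ids, sample_size)


# Illustrative dataset of 1,000 tweet IDs and a chosen sample size of 50.
dataset = range(1, 1001)
sample = sample_tweets(dataset, sample_size=50, seed=42)
```

Giving every researcher the same sample is an assumption made here because agreement between coders can only be measured on tweets that more than one person has analyzed.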

Page 33: SCA2013 Presentation: A Web-Based Content Analysis Tool

Web UI Design

Wireframes: the “Analyze” tab

Analyze tweets and write comments.

Page 34: SCA2013 Presentation: A Web-Based Content Analysis Tool

Web UI Design

Wireframes: the “Result” tab

All researchers’ results are aggregated in the background. Collaboration happens asynchronously. Reliability measures are computed.
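
The slides do not name the reliability measure. As one common choice for two coders, Cohen's kappa could be computed from the aggregated labels as in the sketch below. The function name and the toy labels are illustrative, and the actual tool may use a different statistic.

```python
from collections import Counter


def cohens_kappa(labels_a, labels_b):
    """Cohen's kappa for two coders who labeled the same items in the same order."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("both coders must label the same non-empty set of items")
    n = len(labels_a)
    # Observed agreement: share of items given the same label by both coders.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement: from each coder's marginal label frequencies.
    count_a = Counter(labels_a)
    count_b = Counter(labels_b)
    expected = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both coders used one identical category throughout
    return (observed - expected) / (1 - expected)


# Toy example: two researchers coding the same six tweets.
coder_a = ["complaint", "praise", "complaint", "other", "praise", "complaint"]
coder_b = ["complaint", "praise", "other", "other", "praise", "praise"]
kappa = cohens_kappa(coder_a, coder_b)  # approximately 0.52 for this toy data
```

Because each researcher's labels are stored independently, a measure like this can be recomputed in the background whenever a collaborator finishes coding, without the coders working at the same time.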

Page 35: SCA2013 Presentation: A Web-Based Content Analysis Tool

Web UI Design

Wireframes: the “Result” tab

Classification models can be trained based on the qualitative input.

Page 36: SCA2013 Presentation: A Web-Based Content Analysis Tool

Web UI Design

Wireframes: the “Model Application” tab

Apply the model trained on dataset1 to a new dataset (dataset2) to detect data in dataset2 that is similar to the data in dataset1.
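
The slides do not specify the classification algorithm. As a minimal sketch of the train-then-apply workflow, the example below assumes a multinomial naive Bayes text classifier with add-one smoothing, trained on researcher-coded tweets from one dataset and applied to uncoded tweets from another. All function names, labels, and tweets are made up for illustration.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    return text.lower().split()


def train_naive_bayes(labeled_tweets):
    """Train on (text, label) pairs, e.g. tweets coded by researchers."""
    label_counts = Counter()
    word_counts = defaultdict(Counter)
    vocabulary = set()
    for text, label in labeled_tweets:
        label_counts[label] += 1
        for word in tokenize(text):
            word_counts[label][word] += 1
            vocabulary.add(word)
    return label_counts, word_counts, vocabulary


def predict(model, text):
    """Return the most probable label for one tweet."""
    label_counts, word_counts, vocabulary = model
    total_docs = sum(label_counts.values())
    best_label, best_score = None, -math.inf
    for label, doc_count in label_counts.items():
        score = math.log(doc_count / total_docs)  # log prior
        total_words = sum(word_counts[label].values())
        for word in tokenize(text):
            if word not in vocabulary:
                continue  # skip words never seen during training
            # Add-one (Laplace) smoothing avoids zero probabilities.
            score += math.log(
                (word_counts[label][word] + 1) / (total_words + len(vocabulary))
            )
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# dataset1: tweets already coded by researchers (illustrative data).
dataset1 = [
    ("the exam was so hard today", "study_problem"),
    ("too much homework no sleep", "study_problem"),
    ("great game last night", "other"),
    ("love this sunny weather", "other"),
]
# dataset2: new, uncoded tweets.
dataset2 = ["so much homework again", "sunny game day"]

model = train_naive_bayes(dataset1)
predictions = [predict(model, tweet) for tweet in dataset2]
```

On this toy data the first new tweet is assigned `study_problem` and the second `other`; a real deployment would need far more coded tweets and a held-out evaluation before the detected data could be trusted.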

Page 37: SCA2013 Presentation: A Web-Based Content Analysis Tool

Web UI Design

Wireframes: the “Model Application” tab

Choose how to explore the detected data from the new dataset: view a list of tweets, user accounts, or a geomap.

Page 42: SCA2013 Presentation: A Web-Based Content Analysis Tool

Future Work

Design features to better support data exploration.

Explore NoSQL databases to handle large datasets.

Implement more sophisticated data mining and visualization features.

Page 43: SCA2013 Presentation: A Web-Based Content Analysis Tool

Thank you!

Q & A