20120622 web sci12-won-marc smith-semantic and social network analysis of …
Post on 28-Jan-2015
110 Views
Preview:
DESCRIPTION
TRANSCRIPT
Marc A. SmithChief Social ScientistConnected Action Consulting Groupmarc@connectedaction.nethttp://www.connectedaction.nethttp://nodexl.codeplex.com/
A project from the Social Media Research Foundation: http://www.smrfoundation.org
Semantic and Social Network
Analysis of Social Media with
NodeXL
Social Media Research FoundationPeople Disciplines Institutions
University Faculty
Computer Science University of Maryland
Students HCI, CSCW Oxford Internet Institute
Industry Machine Learning Stanford University
Independent Information Visualization Microsoft Research
Researchers UI/UX Illinois Institute of Technology
Developers Social Science/Sociology Connected Action
Network Analysis Cornell
Collective Action Morningside Analytics
About Me
Introductions
Marc A. SmithChief Social ScientistConnected Action Consulting Group
Marc@connectedaction.nethttp://www.connectedaction.nethttp://www.codeplex.com/nodexlhttp://www.twitter.com/marc_smithhttp://delicious.com/marc_smith/Paper http://www.flickr.com/photos/marc_smithhttp://www.facebook.com/marc.smith.sociologisthttp://www.linkedin.com/in/marcasmithhttp://www.slideshare.net/Marc_A_Smithhttp://www.smrfoundation.org
Like MSPaint™ for graphs
What we are trying to do:Open Tools, Open Data, Open Scholarship
• Build the “Firefox of GraphML” – open tools for collecting and visualizing social media data
• Connect users to network analysis – make network charts as easy as making a pie chart
• Connect researchers to social media data sources• Archive: Be the “Allen Very Large Telescope Array”
for Social Media data – coordinate and aggregate the results of many user’s data collection and analysis
• Create open access research papers & findings• Make “collections of connections” easy for users to
manage
What we have done: Open Tools
• NodeXL• Data providers (“spigots”)– ThreadMill Message Board– Exchange Enterprise Email– Voson Hyperlink– SharePoint– Facebook– Twitter– YouTube– Flickr
What we have done: Open Data
• NodeXLGraphGallery.org– User generated collection of
network graphs, datasets and annotations
– Collective repository for the research community
– Published collections of data from a range of social media data sources to help students and researchers connect with data of interest and relevance
Now Available
Group-in-a-box Layout
#occupywallstreet15 November 2011
#teaparty15 November 2011
http://www.newscientist.com/blogs/onepercent/2011/11/occupy-vs-tea-party-what-their.html
This graph represents a directed network of 1,360 Twitter users
whose recent tweets contained "contraceptive OR contraception". The network was obtained
on Friday, 08 June 2012 at 13:22 UTC. There is
an edge for each follows relationship. There is an edge for each "replies-
to" relationship in a tweet. There is an edge
for each "mentions" relationship in a tweet.
There is a self-loop edge for each tweet that is not
a "replies-to" or "mentions". The tweets were made over the 2-
day period from Thursday, 07 June 2012 at 18:46 UTC to Friday, 08 June 2012 at 13:06
UTC. The graph's vertices were grouped by cluster using the Clauset-Newman-Moore cluster
algorithm. The edge colors are based on
relationship values. The vertex sizes are based on
each user’s number of followers. Table 1
reports the summary network metrics that describe the graph.
Summary network metrics
The Vertices spreadsheet lists users who contributed a tweet containing the terms “contraception OR
contraceptives” over two days in early June 2012. Users are ranked by their computed betweenness centrality within the network of follows, replies, and mentions edges. The top 10 vertices, ranked by betweenness centrality are the accounts
at the center of the network. These include: @thinkprogress, @gatesfoundation, @SandraFluke,
@maleeek, @Change, @foxandfriends, @melindagates, @AshleyJudd, @cnalive, and @SOHLTC.
NodeXL calculates network metrics and
word pairs
Contrasting groups
The Content summary spreadsheet displays the most
frequently used URLs, hashtags, and user names within the
network as a whole and within each calculated sub-group.
Contrast hashtags in Groups 2 & 4
Contrasting URL references
Word Pair Contrasts
Marc A. SmithChief Social ScientistConnected Action Consulting Groupmarc@connectedaction.nethttp://www.connectedaction.nethttp://nodexl.codeplex.com/
A project from the Social Media Research Foundation: http://www.smrfoundation.org
Semantic and Social Network
Analysis of Social Media with
NodeXL
top related