massive streaming data analytics: a case study with clustering coefficients david ediger karl jiang...

Massive Streaming Data Analytics: A Case Study with Clustering Coefficients David Ediger Karl Jiang Jason Riedy David A. Bader Georgia Institute of Technology Atlanta, GA USA 1

Post on 22-Dec-2015

212 views

Category:

Documents

0 download

Report

Download

Embed Size (px):

TRANSCRIPT

Massive Streaming Data Analytics:A Case Study with Clustering Coefficients

David EdigerKarl Jiang

Jason RiedyDavid A. Bader

Georgia Institute of TechnologyAtlanta, GA USA

STINGER Data Structure

• Spatio-temporal Interaction Networks and Graphs (STING) Extensible Representation

• General-purpose data structure for dynamic graphs

• Efficient edge insertion/deletion (updates) with concurrent readers (analysis)

Page 3: Massive Streaming Data Analytics: A Case Study with Clustering Coefficients David Ediger Karl Jiang Jason Riedy David A. Bader Georgia Institute of Technology

STINGER Data Structure

• Array of linked lists, which may have empty slots (from deleting edges)

• Additional storedinfo not in paper

• Efficient updates• Concurrent reads

(no locking)

Page 4: Massive Streaming Data Analytics: A Case Study with Clustering Coefficients David Ediger Karl Jiang Jason Riedy David A. Bader Georgia Institute of Technology

Assumptions for parallelism

• Single streaming source for inserts/deletes• Changes are scattered widely– Batches are sufficiently independent

• Analysis kernels have small range– Graph change only requires access to local

portions and affects small portion of output

Page 5: Massive Streaming Data Analytics: A Case Study with Clustering Coefficients David Ediger Karl Jiang Jason Riedy David A. Bader Georgia Institute of Technology

Assumptions (continued)

Page 6: Massive Streaming Data Analytics: A Case Study with Clustering Coefficients David Ediger Karl Jiang Jason Riedy David A. Bader Georgia Institute of Technology

Case Study:Updating Clustering Coefficients

• Clustering coefficients measure density of closed triangles:

• One way of determining if a graph is a small-world graph

Page 7: Massive Streaming Data Analytics: A Case Study with Clustering Coefficients David Ediger Karl Jiang Jason Riedy David A. Bader Georgia Institute of Technology

Bloom filter

• Consider an edge list represented as a bit array (1 bit per edge) => O(n) storage space

• Bloom filter is a bit array with an arbitrary, smaller number of bits

• A hash function maps a vertex to a specific bit• Small number of bits == high collision rate• To reduce false-positives, use k independent

hash functions to set multiple bits

Page 8: Massive Streaming Data Analytics: A Case Study with Clustering Coefficients David Ediger Karl Jiang Jason Riedy David A. Bader Georgia Institute of Technology

Bloom filter

Page 9: Massive Streaming Data Analytics: A Case Study with Clustering Coefficients David Ediger Karl Jiang Jason Riedy David A. Bader Georgia Institute of Technology

Testbed

• Massively multi-threaded Cray XMT– 64 Threadstorm processors• Each running at 500MHz• Each has 128 hardware streams maintaining a thread

context• Context switches occur every cycle• 512 GiB globally addressable shared memory

– (holds 2 billion vertices and 17 billion edges)

• Synthetic data– 16 million vertices, ~500 million edges

Mark J. Riedy, Esq. Andrews Kurth LLP 1350 I Street, NW, Suite 1100, Washington, D.C., USA 20005 Mobile: +1-703-201-6677 Email:[email protected]

Systemic Impacts of Mini-publics - newdemocracy.com.au€¦ · Please cite as: Riedy, C and Kent, J, 2017. Systemic Impacts of Mini-publics. Report prepared for new Democracy Foundation

High Performance Ficon Demystified, Update and User Experience · High Performance Ficon Demystified, Update and User Experience Dale Riedy IBM [email protected] 8 August 2012 Session

· Web viewAllen Riedy, “History of the Chinese, Japanese, and Korean Collection of the University of Hawaii at Manoa.” In Peter Zhou, Collecting Asia: East Asian Libraries

School of Arts & Sciences | School of Arts and Sciences ...crulli/tech paper.doc · Web viewAmerican School Board Journal 2003: 34-37. Ediger, Marlow. “Technology in the school

By: Zach Riedy. Electricity generated by harnessing the power of the gravitational force of moving water. It is the most widely used form of renewable

Power and Control in Networked Sensors E. Jason Riedy and Robert Szewczyk Presenter: Fayun Luo

Cassidy Alexis Ediger, an infant by her Cassidy Alexis

Programmer en Java OC Informatique 17 – 18salvadore/Burier01/OC/Document/Java.pdf · Exercice1.3 Rédiger l’en-tête d’une méthode publique nommée send, disposant d’un

USGS · Pine Hall Formation o E o c o Melvin Beach Member Fownes Head Member ... Ediger, V.S., 1986, Paleopalynological biostratigraphy, organic matter deposition, and basin analysis

CTE Social Media Outreach 2010. Nice to meet you. Introduction: Winona Dimeo-Ediger My experience with social media: -Blogging/podcasts/media interviews

Perkins IV Secondary Accountability Data Michelle Kamenov Kari-Ann Ediger “Leading for educational excellence and equity. Every day for every one.”

Streaming Graph Analysis A Statistical Framework forstingergraph.com/data/uploads/asonam2013_slides.pdf · Streaming Graph Analysis James Fairbanks, David Ediger, Rob McColl, David

UTSpeaks: Progress or procrastination? (Part 3 - Chris Riedy and open forum)

Laura Ediger

37220090 Defining Biomass by Guest Author Mark J Riedy Esq

Health Promotion and Disease Prevention Roxanne Riedy, MSN Marilee Elias, MSN

The 1950’s By: Margaret Riedy,Kendra Laurens,Theo Ferguson, & Connor Duffy

sub018 moreland energy foundation attachment · Web viewFor Moreland Energy Foundation Ltd Authors: Christopher Riedy, Erin Wilson, Helen Cheney, Keith Tarlo Institute for Sustainable

PO.250 Thomas Bahr - Harris Geospatial...1Barrett M. Sather, 2Joshua M. Riedy& 3Thomas Bahr 1Harris GeospatialSolutions Inc., 2EdgeData, 3Harris GeospatialSolutions GmbH PO.250 Introduction

Finding Transformative Narratives - OPUS at UTS: Home · 2020. 3. 14. · Transformations2019 Finding Transformative Narratives Exploring our common ground Chris Riedy. Sandra Waddock

1 Mark J. Riedy, Esq. Andrews Kurth LLP 1350 I Street, NW, Suite 1100, Washington, D.C., USA 20005 Mobile: +1-703-201-6677 Email:[email protected]

Connecticut Journal of Science Education · Connecticut Science Teachers Association 4 Self Efficacy and The Science Teacher Dr. Marlow Ediger 6 Climate Kids SciJinks-NASA 8 You Dirty…

· Web viewCommunity EmPOWERment. Final Research Report. For Moreland Energy Foundation Ltd. Authors: Christopher Riedy, Erin Wilson, Helen Cheney, Keith Tarlo. Institute for

Streaming Graph Analytics for Massive Graphs · 7/10/2012 · Streaming Graph Analytics for Massive Graphs Jason Riedy, David A. Bader, David Ediger Georgia Institute of Technology

COMMUNITY HEALTH ROXANNE RIEDY MSN MARILEE ELIAS MSN, CNE

English Practice - bctela.ca · Krista Ediger (Richmond) [email protected] Student Writing Journal Editor Cindy Miller (Fort St. James) [email protected] Student Liaison Linda Mei

2014 Capital Pet - Mohawk Hudson Humane Society€¦ · Garden Social Club Members Virginia Riedy Patricia Bredenko Courtney Fitzgerald and Bob & Grace O'Brien Barbara Riley NYS DOH

SPEECH PATHOLOGIST ROLE IN BREATHING AND COMMUNICATION CHANGES FOLLOWING A TOTAL LARYNGECTOMY MID KANSAS EAR, NOSE & THROAT ASSOCIATES RENEE’ L EDIGER,

Sea-floor sediments and bedforms around Turkey, revealed ...old.ims.metu.edu.tr/pdf/324.pdf · Mahmut OKYAR and Vedat EDIGER Middle East Technical University, Institute of Marine

Non-Traditional Security and Multilateralism in Asia · PDF fileNon-Traditional Security and Multilateralism in Asia Mikaela Ediger Europe and Asia January 27, 2014 Sunday, January

r. · ,ED 236 S78-lAUTROR TITLE PUB DATE. NOTE:" PUB TYPE. EDRS PRICE DESCRIPTORS. DOCUMENT RESUME. r. CS 007 368. Ediger, Marlow. Appraising Learner Progress in Reading

Overcoming Operational Challenges of Investing in India and China Mark J. Riedy Partner, Andrews Kurth LLP 1350 I Street, NW Suite 1100 Washington, DC

Novel Architectures for Applications in Data Science and ... · Data Science and Beyond Jason Riedy, Jeffrey Young, Tom Conte Center for Research into Novel Computing Hierarchies

Kindergarten Newsletter Week of October 12, 2015 Mrs. Riedy Kindergarten Newsletter Week of October 12, 2015 Mrs. Riedy *I loved seeing the responses from