benchmarking interactive social networking actions shahram ghandeharizadeh director of database lab...
![Page 1: Benchmarking Interactive Social Networking Actions Shahram Ghandeharizadeh Director of Database Lab Computer Science Department University of Southern](https://reader030.vdocuments.us/reader030/viewer/2022032612/56649ebb5503460f94bc3575/html5/thumbnails/1.jpg)
Benchmarking Interactive Social Networking Actions
Shahram Ghandeharizadeh
Director of Database Lab
Computer Science Department
University of Southern California
Outline
Motivation
Research questions
Survey use cases
BG Benchmark
FORSEE
Future research
Motivation
Data Stores
Cloud Services
Person-to-person cloud services
Research Questions
What is the tradeoff between alternative data models? E.g., is JSON superior to the relational data model?
How do alternative architectures compare with one another? E.g., is cache augmented SQL as good as a document/extensible store?
Do NewSQL data stores scale as well as NoSQL data stores?
Survey Use Case
S. Barahmand and S. Ghandeharizadeh. BG: A Benchmark to Evaluate Interactive Social Networking Actions. CIDR '13, Asilomar, CA.
Data Model
[ER diagram. Labels: Accounts, Members, Pages, Resources, News Feed, Friend, Follow, Own, Share, Displays]
BG Architecture
Scalable
Emulates User Behavior
Service Level Agreement
Quick and Efficient Rating
Visualization Tool
S. Barahmand and S. Ghandeharizadeh. Expedited Benchmarking of Social Networking Actions with Agile Data Loading Techniques. CIKM '13, SF, CA.
http://bgbenchmark.org
Good Benchmark = FORSEE
Focus on an important debate & provide relevant metrics to facilitate progress.
One number to describe alternative designs/solutions.
Runs in a reasonable amount of time.
Scalable.
Effective abstraction with meaningful requests.
Extendible.
Good Benchmark = FORSEE
F
O
R
S
E
E
+ Unpredictable data
SoAR
4 months to rate =
1 week to rate =
Only when two members are NOT friends!
FORSEE = PREDICT
A good benchmark helps settle debates quickly to enable its discipline to make rapid progress.
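The "one number" named on this slide is SoAR. A minimal sketch of how such a rating could be computed is given here, assuming (as in the cited BG paper) that SoAR is the highest observed throughput among runs that satisfy a pre-specified Service Level Agreement. The `RunResult` shape, the function names, and the SLA thresholds (95% of actions within 100 ms, at most 0.1% unpredictable data) are illustrative assumptions, not BG's actual code or configuration.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class RunResult:
    """Outcome of one experiment at a fixed load (hypothetical shape)."""
    throughput: float               # actions per second
    response_times_ms: List[float]  # one entry per issued action
    unpredictable_fraction: float   # share of actions that saw unpredictable data


def satisfies_sla(run, max_ms=100.0, min_fraction=0.95, max_unpredictable=0.001):
    """Check one run against an assumed SLA (thresholds are illustrative)."""
    if not run.response_times_ms:
        return False
    fast = sum(1 for t in run.response_times_ms if t <= max_ms)
    fast_enough = fast / len(run.response_times_ms) >= min_fraction
    return fast_enough and run.unpredictable_fraction <= max_unpredictable


def rate(runs, **sla):
    """Return the highest throughput among SLA-compliant runs, or 0.0 if none."""
    passing = [r.throughput for r in runs if satisfies_sla(r, **sla)]
    return max(passing, default=0.0)


example_runs = [
    RunResult(1000.0, [20.0] * 100, 0.0),
    RunResult(5000.0, [50.0] * 96 + [150.0] * 4, 0.0005),
    RunResult(9000.0, [50.0] * 90 + [300.0] * 10, 0.0),  # too many slow actions
]
rating = rate(example_runs)
```

Reducing a set of runs to a single number is what lets two data stores be compared directly; the choice of SLA decides which runs count.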
Future Research: Data Sciences
Challenge: Wide variety of science applications with diverse debates.
Hypothesis: A benchmark generator.
[Diagram. Labels: ER diagram, Actions & their dependencies, Key Metrics, Application (data science), Benchmark Generator, Specific Benchmark]
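One way to read the benchmark generator hypothesis is sketched here, assuming the ER diagram, the actions with their dependencies, and the key metrics are inputs and the specific benchmark is the output. Every name (`BenchmarkSpec`, `generate_benchmark`, the example actions and metrics) is hypothetical; the slides do not describe an implementation.

```python
from dataclasses import dataclass
from graphlib import TopologicalSorter  # standard library, Python 3.9+
from typing import Dict, List, Set


@dataclass
class BenchmarkSpec:
    """A generated, application-specific benchmark description (hypothetical)."""
    entities: List[str]
    relationships: List[str]
    action_order: List[str]  # actions ordered so dependencies come first
    metrics: List[str]


def generate_benchmark(er_diagram: Dict[str, List[str]],
                       actions: Dict[str, Set[str]],
                       metrics: List[str]) -> BenchmarkSpec:
    """Combine the three assumed inputs into one benchmark description.

    `actions` maps each action name to the set of actions it depends on.
    """
    for name, deps in actions.items():
        missing = set(deps) - set(actions)
        if missing:
            raise ValueError(f"{name} depends on unknown actions: {sorted(missing)}")
    # static_order() yields each action after all of its dependencies
    # and raises graphlib.CycleError if the dependencies are circular.
    order = list(TopologicalSorter(actions).static_order())
    return BenchmarkSpec(
        entities=list(er_diagram.get("entities", [])),
        relationships=list(er_diagram.get("relationships", [])),
        action_order=order,
        metrics=list(metrics),
    )


# Illustrative input loosely modeled on a social networking application.
spec = generate_benchmark(
    er_diagram={"entities": ["Members", "Resources"], "relationships": ["Friend"]},
    actions={
        "invite_friend": set(),
        "accept_friend": {"invite_friend"},
        "view_profile": set(),
    },
    metrics=["throughput", "response time", "unpredictable data"],
)
```

Ordering actions by dependency is one simple way to keep generated requests meaningful, for example never accepting a friend request that was not first issued.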
Future Research
Evaluate the hypothesis using BG.
Extend to other data science applications.
Benchmark Generator
Unpredictable data
Big Data: Operations
Simple / Complex
Off-line / Interactive
Ad-hoc / Pre-specified
Big Data: Google Analytics
Simple / Complex
Off-line / Interactive
Ad-hoc / Pre-specified
1. Gather click stream data: Optimized for writes.
2. Compute aggregated data: MapReduce/Hadoop.
3. Enable users to view aggregated data.
Objective:
1. Advertising ROI
2. Frequency of access to pages
Big Data: Facebook
Show profile page of Farrah Fawcett
Follow Barack Obama
Friend Lady Gaga
Simple / Complex
Off-line / Interactive
Ad-hoc / Pre-specified
3 Vs: Facebook
High Volume: 1.2 billion user profiles, 150 billion friend connections, 1.13 trillion likes, 17 billion tagged locations, 240 billion photos, ….
High Velocity: 700 million active users daily, 4.5 billion likes daily, 350 million photos uploaded daily, …
High Variety: Mix of data types: structured records, multimedia content, text.
Source: http://expandedramblings.com/index.php/by-the-numbers-17-amazing-facebook-stats/ posted on Oct 6, 2013.
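To put the volume and velocity figures in per-user and per-second terms, the quoted totals can be divided out as below. This is back-of-the-envelope arithmetic on the numbers from the slide only; the variable names are illustrative, and the ratios are simple averages that say nothing about skew or peak load.

```python
# Totals quoted on the slide (Oct 2013 figures).
profiles = 1.2e9
friend_connections = 150e9
daily_active_users = 700e6
daily_likes = 4.5e9
daily_photos = 350e6
seconds_per_day = 24 * 60 * 60

# Average friend connections per user profile.
connections_per_profile = friend_connections / profiles

# Average likes issued per active user per day.
likes_per_active_user = daily_likes / daily_active_users

# Average photo uploads per second, assuming uploads spread evenly over the day.
photos_per_second = daily_photos / seconds_per_day

print(f"{connections_per_profile:.0f} friend connections per profile")
print(f"{likes_per_active_user:.1f} likes per active user per day")
print(f"{photos_per_second:.0f} photo uploads per second")
```

Roughly 125 connections per profile, about 6.4 likes per active user per day, and about 4,000 photo uploads per second.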
Expertise/Contributions
BG Benchmark to evaluate performance of alternative data stores: SQL, NoSQL, NewSQL.
http://bgbenchmark.org
A high performance CASQL solution that minimizes software development life cycle.
KOSAR, a prototype of a CASQL solution.
Simple / Complex
Off-line / Interactive
Ad-hoc / Pre-specified
BG, http://bgbenchmark.org
Joint work with Sumita Barahmand.
Benchmark for interactive social networking actions.
Consists of 11 actions:
CASQL
Joint work with Jason Yap.
Key insight: Query result lookup is faster than query processing.
Contribution is physical data independence in CASQL systems:
Transparent caching
Serial schedules
Detection of race conditions and prevention of inconsistent states.
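The key insight can be illustrated with a small cache-augmented SQL sketch: look up a query's result by key first, and run the query only on a miss. This is a single-process illustration using Python's `sqlite3` and a dictionary; the `CachedStore` class and its coarse "clear everything on write" invalidation are assumptions made for brevity. It does not show the transparent caching, serial schedules, or race-condition handling listed on the slide.

```python
import sqlite3


class CachedStore:
    """Look up query results in a key-value cache before touching the RDBMS."""

    def __init__(self, conn):
        self.conn = conn
        self.cache = {}  # (sql, params) -> list of result rows
        self.hits = 0
        self.misses = 0

    def query(self, sql, params=()):
        key = (sql, tuple(params))
        if key in self.cache:
            self.hits += 1    # result lookup, no query processing
            return self.cache[key]
        self.misses += 1      # fall back to the RDBMS and remember the result
        rows = self.conn.execute(sql, params).fetchall()
        self.cache[key] = rows
        return rows

    def update(self, sql, params=()):
        self.conn.execute(sql, params)
        self.conn.commit()
        # Coarse invalidation (assumption): drop all cached results on any write
        # so a later read cannot return a stale row.
        self.cache.clear()


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE members (id INTEGER PRIMARY KEY, name TEXT)")
conn.execute("INSERT INTO members VALUES (1, 'alice')")
conn.commit()

store = CachedStore(conn)
first = store.query("SELECT name FROM members WHERE id = ?", (1,))   # miss
second = store.query("SELECT name FROM members WHERE id = ?", (1,))  # hit
store.update("UPDATE members SET name = ? WHERE id = ?", ("bob", 1))
third = store.query("SELECT name FROM members WHERE id = ?", (1,))   # miss again
```

With several application servers sharing one cache, a read and a write can interleave so that a stale result is re-inserted after invalidation; that is the kind of race the slide says a CASQL system must detect.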
KOSAR
Joint work with Reihane Boghrati, Lakshmy Mohanan and Neeraj Narang.
A software prototype of CASQL:
Scalable
Highly available
Elastic
Boosts performance of a leading industrial-strength RDBMS from 2 actions per second to more than 300,000 actions per second.
[Architecture diagram. Labels: BG Coordinator, Delta Analyzer, BGClient 1, BGClient 2, BGClient N, Load, Experiment, Agile Data Loading Techniques, Data Store Server, …]