appstores imc13

62
A Systematic Study of the Mobile App Ecosystem Thanasis Petsas, Antonis Papadogiannakis, Evangelos P. Markatos Michalis Polychronakis Thomas Karagiannis

Upload: thanasis-petsas

Post on 04-Jul-2015

39 views

Category:

Internet


0 download

DESCRIPTION

Presentation of the paper " Rise of the Planet of the Apps: A Systematic Study of the Mobile App Ecosystem" at Internet Measurement Conference (IMC) 2013: http://conferences.sigcomm.org/imc/2013/index.html The paper can be found here: http://conferences.sigcomm.org/imc/2013/papers/imc217-petsasA.pdf

TRANSCRIPT

Page 1: Appstores imc13

A Systematic Study of the Mobile App Ecosystem

Thanasis Petsas, Antonis Papadogiannakis, Evangelos P. Markatos

Michalis Polychronakis

Thomas Karagiannis

Page 2: Appstores imc13

Smartphone Adoption Explodes

• Smartphone adoption: – 10x faster than 80s PC revolution

– 2x faster than 90s Internet Boom

– 3x faster than social networks

• 1.4 B smartphones in use by 2013!

Source:

2

Page 3: Appstores imc13

Mobile Apps are Getting Popular

50B+ downloads

1M+ apps

50B+ downloads

900K+ apps

Windows Store

2B+ downloads

100K+ apps 3

Page 4: Appstores imc13

A Plethora of Marketplaces

• In addition to the official marketplaces...

• Many alternative markets

4

Page 5: Appstores imc13

Crawler Hosts

Data Collection

Marketplaces PlanetLab Proxies

App stats APKs

App stats APKs

App stats APKs

Database

Ap

p s

tats

AP

Ks

5

Page 6: Appstores imc13

Datasets

Appstore Crawling period

Total apps* New apps / day

Total downloads*

Daily downloads

SlideMe (free) 5 months 16,578 28.0 96 M 215.7 K

SlideMe (paid) 5 months 5,606 6.5 914 K 5.2 K

1Mobile 4.5 months 156,221 210.4 453 M 651.5 K

AppChina 2 months 55,357 336.0 2,623 M 24.1 M

Anzhi 2 months 60,196 29.6 2,816 M 23.7 M

* Last Day ~ 300K apps

Paid apps: • less downloads • fewer uploads

6

Page 7: Appstores imc13

App Popularity Is There a Pareto Effect?

Do

wn

load

s (%

) C

DF

Normalized App Ranking (%)

7

Page 8: Appstores imc13

App Popularity Is There a Pareto Effect?

Do

wn

load

s (%

) C

DF

Normalized App Ranking (%)

10% of the apps account for 90% of the downloads

7

Page 9: Appstores imc13

App Popularity Is There a Power-law Behavior?

8

Page 10: Appstores imc13

App Popularity Is There a Power-law Behavior?

Let’s focus on one appstore

8

Page 11: Appstores imc13

App Popularity Deviations from ZIPF

9

Page 12: Appstores imc13

App Popularity Deviations from ZIPF

WWW INFOCOM‘99

9

Page 13: Appstores imc13

App Popularity Deviations from ZIPF

WWW INFOCOM‘99

9

Page 14: Appstores imc13

App Popularity Deviations from ZIPF

WWW INFOCOM‘99

P2P SOSP’03

9

Page 15: Appstores imc13

App Popularity Deviations from ZIPF

WWW INFOCOM‘99

P2P SOSP’03

UGC IMC’07

9

Page 16: Appstores imc13

Truncation for small x values: Fetch-at-most-once

• Also observed in P2P workloads • Users appear to download an application at most once

P2P SOSP’03

simulations

10

Page 17: Appstores imc13

Truncation for large x values: clustering effect

• Other studies attribute this truncation to information filtering • Our suggestion: the clustering effect

UGC IMC’07

11

Page 18: Appstores imc13

App Clustering

• Apps are grouped into clusters

• App clusters can be formed by

– App categories

– Recommendation systems

– User communities

– Other grouping forces

12

Page 19: Appstores imc13

Clustering Hypothesis

• Users tend to download apps from the same clusters

I like Games!

I like Social apps!

13

Page 20: Appstores imc13

Validating Clustering Effect in User Downloads

Dataset: 361,282 user comment streams, 60,196 apps in 34 categories

14

Page 21: Appstores imc13

Validating Clustering Effect in User Downloads

Dataset: 361,282 user comment streams, 60,196 apps in 34 categories

53% of users commented on apps from a single category

14

Page 22: Appstores imc13

Validating Clustering Effect in User Downloads

Dataset: 361,282 user comment streams, 60,196 apps in 34 categories

94% of users commented on apps from up to 5 categories

14

Page 23: Appstores imc13

User Temporal Affinity

a1 a2 a3 a4 a5

User downloads

sequence a1, a2, a3, a4, a5

x

Aff1 = 0

Aff2 = 1

Aff3 = 1

x

Aff4 = 0

𝐴𝑓𝑓𝑖𝑛𝑖𝑡𝑦 = Affi 𝑛−1𝑖=1

𝑛 − 1

=0 + 1 + 1 + 0

4

= 0.5

15

Page 24: Appstores imc13

Users Exhibit a Strong Temporal Affinity to Categories

0.55

0.14

16

Page 25: Appstores imc13

Users Exhibit a Strong Temporal Affinity to Categories

0.55

0.14

3.9 x

16

Page 26: Appstores imc13

Modeling Appstore Workloads

. . .

Top

bottom

Ap

p p

op

ula

tiry

Reader Games Social Productivity

APP-CLUSTERING model

17

Page 27: Appstores imc13

Modeling Appstore Workloads

. . .

Top

bottom

Ap

p p

op

ula

tiry

Reader Games Social Productivity

APP-CLUSTERING model 1. Download the 1st app – overall app ranking

1

17

Page 28: Appstores imc13

Modeling Appstore Workloads

. . .

Top

bottom

Ap

p p

op

ula

tiry

Reader Games Social Productivity

APP-CLUSTERING model 1. Download the 1st app – overall app ranking

17

Page 29: Appstores imc13

Modeling Appstore Workloads

. . .

Top

bottom

Ap

p p

op

ula

tiry

Reader Games Social Productivity

APP-CLUSTERING model 1. Download the 1st app – overall app ranking 2. Download another app

2

17

Page 30: Appstores imc13

Modeling Appstore Workloads

. . .

Top

bottom

Ap

p p

op

ula

tiry

Reader Games Social Productivity

APP-CLUSTERING model 1. Download the 1st app – overall app ranking 2. Download another app 2.1 with prob. p from a previous app cluster c – cluster app ranking

2.1

p

17

Page 31: Appstores imc13

Modeling Appstore Workloads

. . .

Top

bottom

Ap

p p

op

ula

tiry

Reader Games Social Productivity

APP-CLUSTERING model 1. Download the 1st app – overall app ranking 2. Download another app 2.1 with prob. p from a previous app cluster c – cluster app ranking

17

Page 32: Appstores imc13

Modeling Appstore Workloads

. . .

Top

bottom

Ap

p p

op

ula

tiry

Reader Games Social Productivity

APP-CLUSTERING model 1. Download the 1st app – overall app ranking 2. Download another app 2.1 with prob. p from a previous app cluster c – cluster app ranking 2.2 with prob. 1-p – overall app ranking

2.2

1-p

17

Page 33: Appstores imc13

Modeling Appstore Workloads

. . .

Top

bottom

Ap

p p

op

ula

tiry

Reader Games Social Productivity

APP-CLUSTERING model 1. Download the 1st app – overall app ranking 2. Download another app 2.1 with prob. p from a previous app cluster c – cluster app ranking 2.2 with prob. 1-p – overall app ranking

17

Page 34: Appstores imc13

Modeling Appstore Workloads

. . .

Top

bottom

Ap

p p

op

ula

tiry

Reader Games Social Productivity

APP-CLUSTERING model 1. Download the 1st app – overall app ranking 2. Download another app 2.1 with prob. p from a previous app cluster c – cluster app ranking 2.2 with prob. 1-p – overall app ranking 3. If user’s downloads < d go to 2.

If downloaded apps < user downloads go to 2.

3

17

Page 35: Appstores imc13

Model Parameters Symbol Parameter Description

A Number of apps

D Total downloads

d Downloads per user (average)

C Number of clusters

U Number of users

zr Zipf exponent for overall app ranking

ZG Overall Zipf distribution of all apps

P Percentage of downloads based on clustering effect

zc Zipf exponent for cluster’s app ranking

Zc Zipf distribution of apps in cluster c

D(I,j) Predicted downloads for app with total rank i and rank j in its cluster 18

Page 36: Appstores imc13

Model Parameters Symbol Parameter Description

A Number of apps

D Total downloads

d Downloads per user (average)

C Number of clusters

U Number of users

zr Zipf exponent for overall app ranking

ZG Overall Zipf distribution of all apps

P Percentage of downloads based on clustering effect

zc Zipf exponent for cluster’s app ranking

Zc Zipf distribution of apps in cluster c

D(I,j) Predicted downloads for app with total rank i and rank j in its cluster 18

Page 37: Appstores imc13

Model Parameters Symbol Parameter Description

A Number of apps

D Total downloads

d Downloads per user (average)

C Number of clusters

U Number of users

zr Zipf exponent for overall app ranking

ZG Overall Zipf distribution of all apps

P Percentage of downloads based on clustering effect

zc Zipf exponent for cluster’s app ranking

Zc Zipf distribution of apps in cluster c

D(I,j) Predicted downloads for app with total rank i and rank j in its cluster

Number of downloads of the most popular app

18

Page 38: Appstores imc13

Results

AppChina

19

Page 39: Appstores imc13

Results

AppChina

19

Page 40: Appstores imc13

Results

AppChina

19

Page 41: Appstores imc13

Results

AppChina

19

Page 42: Appstores imc13

Results

AppChina

Model Distance from measured data ZIPF 0.77 ZIPF-at-most-once 0.71 APP-CLUSTERING 0.15

19

Page 43: Appstores imc13

App Pricing

• Main Questions:

– Which are the differences between paid & free apps?

– What is the developers’ income range?

– Which are the common developer strategies

• How do they affect revenue?

20

Page 44: Appstores imc13

The influence of cost

Free Paid

21

Page 45: Appstores imc13

The influence of cost

Clear power-law

Free Paid

21

Page 46: Appstores imc13

The influence of cost

Clear power-law

Free Paid

Users are more selective when downloading paid apps

21

Page 47: Appstores imc13

Developers’ Income

22 (USD)

Page 48: Appstores imc13

Developers’ Income

Median: < 10 $

22 (USD)

Page 49: Appstores imc13

Developers’ Income

80% < 100 $

22 (USD)

Page 50: Appstores imc13

Developers’ Income

95% < 1500 $

22 (USD)

Page 51: Appstores imc13

Developers’ Income

22 (USD)

Page 52: Appstores imc13

Developers’ Income

22 (USD)

Page 53: Appstores imc13

Developers’ Income

Quality is more important than quantity

22 (USD)

Page 54: Appstores imc13

Developers Create a Few Apps

23

Page 55: Appstores imc13

Developers Create a Few Apps

A large portion of developers create only 1 app

23

Page 56: Appstores imc13

Developers Create a Few Apps

95% of developers create < 10 apps

23

Page 57: Appstores imc13

Developers Create a Few Apps

10% of developers offer free & paid apps

23

Page 58: Appstores imc13

Can Free Apps Generate Higher Income Than Paid Apps?

Ne

cess

ary

ad in

com

e (

USD

)

Day

24

Page 59: Appstores imc13

Can Free Apps Generate Higher Income Than Paid Apps?

Ne

cess

ary

ad in

com

e (

USD

)

Day

Average: 0.21 $

24

Page 60: Appstores imc13

Can Free Apps Generate Higher Income Than Paid Apps?

Ne

cess

ary

ad in

com

e (

USD

)

Day

Average: 0.21 $

An average free app needs about 0.21 $/download to match the income of a paid app

24

Page 61: Appstores imc13

Conclusions

• App popularity: Zipf with truncated ends – Fetch-at-most-once

– Clustering effect

• Practical implications – New replacement policies for app caching

– Effective prefetching

– Better recommendation systems

– Increase income

25

Page 62: Appstores imc13

Thank you!

26