a longitudinal analysis of the ads.txt standard · imc 2019: a longitudinal analysis of the ads.txt...

Post on 25-Sep-2020

6 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

IMC 2019: A Longitudinal Analysis of the ads.txt Standard

A Longitudinal Analysis of

the ads.txt Standard

October 22, 2019

Muhammad Ahmad Bashir Sajjad Arshad Engin KirdaWilliam Robertson Christo Wilson

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 2

What is ads.txt Standard?

Introduced by Interactive Advertising Bureau (IAB)to combat ‘domain spoofing’ fraud

in Real Time Bidding (RTB) auctions

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 3

Online Advertising

Website / Publisher Advertising & Analytics (A&A) Companies

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 3

Online Advertising

Website / Publisher Advertising & Analytics (A&A) Companies

Users’ Profile

Users’ Profile

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 3

Online Advertising

Website / Publisher Advertising & Analytics (A&A) Companies

Users’ Profile

Users’ Profile

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 4

Shift Towards Real Time Bidding (RTB)Website / Publisher Advertising & Analytics (A&A) Companies

Observable via browser

Not observable

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 4

Shift Towards Real Time Bidding (RTB)Website / Publisher Advertising & Analytics (A&A) Companies

Observable via browser

Not observable

Ad Exchange

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 4

Shift Towards Real Time Bidding (RTB)

RTB Auction

Website / Publisher Advertising & Analytics (A&A) Companies

Observable via browser

Not observable

Ad Exchange Demand Side Platform (DSPs)

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 4

Shift Towards Real Time Bidding (RTB)

RTB Auction

Website / Publisher Advertising & Analytics (A&A) Companies

Observable via browser

Not observable

Advertisers

Ad Exchange Demand Side Platform (DSPs)

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 4

Shift Towards Real Time Bidding (RTB)

$0.05

$0.10

$0.20

RTB Auction

Website / Publisher Advertising & Analytics (A&A) Companies

Observable via browser

Not observable

Advertisers

Ad Exchange Demand Side Platform (DSPs)

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 4

Shift Towards Real Time Bidding (RTB)

$0.05

$0.10

$0.20

RTB Auction

Website / Publisher Advertising & Analytics (A&A) Companies

Observable via browser

Not observable

Advertisers

Ad Exchange Demand Side Platform (DSPs)

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 4

Shift Towards Real Time Bidding (RTB)

$0.05

$0.10

$0.20

RTB Auction

Website / Publisher Advertising & Analytics (A&A) Companies

RTB currently holds 42% share of US Digital Advertising.

Observable via browser

Not observable

Advertisers

Ad Exchange Demand Side Platform (DSPs)

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 5

An Example of RTB AuctionUserX Publisher Ad Exchange

UserX=xo$

UserX=ABC

Demand Side Platforms (DSPs)

UserX=123

Observable via browser

Not observable

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 5

An Example of RTB AuctionUserX Publisher Ad Exchange

UserX=xo$

UserX=ABC

Demand Side Platforms (DSPs)

GET, CNN’s CookieUserX=123

Observable via browser

Not observable

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 5

An Example of RTB AuctionUserX Publisher Ad Exchange

UserX=xo$

UserX=ABC

Demand Side Platforms (DSPs)

GET, CNN’s Cookie

GET, DoubleClick’s Cookie = 123

UserX=123

Observable via browser

Not observable

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 5

An Example of RTB Auction

RTB AuctionUserX Publisher Ad Exchange

UserX=xo$

UserX=ABC

Demand Side Platforms (DSPs)

GET, CNN’s Cookie

GET, DoubleClick’s Cookie = 123

UserX=123

Observable via browser

Not observable

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 5

An Example of RTB Auction

RTB AuctionUserX Publisher Ad Exchange

UserX=xo$

UserX=ABC

Demand Side Platforms (DSPs)

GET, CNN’s Cookie

GET, DoubleClick’s Cookie = 123

UserX=123

Bid for User 123 ?

Bid for User 123 ?

Observable via browser

Not observable

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 5

An Example of RTB Auction

RTB AuctionUserX Publisher Ad Exchange

UserX=xo$

UserX=ABC

Demand Side Platforms (DSPs)

GET, CNN’s Cookie

GET, DoubleClick’s Cookie = 123

UserX=123 $0.15

$0.40

Bid for User 123 ?

Bid for User 123 ?

Observable via browser

Not observable

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 5

An Example of RTB Auction

RTB AuctionUserX Publisher Ad Exchange

UserX=xo$

UserX=ABC

Demand Side Platforms (DSPs)

GET, CNN’s Cookie

GET, DoubleClick’s Cookie = 123

UserX=123 $0.15

$0.40

Bid for User 123 ?

Bid for User 123 ?

GET, RightMedia’s Cookie = ABC

Observable via browser

Not observable

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 5

An Example of RTB Auction

RTB AuctionUserX Publisher Ad Exchange

UserX=xo$

UserX=ABC

Demand Side Platforms (DSPs)

GET, CNN’s Cookie

GET, DoubleClick’s Cookie = 123

UserX=123 $0.15

$0.40

Bid for User 123 ?

Bid for User 123 ?

DSPs don’t receive the request directly from

the browser.

GET, RightMedia’s Cookie = ABC

Observable via browser

Not observable

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 6

Potential for Fraud in RTB AuctionsObservable via browser

Not observablePublishers Ad Exchanges

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 6

Potential for Fraud in RTB AuctionsObservable via browser

Not observablePublishers Ad Exchanges DSPs (Bidders)

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 6

Potential for Fraud in RTB AuctionsObservable via browser

Not observablePublishers Ad Exchanges DSPs (Bidders)

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 6

Potential for Fraud in RTB AuctionsObservable via browser

Not observablePublishers Ad Exchanges DSPs (Bidders)

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 6

Potential for Fraud in RTB AuctionsObservable via browser

Not observablePublishers Ad Exchanges DSPs (Bidders)

DSPs will think the user is on CNN, rather than

FNN

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 7

Ad Fraud = Loss in $$$

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 7

Ad Fraud = Loss in $$$

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 8

Programmatic Buying & Selling via ads.txt Standard

Authorized Digital Sellers (ads.txt) • Introduced by Interactive Advertising Bureau (IAB) in May 2017 • Enables publishers to list authorized exchanges • At the root of the top-level domain

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 8

Programmatic Buying & Selling via ads.txt Standard

Authorized Digital Sellers (ads.txt) • Introduced by Interactive Advertising Bureau (IAB) in May 2017 • Enables publishers to list authorized exchanges • At the root of the top-level domain

google.com, pub−7439281311086140, DIRECT

rubiconproject.com, 11078, DIRECT

c.amazon−adsystem.com, 3159, DIRECTopenx.com, 183753,RESELLER

https://cnn.com/ads.txt

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 8

Programmatic Buying & Selling via ads.txt Standard

Authorized Digital Sellers (ads.txt) • Introduced by Interactive Advertising Bureau (IAB) in May 2017 • Enables publishers to list authorized exchanges • At the root of the top-level domain

google.com, pub−7439281311086140, DIRECT

rubiconproject.com, 11078, DIRECT

c.amazon−adsystem.com, 3159, DIRECTopenx.com, 183753,RESELLER

https://cnn.com/ads.txt File Format:

<FIELD #1>, <FIELD #2>, <FIELD #3>

#1: Exchange domain * #2: Publisher’s account ID #3: Type of account/relationship

* Not always an exchange

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 9

Potential for Fraud in RTB AuctionsPublishers Observable via browser

Not observable

Ad Exchanges DSPs (Bidders)

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 9

Potential for Fraud in RTB AuctionsPublishers Observable via browser

Not observable

Ad Exchanges DSPs (Bidders)

google.com, pub−7439281311086140, DIRECT

rubiconproject.com, 11078, DIRECT

c.amazon−adsystem.com, 3159, DIRECTopenx.com, 183753,RESELLER

https://cnn.com/ads.txt

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 10

Why Study ads.txt standard?

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 10

Why Study ads.txt standard?Potential to combat domain spoofing fraud

• Check for any shortcomings.

• Lessons can be learnt for upcoming standards.

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 10

Why Study ads.txt standard?Potential to combat domain spoofing fraud

• Check for any shortcomings.

• Lessons can be learnt for upcoming standards.

Might be useful for privacy researchers

• Means of bringing transparency in the advertising ecosystem

• Publishers are disclosing their relationships with ad exchanges

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 11

Goals / Research Questions

1. Are publishers adopting the ads.txt standard?

2. Are major A&A companies embracing the standard?

• Which A&A companies have adopted?

• Do these companies appear to be complying with the standard?

3. Can this data be useful for transparency of the ad ecosystem?

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 12

Talk Overview

1. RTB background & motivation for the ads.txt standard

2. Data collection

3. Adoption trends

4. Compliance of the ads.txt standard

5. Means of providing transparency

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 13

Data Collection Methodology

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 13

Data Collection Methodology

Alexa top-100K publishers

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 13

Data Collection Methodologygoogle.com, pub−7439281311086140, DIRECT

rubiconproject.com, 11078, DIRECT

c.amazon−adsystem.com, 3159, DIRECT

openx.com, 183753,RESELLER

https://cnn.com/ads.txt

Alexa top-100K publishers

Using Python’s requests module

Fetched /ads.txt URL

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 13

Data Collection Methodologygoogle.com, pub−7439281311086140, DIRECT

rubiconproject.com, 11078, DIRECT

c.amazon−adsystem.com, 3159, DIRECT

openx.com, 183753,RESELLER

https://cnn.com/ads.txt

Alexa top-100K publishers

Using Python’s requests module

Fetched /ads.txt URLStored output

as base64

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 13

Data Collection Methodologygoogle.com, pub−7439281311086140, DIRECT

rubiconproject.com, 11078, DIRECT

c.amazon−adsystem.com, 3159, DIRECT

openx.com, 183753,RESELLER

https://cnn.com/ads.txt

Alexa top-100K publishers

Using Python’s requests module

Fetched /ads.txt URLStored output

as base64

Crawled 15 URLs perpublisher using

Chrome Debugging

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 13

Data Collection Methodologygoogle.com, pub−7439281311086140, DIRECT

rubiconproject.com, 11078, DIRECT

c.amazon−adsystem.com, 3159, DIRECT

openx.com, 183753,RESELLER

https://cnn.com/ads.txt

Alexa top-100K publishers

Using Python’s requests module

Fetched /ads.txt URLStored output

as base64

Crawled 15 URLs perpublisher using

Chrome Debugging

Stored Inclusion Chains

pub/index.html

ads/script.js

ads/frame.html

logo.jpg

ads/img_a.jpgadnet/

data.js

Example Inclusions

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 13

Data Collection Methodologygoogle.com, pub−7439281311086140, DIRECT

rubiconproject.com, 11078, DIRECT

c.amazon−adsystem.com, 3159, DIRECT

openx.com, 183753,RESELLER

https://cnn.com/ads.txt

Alexa top-100K publishers

Using Python’s requests module

Fetched /ads.txt URLStored output

as base64

• Collected 26 snapshots between January 2018 - April 2019

Crawled 15 URLs perpublisher using

Chrome Debugging

Stored Inclusion Chains

pub/index.html

ads/script.js

ads/frame.html

logo.jpg

ads/img_a.jpgadnet/

data.js

Example Inclusions

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 14

Authorized Exchanges Over Time

Number of authorized exchange domains over time

Num

ber o

f Exc

hang

es

All ExchangesValid Exchange

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 14

Authorized Exchanges Over Time

Number of authorized exchange domains over time

Num

ber o

f Exc

hang

es

All ExchangesValid Exchange Increase: 860 —> 1400

Total unique exchanges: 2381

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 14

Authorized Exchanges Over Time

Number of authorized exchange domains over time

Num

ber o

f Exc

hang

es

All ExchangesValid Exchange

Mistakes in ads.txt files

Increase: 860 —> 1400Total unique exchanges: 2381

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 14

Authorized Exchanges Over Time

Number of authorized exchange domains over time

Num

ber o

f Exc

hang

es

All ExchangesValid Exchange

Mistakes in ads.txt files

A. Syntax Errorsgoogle.com pub−7439281311086140, DIRECT

openx.com , pub−7439281311086140

Increase: 860 —> 1400Total unique exchanges: 2381

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 14

Authorized Exchanges Over Time

Number of authorized exchange domains over time

Num

ber o

f Exc

hang

es

All ExchangesValid Exchange

Mistakes in ads.txt files

A. Syntax Errorsgoogle.com pub−7439281311086140, DIRECT

openx.com , pub−7439281311086140

B. Semantic Errors

Increase: 860 —> 1400Total unique exchanges: 2381

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 14

Authorized Exchanges Over Time

Number of authorized exchange domains over time

Num

ber o

f Exc

hang

es

All ExchangesValid Exchange

Mistakes in ads.txt files

A. Syntax Errorsgoogle.com pub−7439281311086140, DIRECT

openx.com , pub−7439281311086140

B. Semantic Errors

rubicnprject.com, 11078, DIRECT Misspelled

Increase: 860 —> 1400Total unique exchanges: 2381

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 14

Authorized Exchanges Over Time

Number of authorized exchange domains over time

Num

ber o

f Exc

hang

es

All ExchangesValid Exchange

Mistakes in ads.txt files

A. Syntax Errorsgoogle.com pub−7439281311086140, DIRECT

openx.com , pub−7439281311086140

B. Semantic Errors

rubicnprject.com, 11078, DIRECT

doubleclick.net, 3159, DIRECT

Misspelled

Should be google.com

Increase: 860 —> 1400Total unique exchanges: 2381

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 14

Authorized Exchanges Over Time

Number of authorized exchange domains over time

Num

ber o

f Exc

hang

es

All ExchangesValid Exchange

Mistakes in ads.txt files

A. Syntax Errorsgoogle.com pub−7439281311086140, DIRECT

openx.com , pub−7439281311086140

B. Semantic Errorsrubicnprject.com, 11078, DIRECT

doubleclick.net, 3159, DIRECT

Misspelled

Should be google.com

Increase: 860 —> 1400Total unique exchanges: 2381

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 15

Fixing Semantic Errors

Increase: 860 —> 1400Total unique exchanges: 2381

Number of authorized exchange domains over time

Num

ber o

f Exc

hang

es

All ExchangesValid Exchange

rubicnprject.com, 11078, DIRECT

doubleclick.net, 3159, DIRECT

Misspelled

Should be google.com

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 15

Fixing Semantic Errors

Increase: 860 —> 1400Total unique exchanges: 2381

Number of authorized exchange domains over time

Fetch WHOIS recordNum

ber o

f Exc

hang

es

All ExchangesValid Exchange

rubicnprject.com, 11078, DIRECT

doubleclick.net, 3159, DIRECT

Misspelled

Should be google.com

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 15

Fixing Semantic Errors

Increase: 860 —> 1400Total unique exchanges: 2381

Number of authorized exchange domains over time

Fetch WHOIS record

Exists

1035 domains

Num

ber o

f Exc

hang

es

All ExchangesValid Exchange

rubicnprject.com, 11078, DIRECT

doubleclick.net, 3159, DIRECT

Misspelled

Should be google.com

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 15

Fixing Semantic Errors

Increase: 860 —> 1400Total unique exchanges: 2381

Number of authorized exchange domains over time

Fetch WHOIS record

Exists

1035 domains

Num

ber o

f Exc

hang

es

All ExchangesValid Exchange

rubicnprject.com, 11078, DIRECT

doubleclick.net, 3159, DIRECT

Misspelled

Should be google.com

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 15

Fixing Semantic Errors

Increase: 860 —> 1400Total unique exchanges: 2381

Number of authorized exchange domains over time

Fetch WHOIS record

Exists Not Found

1346 domains Perform DNS lookup1035 domains

Num

ber o

f Exc

hang

es

All ExchangesValid Exchange

rubicnprject.com, 11078, DIRECT

doubleclick.net, 3159, DIRECT

Misspelled

Should be google.com

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 15

Fixing Semantic Errors

Increase: 860 —> 1400Total unique exchanges: 2381

Number of authorized exchange domains over time

Fetch WHOIS record

Exists Not Found

1346 domains Perform DNS lookup1035 domains

No Resolution

Num

ber o

f Exc

hang

es

All ExchangesValid Exchange

rubicnprject.com, 11078, DIRECT

doubleclick.net, 3159, DIRECT

Misspelled

Should be google.com

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 15

Fixing Semantic Errors

Increase: 860 —> 1400Total unique exchanges: 2381

Number of authorized exchange domains over time

Fetch WHOIS record

Exists Not Found

1346 domains Perform DNS lookup1035 domains

No Resolution

Num

ber o

f Exc

hang

es

All ExchangesValid Exchange

rubicnprject.com, 11078, DIRECT

doubleclick.net, 3159, DIRECT

Misspelled

Should be google.com

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 16

Talk Overview

1. RTB background & motivation for the ads.txt standard

2. Data collection

3. Adoption trends

4. Compliance of the ads.txt standard

5. Means of providing transparency

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 17

How Have Publishers Adopted?

ads.txt adoption by Alexa Top-100K publishers over time

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 17

How Have Publishers Adopted?

ads.txt adoption by Alexa Top-100K publishers over time

Adoption has been modest Increase: 12% —> 20%

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 17

How Have Publishers Adopted?

ads.txt adoption by Alexa Top-100K publishers over time

Publishers that run RTB ads Adoption is pretty decent. ~64%

Adoption has been modest Increase: 12% —> 20%

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 18

Authorized Exchanges Discovered

Number of authorized exchange domains over time

Num

ber o

f Exc

hang

es

All ExchangesValid Exchange

1035 valid domains

Number of unique publishers per authorized exchange domain

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 18

Authorized Exchanges Discovered

Number of authorized exchange domains over time

Num

ber o

f Exc

hang

es

All ExchangesValid Exchange

1035 valid domains

Number of unique publishers per authorized exchange domain

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 18

Authorized Exchanges Discovered

Number of authorized exchange domains over time

Num

ber o

f Exc

hang

es

All ExchangesValid Exchange

1035 valid domains

Number of unique publishers per authorized exchange domain

Out of 1035 domains, 20% appear only on a single ads.txt file

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 19

Adoption Trends are Encouraging• 64% of the publishers from Alexa top-100K who run RTB ads.

• Major A&A domains support ads.txt

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 19

Adoption Trends are Encouraging• 64% of the publishers from Alexa top-100K who run RTB ads.

• Major A&A domains support ads.txt

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 20

Talk Overview

1. RTB background & motivation for the ads.txt standard

2. Data collection

3. Adoption trends

4. Compliance of the ads.txt standard

5. Means of providing transparency

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 21

A Simple Check for CompliancePublisher

Observable via browser

Not observableAd Exchange DSP (Bidder)

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 21

A Simple Check for CompliancePublisher

Observable via browser

Not observableAd Exchange DSP (Bidder)

google.com, pub−787, DIRECT

rubiconproject.com, 11078, DIRECTopenx.com, 183753,RESELLER

https://cnn.com/ads.txt

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 21

A Simple Check for CompliancePublisher

Observable via browser

Not observableAd Exchange DSP (Bidder)

google.com, pub−787, DIRECT

rubiconproject.com, 11078, DIRECTopenx.com, 183753,RESELLER

https://cnn.com/ads.txt✔

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 21

A Simple Check for CompliancePublisher

Observable via browser

Not observableAd Exchange DSP (Bidder)

google.com, pub−787, DIRECT

rubiconproject.com, 11078, DIRECTopenx.com, 183753,RESELLER

https://cnn.com/ads.txt✔

google.com, pub−787, DIRECT

rubiconproject.com, 11078, DIRECT

https://cnn.com/ads.txt

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 21

A Simple Check for CompliancePublisher

Observable via browser

Not observableAd Exchange DSP (Bidder)

google.com, pub−787, DIRECT

rubiconproject.com, 11078, DIRECTopenx.com, 183753,RESELLER

https://cnn.com/ads.txt✔

google.com, pub−787, DIRECT

rubiconproject.com, 11078, DIRECT

https://cnn.com/ads.txt✗

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 22

Checking Compliance at Scale

pub/index.html

ads/script.js

ads/frame.html

logo.jpg

ads/img_a.jpgadnet/

data.js

Example InclusionsAdvertisements (Detected Using EasyList)

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 22

Checking Compliance at Scale

pub/index.html

ads/script.js

ads/frame.html

logo.jpg

ads/img_a.jpgadnet/

data.js

Example InclusionsAdvertisements (Detected Using EasyList)

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 22

Checking Compliance at Scale

pub/index.html

ads/script.js

ads/frame.html

logo.jpg

ads/img_a.jpgadnet/

data.js

Example InclusionsAdvertisements (Detected Using EasyList)

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 23

Checking Compliance at Scale

pub/index.html

ads/script.js

ads/frame.html

logo.jpg

ads/img_a.jpgadnet/

data.js

Example InclusionsAdvertisements (Detected Using EasyList)

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 23

Checking Compliance at Scale

pub/index.html

ads/script.js

ads/frame.html

logo.jpg

ads/img_a.jpgadnet/

data.js

Example InclusionsAdvertisements (Detected Using EasyList)

Chain must have at least 3 elements

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 24

Extracting Tuples For Compliance Analysis

Advertisements (Detected Using EasyList)

Chain must have at least 3 elements

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 24

Extracting Tuples For Compliance Analysis

Advertisements (Detected Using EasyList)

Chain must have at least 3 elements

Associated Tuples

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 24

Extracting Tuples For Compliance Analysis

Advertisements (Detected Using EasyList)

Chain must have at least 3 elements

Associated Tuples

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 24

Extracting Tuples For Compliance Analysis

Advertisements (Detected Using EasyList)

Chain must have at least 3 elements

Associated Tuples

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 24

Extracting Tuples For Compliance Analysis

Advertisements (Detected Using EasyList)

Chain must have at least 3 elements

Associated Tuples

25

Compliance Per Tuple

ads.txt ads.txt

25

Compliance Per Tuple

ads.txt ads.txt

Percentage of non-compliant Exchange-DSP tuples per publisher

25

Compliance Per Tuple

ads.txt ads.txt

Percentage of non-compliant Exchange-DSP tuples per publisher

25

Compliance Per Tuple

ads.txt ads.txt

Percentage of non-compliant Exchange-DSP tuples per publisher

75% publishers have 100% compliant inventory

25

Compliance Per Tuple

ads.txt ads.txt

Percentage of non-compliant Exchange-DSP tuples per publisher

75% publishers have 100% compliant inventory

Compliance over time

25

Compliance Per Tuple

ads.txt ads.txt

Percentage of non-compliant Exchange-DSP tuples per publisher

75% publishers have 100% compliant inventory

Compliance over time

Compliance has increased over the year

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 26

Top Non-Compliant TuplesExchange /

SellerDSP / Buyer

% Acrossall pubs

gumgum domdex 20.4

gumgum appnexus 20.5

taboola weborama 52.7

taboola rubiconproject 11.6

gumgum pubmatic 27.3

justpremium openx 100

rubiconproject yahoo 2.63

springserve pubmatic 49.3

nativeroll weborama 100

nativeroll seedr 100

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 26

Top Non-Compliant TuplesExchange /

SellerDSP / Buyer

% Acrossall pubs

gumgum domdex 20.4

gumgum appnexus 20.5

taboola weborama 52.7

taboola rubiconproject 11.6

gumgum pubmatic 27.3

justpremium openx 100

rubiconproject yahoo 2.63

springserve pubmatic 49.3

nativeroll weborama 100

nativeroll seedr 100

Sellers

• Some sellers are systematically non-compliant (e.g. NativeRoll, JustPremium).

• Top exchanges do not appear as non-compliant.

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 26

Top Non-Compliant TuplesExchange /

SellerDSP / Buyer

% Acrossall pubs

gumgum domdex 20.4

gumgum appnexus 20.5

taboola weborama 52.7

taboola rubiconproject 11.6

gumgum pubmatic 27.3

justpremium openx 100

rubiconproject yahoo 2.63

springserve pubmatic 49.3

nativeroll weborama 100

nativeroll seedr 100

Sellers

• Some sellers are systematically non-compliant (e.g. NativeRoll, JustPremium).

• Top exchanges do not appear as non-compliant.

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 26

Top Non-Compliant TuplesExchange /

SellerDSP / Buyer

% Acrossall pubs

gumgum domdex 20.4

gumgum appnexus 20.5

taboola weborama 52.7

taboola rubiconproject 11.6

gumgum pubmatic 27.3

justpremium openx 100

rubiconproject yahoo 2.63

springserve pubmatic 49.3

nativeroll weborama 100

nativeroll seedr 100

Sellers

• Some sellers are systematically non-compliant (e.g. NativeRoll, JustPremium).

• Top exchanges do not appear as non-compliant.

Buyers

• Most are actually top exchanges (sellers).

• 8/10 top sellers are buying non-compliant inventory.

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 27

Compliance of ads.txt Standard

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 27

Top exchanges and DSPs appear to be honoring the standard

Compliance of ads.txt Standard

More than 75% of publishers that run RTB have 100% compliance

Major ad exchanges purchasing non-compliant inventory

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 28

Talk Overview

1. RTB background & motivation for the ads.txt standard

2. Data collection

3. Adoption trends

4. Compliance of the ads.txt standard

5. Means of providing transparency

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 29

Identification of A&A DomainsEfforts to identify Advertising and Analytics related domains.

• Filter lists (e.g. EasyList) are manually curated

• Machine learning based approaches

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 29

Identification of A&A DomainsEfforts to identify Advertising and Analytics related domains.

• Filter lists (e.g. EasyList) are manually curated

• Machine learning based approaches

google.com, pub−7439281311086140, DIRECT

rubiconproject.com, 11078, DIRECT

c.amazon−adsystem.com, 3159, DIRECT

openx.com, 183753,RESELLER RESELLER

https://cnn.com/ads.txt

indexexchange.com, 197545, DIRECT

adtech.com, 232547, DIRECTopenx.com, 203799,RESELLER

https://washingtonpost.com/ads.txt

1035 valid domainsTop Exchanges like Google,

openX have adopted

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 30

New Standards Coming As Well

Have the potential to make the ecosystem even more transparent

• App-ads.txt (For mobile apps)

• Sellers.json (To reveal relationships of sellers in the ecosystem)

IMC 2019: A Longitudinal Analysis of the ads.txt Standard 31

Conclusions / Takeaways• A 15-month study to study the ads.txt standard

• We observed heathy adoption (64% publishers have adopted)

• Major exchanges & DSPs are part of it, and honoring the standard

• A step towards making the ecosystem transparent

• Data needs to be cleaned

Questions?ahmad@ccs.neu.edu

Datasethttps://personalization.ccs.neu.edu/Projects/Adstxt

top related