Tutorial 11 (Computational Advertising)


DESCRIPTION

Part of the Search Engine course given at the Technion (2011)

TRANSCRIPT

Computational Advertising

Kira Radinsky

Slides based on material from the paper "Bandits for Taxonomies: A Model-based Approach" by Sandeep Pandey, Deepak Agarwal, Deepayan Chakrabarti, and Vanja Josifovski, SDM 2007.

The Content Match Problem

[Diagram: advertisers place ads into an ads DB; the ad server shows ads alongside content pages]

Ad impression: showing an ad to a user.

The Content Match Problem

[Same diagram, now highlighting the user's click]

Ad click: a user's click leads to revenue for the ad server and the content provider.

The Content Match Problem

[Same diagram]

The Content Match Problem: match ads to pages so as to maximize clicks.

The Content Match Problem

[Same diagram]

Maximizing the number of clicks means: for each webpage, find the ad with the best Click-Through Rate (CTR), but without wasting too many impressions on learning this.

Outline

• Problem
• Background: Multi-armed bandits
• Proposed Multi-level Policy
• Experiments
• Conclusions

Background: Bandits

[Diagram: three bandit "arms" with unknown payoff probabilities p1, p2, p3]

Pull arms sequentially so as to maximize the total expected reward:

• Estimate the payoff probabilities p_i
• Bias the estimation process towards better arms

Background: Bandit Solutions

• Try 1: Greedy solution: compute the sample mean of an arm by dividing the total reward received from the arm by the number of times the arm has been pulled; at each time step, choose the arm with the highest sample mean.

• Try 2: Naïve solution: pull each arm an equal number of times.

• Epsilon-greedy strategy: the arm with the best sample mean is selected for a proportion 1 − ε of the trials, and another arm is selected uniformly at random for a proportion ε (see the sketch after this list).

• Many more strategies exist.
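To make the epsilon-greedy strategy concrete, here is a minimal sketch in Python; the arm payoff probabilities, ε, and step count are illustrative, not from the slides.

```python
import random

def epsilon_greedy(true_probs, epsilon=0.1, steps=10_000):
    """Epsilon-greedy bandit over arms with Bernoulli payoffs."""
    n_arms = len(true_probs)
    pulls = [0] * n_arms      # times each arm was pulled
    rewards = [0.0] * n_arms  # total reward per arm

    for _ in range(steps):
        if random.random() < epsilon:
            # Explore: pick an arm uniformly at random
            arm = random.randrange(n_arms)
        else:
            # Exploit: pick the arm with the highest sample mean so far
            arm = max(range(n_arms),
                      key=lambda i: rewards[i] / pulls[i] if pulls[i] else 0.0)
        reward = 1.0 if random.random() < true_probs[arm] else 0.0
        pulls[arm] += 1
        rewards[arm] += reward
    return pulls, rewards

# Three arms with payoff probabilities unknown to the policy:
pulls, rewards = epsilon_greedy([0.02, 0.05, 0.03])
```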

Background: Bandits

Bandit policy:

1. Assign a priority to each arm (allocation)
2. "Pull" the arm with the maximum priority, and observe the reward
3. Update the priorities (estimation)

[Diagram: three arms, each labeled with its priority; the allocation step picks the arm, the estimation step updates the priorities]
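The slides leave the priority function abstract; one standard concrete choice is UCB1 (sample mean plus a confidence bound). A sketch of the allocate/observe/update loop under that assumption, with illustrative parameters:

```python
import math
import random

def ucb1(true_probs, steps=10_000):
    """Priority-based bandit loop: pull the max-priority arm, then re-estimate."""
    n = len(true_probs)
    pulls = [0] * n
    rewards = [0.0] * n

    def priority(i, t):
        if pulls[i] == 0:
            return float("inf")  # ensure every arm is tried at least once
        # UCB1 priority: sample mean + exploration bonus
        return rewards[i] / pulls[i] + math.sqrt(2 * math.log(t) / pulls[i])

    for t in range(1, steps + 1):
        arm = max(range(n), key=lambda i: priority(i, t))  # allocation
        reward = 1.0 if random.random() < true_probs[arm] else 0.0
        pulls[arm] += 1                                    # estimation
        rewards[arm] += reward
    return pulls, rewards
```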

Background: Bandits

Why not simply apply a bandit policy directly to the problem?

• Convergence is too slow: there are ~10⁹ instances of the MAB problem (one bandit per page), with ~10⁶ arms (ads) per instance
• Additional structure is available that can help: taxonomies

Outline

• Problem
• Background: Multi-armed bandits
• Proposed Multi-level Policy
• Experiments
• Conclusions

Multi-level Policy

[Diagram: two taxonomies, one over ads and one over webpages, each organized into classes and subclasses]

Consider only two levels.

Multi-level Policy

[Diagram: page classes crossed with ad classes. Ad parent classes (Apparel, Computers, Travel) group the ad child classes; the rectangle formed by one page parent class and one ad parent class is a block, and each block is one MAB problem instance (bandit)]

Consider only two levels.

Multi-level Policy

[Same block diagram]

Key idea: CTRs in a block are homogeneous.

Multi-level Policy

• CTRs in a block are homogeneous

– Used in allocation (picking an ad for each new page)

– Used in estimation (updating priorities after each observation)

Multi-level Policy

• CTRs in a block are homogeneous
  – Used in allocation (picking an ad for each new page)
  – Used in estimation (updating priorities after each observation)

[Diagram: block matrix, cells labeled by class initials (A = Apparel, C = Computers, T = Travel)]

Multi-level Policy (Allocation)

[Diagram: a new page is sent through a page classifier into the block matrix]

• Classify the webpage → page class, parent page class
• Run a bandit on the ad parent classes → pick one ad parent class

Multi-level Policy (Allocation)

• Classify the webpage → page class, parent page class
• Run a bandit on the ad parent classes → pick one ad parent class
• Run a bandit among the cells of that block → pick one ad class
• In general, continue from root to leaf → final ad (see the sketch below)

[Diagram: the page classifier feeds the block matrix; the chosen cell yields the ad to show]
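A minimal sketch of this top-down allocation, assuming one epsilon-greedy bandit per (page class, taxonomy node) pair; the Node/Bandit classes and all names here are illustrative, not the paper's implementation.

```python
import random
from dataclasses import dataclass, field

@dataclass(eq=False)
class Node:
    name: str
    children: list = field(default_factory=list)  # empty list = leaf (an ad)

@dataclass
class Bandit:
    """Epsilon-greedy bandit over one node's children."""
    epsilon: float = 0.1
    pulls: dict = field(default_factory=dict)
    clicks: dict = field(default_factory=dict)

    def select(self, arms):
        if random.random() < self.epsilon:
            return random.choice(arms)                    # explore
        return max(arms, key=lambda a: self.clicks.get(a.name, 0)
                   / max(self.pulls.get(a.name, 0), 1))   # exploit

    def update(self, arm, clicked):
        self.pulls[arm.name] = self.pulls.get(arm.name, 0) + 1
        self.clicks[arm.name] = self.clicks.get(arm.name, 0) + int(clicked)

def allocate_ad(page_class, root, bandits):
    """Run one bandit per level, walking the ad taxonomy from root to leaf."""
    path, node = [], root
    while node.children:
        bandit = bandits.setdefault((page_class, node.name), Bandit())
        node = bandit.select(node.children)
        path.append((bandit, node))
    return node, path  # the leaf ad, plus the bandits to update afterwards
```

After the impression, each (bandit, chosen child) pair on the returned path would be updated with the observed click or non-click.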

Multi-level Policy (Allocation)

Bandits at higher levels:

• use aggregated information
• have fewer bandit arms

→ they quickly figure out the best ad parent class.

Multi-level Policy

• CTRs in a block are homogeneous
  – Used in allocation (picking an ad for each new page)
  – Used in estimation (updating priorities after each observation)

Multi-level Policy (Estimation)

• CTRs in a block are homogeneous

– Observations from one cell also give information about others in the block

– How can we model this dependence?

Multi-level Policy (Estimation)

• Shrinkage model:

  S_cell | CTR_cell ~ Binomial(N_cell, CTR_cell)
  CTR_cell ~ Beta(params_block)

  where S_cell = # clicks in the cell and N_cell = # impressions in the cell.

• All cells in a block come from the same distribution.
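The step from this model to the estimate on the next slide is standard Beta-Binomial conjugacy; spelling it out (with Beta(a, b) as the block prior):

```latex
\mathbb{E}[\mathrm{CTR}_{\text{cell}} \mid S_{\text{cell}}]
  = \frac{a + S_{\text{cell}}}{a + b + N_{\text{cell}}}
  = \underbrace{\frac{a + b}{a + b + N_{\text{cell}}}}_{\alpha}
    \cdot \underbrace{\frac{a}{a + b}}_{\text{prior mean}}
  + (1 - \alpha) \cdot \frac{S_{\text{cell}}}{N_{\text{cell}}}
```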

Multi-level Policy (Estimation)

• Intuitively, this leads to shrinkage of cell CTRs towards the block CTR:

  E[CTR] = α · Prior_block + (1 − α) · S_cell / N_cell

  estimated CTR = α · (Beta prior, the "block CTR") + (1 − α) · (observed CTR)
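A small numeric sketch of this shrinkage; the Beta parameters and click counts below are made up for illustration.

```python
def shrunk_ctr(clicks, impressions, a_block, b_block):
    """Posterior-mean CTR of a cell under a Beta(a, b) block prior.

    Equals alpha * prior_mean + (1 - alpha) * observed_ctr,
    with alpha = (a + b) / (a + b + impressions).
    """
    return (a_block + clicks) / (a_block + b_block + impressions)

# Hypothetical block prior Beta(2, 98): prior mean 0.02.
print(shrunk_ctr(1, 3, 2, 98))        # ~0.029: only 3 impressions, shrunk hard to the prior
print(shrunk_ctr(500, 10_000, 2, 98)) # ~0.0497: enough data to keep the observed 0.05
```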

Outline

• Problem
• Background: Multi-armed bandits
• Proposed Multi-level Policy
• Experiments
• Conclusions

Experiments [Pandey et al., 2007]

[Taxonomy structure: Root at depth 0; 20 nodes at depth 1; 221 nodes at depth 2; continuing down to ~7000 leaves at depth 7. The multi-level policy uses the two levels at depths 1 and 2.]

Experiments

• Data collected over a one-day period

• Collected from only one server, under different ad-matching rules (not our bandit policy)

• ~229M impressions

• CTR values have been linearly transformed for purposes of confidentiality

Experiments (Multi-level Policy)

[Plot: clicks vs. number of pulls]

The multi-level policy gives a much higher number of clicks.

Experiments (Multi-level Policy)

[Plot: mean-squared error vs. number of pulls]

The multi-level policy gives a much better mean-squared error: it has learnt more from its explorations.

Conclusions

• In a CTR-guided system, exploration is a key component
• The short-term penalty for exploration needs to be limited (an exploration budget)
• Most exploration mechanisms use a weighted combination of the predicted CTR (mean) and the CTR uncertainty (variance)
• Explore in a reduced-dimensional space: the class hierarchy
• Traverse the hierarchy top-down to determine the class of the ad to show
