intro to computational advertising - stanford university

89
MS&E 239 Stanford University Autumn 2010 Instructors: Andrei Broder and Vanja Josifovski Introduction to Computational Advertising 1

Upload: others

Post on 03-Feb-2022

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Intro to Computational Advertising - Stanford University

MS&E 239Stanford University

Autumn 2010Instructors: Andrei Broder and Vanja Josifovski

Introduction to Computational Advertising

1

Page 2: Intro to Computational Advertising - Stanford University

This lecture1. Logistics of the course2. Introduction & Overview of Computational

Advertising

2

Page 3: Intro to Computational Advertising - Stanford University

Instructor Dr. Vanja Josifovski

Principal Research Scientist at Yahoo! Research. Research Area: Computational Advertising – Performance

Advertising Previously at IBM Research working on databases and enterprise

search M.Sc. from University of Florida, PhD from Linkopings University in

Sweden. [email protected] http://research.yahoo.com/Vanja_Josifovski

3

Page 4: Intro to Computational Advertising - Stanford University

Dr. Andrei Broder

Yahoo! Fellow and Vice President for Computational Advertising

Research interests: computational advertising, web search, context-driven information supply, and randomized algorithms.

M.Sc. and Ph.D. in Computer Science at Stanford under Don Knuth. [email protected] http://research.yahoo.com/Andrei_Broder

Instructor

4

Page 5: Intro to Computational Advertising - Stanford University

General course info Course Website:

http://www.stanford.edu/class/msande239/ TA: Pranav Dandekar (Office hours: TBD) Course email lists Staff:msande239-aut1011-staffAll: msande239-aut1011-students Please use the staff list to communicate

with the staff Lectures: 10am ~ 12:30pm Fridays in Gates

B12 Office Hours: by appointment – preferably

right after the lecture5

Page 6: Intro to Computational Advertising - Stanford University

Course Overview (subject to change)

1. 09/24 Intro2. 10/01 Textual advertising basics3. 10/08 Marketplace and economics4. 10/15 Sponsored search 5. 10/22 Contextual advertising6. 10/29 Reactive methods for ad selection7. 11/05 Display advertising8. 11/12 Targeting9. 11/19 Emerging formats (Mobile etc)10. 12/03 Project Presentations

6

Page 7: Intro to Computational Advertising - Stanford University

Logistics

7

Page 8: Intro to Computational Advertising - Stanford University

General lecture structure Overview: 1:15 hour Break: 10 minutes In depth, discussion, and occasional quizzes:

1:15 hour

8

Page 9: Intro to Computational Advertising - Stanford University

Homework Approximately 3 homework assignments Based on reading research paper, typically

answer a question in the following style: Why does the algorithms in the paper work? Try to extend the idea in the paper from a different

view of angle? How to modify it for a new scenario?

Some homeworks will be conventional exercise style

Some homework might require interaction with existing advertising systems

9

Page 10: Intro to Computational Advertising - Stanford University

Project Hands on web advertising:

We will introduce you to an actual business You build a campaign for them & measure how it does. You pay all the costs

Just kidding -- all teams will have the same budget provided by the sponsor

Prepare a presentation showing What was your strategy What worked What didn’t Why?

Teams should be formed within 2 weeks If you would like a different but equivalent project, please

propose it, and if possible we will accommodate. (e.g. different sponsorship, a programming project, etc)

10

Page 11: Intro to Computational Advertising - Stanford University

Grading Team project 40% In class quizzes 10% Homework 30% Final (take home) 20% CR/NC students: please put the same effort in the

project as other participants

11

Page 12: Intro to Computational Advertising - Stanford University

Questions?We welcome suggestions about all

aspects of the course: msande239-aut0910-staff

12

Page 13: Intro to Computational Advertising - Stanford University

Introduction to Computational Advertising

13

Page 14: Intro to Computational Advertising - Stanford University

Disclaimers This talk presents the opinions of the authors. It

does not necessarily reflect the views of Yahoo! inc or any other entity.

Algorithms, techniques, features, etc mentioned here might or might not be in use by Yahoo! or any other company.

These lectures benefitted from the contributions of many colleagues and co-authors at Yahoo! and elsewhere. Their help is gratefully acknowledged.

14

Page 15: Intro to Computational Advertising - Stanford University

Lecture 1 plan Overview and key messages Classical advertising Difference between classic and computational

The opportunity: revenue and beyond Computational Advertising landscape Graphical ads: guaranteed delivery, performance

delivery, exchanges Textual ads

Ad selection Textual ad selection Performance graphical ads

Mobile advertising Closing remarks

15

Page 16: Intro to Computational Advertising - Stanford University

Computational advertising – the central challenge

Examples Context = Web search results Sponsored search Context = Publisher page Content match, banners Other contexts: mobile, video, newspapers, etc

Related challenge 1: Design markets and exchanges that help in this task, and maximize value for users, advertisers, and publishers

Related challenge 2: Build the infrastructure to support this process16

Find the "best match" between a given user in a given context and a suitable advertisement.

Page 17: Intro to Computational Advertising - Stanford University

Participants: Publishers, Advertisers, Users, & “Matcher”

Advertisers

Users

Publishers

Match maker

17

Page 18: Intro to Computational Advertising - Stanford University

What is “Computational Advertising”? New scientific sub-discipline bringing together Information retrieval Large scale search and text analysis Statistical modeling Machine learning Microeconomics Game theory, auction theory, mechanism design Classification Optimization Recommender systems ….

18

Page 19: Intro to Computational Advertising - Stanford University

Establishing a new discipline…

19

Page 20: Intro to Computational Advertising - Stanford University

Establishing a new discipline…

20

Page 21: Intro to Computational Advertising - Stanford University

Establishing a new discipline…

21

Page 22: Intro to Computational Advertising - Stanford University

CACM

CACM, May 2009: http://mags.acm.org/communications/200905/?pg=1822

Page 23: Intro to Computational Advertising - Stanford University

Key messages1. Computational advertising = A principled way to find

the "best match" between a user in a context and a suitable ad.

2. The financial scale for computational advertising is huge

Small constants matter Expect plenty of further research

3. Advertising is a form of information. Adding ads to a context is similar to the integration problem of other

types of information Finding the “best ad” is a type of information retrieval problem with

multiple, possibly contradictory utility functions4. New application domains and new techniques are

emerging every day Good area for research + new businesses!

23

Page 24: Intro to Computational Advertising - Stanford University

Classic Advertising

24

Page 25: Intro to Computational Advertising - Stanford University

Long history….

Japan ,1806 USA,189025

Page 26: Intro to Computational Advertising - Stanford University

Brand advertisingGoal: create a distinct favorable image

26

Page 27: Intro to Computational Advertising - Stanford University

Direct marketingAdvertising that involves a "direct response”: buy, subscribe, vote, donate, etc, now or soon

27

Page 28: Intro to Computational Advertising - Stanford University

Why “computational” advertising?

28

Page 29: Intro to Computational Advertising - Stanford University

Lots of computational this and that … Computational Linguistics Computational Biology Computational Chemistry Computational Finance Computational Geometry Computational Neuroscience Computational Physics Computational Mechanics Computational Economics …

All are about a) Mixing an old science with computing capabilitiesb) Thinking algorithmically about an old challenge

29

Page 30: Intro to Computational Advertising - Stanford University

Why go “computational” ?

Classical: Relatively few venues – magazines, billboards,

newspapers, handbills, TV, etc High cost per venue ($3Mil for a Super Bowl TV ad) No personalization possible Targeting by the wisdom of ad-people Hard to measure ROI

Computational – almost the exact opposite: Billions of opportunities Billions of creatives Totally personalizable Tiny cost per opportunity Much more quantifiable

30

Page 31: Intro to Computational Advertising - Stanford University

"Half the money I spend on advertising is wasted; the trouble is I don't know which half.“

31

John Wanamaker, ~ 1875

Page 32: Intro to Computational Advertising - Stanford University

Computational advertising – the central challenge

Examples Context = Web search results Sponsored search Context = Publisher page Content match, banners Other contexts: mobile, video, newspapers, etc

Related challenge 1: Design markets and exchanges that help in this task, and maximize value for users, advertisers, and publishers

Related challenge 2: Build the infrastructure to support this process32

Find the "best match" between a given user in a given context and a suitable advertisement.

Page 33: Intro to Computational Advertising - Stanford University

The central challenge decomposed

1. Representation = represent the user, the context, and the ads in an effective & efficient way

2. Definition = define the mathematical optimization problem to capture the actual marketplace constraints and goals

3. Solution = solve the optimization problem in an effective & efficient way

33

Find the "best match" between a given user in a given context and a suitable advertisement.

Page 34: Intro to Computational Advertising - Stanford University

Computational Advertising and Market Design

MARKET DESIGN

COMPUTATIONAL ADVERTISING

SYSTEMS

34

Page 35: Intro to Computational Advertising - Stanford University

The OpportunityThe money and beyond

35

Page 36: Intro to Computational Advertising - Stanford University

US Online Spending

36

Page 37: Intro to Computational Advertising - Stanford University

Online spending as percent oftotal media spending

37

Page 38: Intro to Computational Advertising - Stanford University

Worldwide advertising by media

38

Page 39: Intro to Computational Advertising - Stanford University

The advertising $$ budget vs. the human time budget

39

Page 40: Intro to Computational Advertising - Stanford University

The Value of Web Advertising Beyond revenue… Advertising supports a vast eco-system on the Web:

Publisher revenue Make niche interests businesses possible Value for consumers There would be a lot less to see on the web without the ads!

However, relevance lags beyond search results and page content Most users perceive ads less relevant than page content/search

results But, some studies show that ~30% of users interact with the ads

Progress in Web Advertising Impact on user experience Impact on the web-based economy

40

Page 41: Intro to Computational Advertising - Stanford University

Computational Advertising Landscape

41

Page 42: Intro to Computational Advertising - Stanford University

Marketplace basics What do advertisers pay? CPM = cost per thousand impressions Typically used for graphical/banner ads (brand advertising) Could be paid in advance “Guaranteed delivery”

CPC = cost per click Typically used for textual ads

CPT/CPA = cost per transaction/action a.k.a. referral fees or affiliate fees Typically used for shopping (“buy from our sponsors”),

travel, etc. … but now also used for textual ads (risk mitigation)

42

Page 43: Intro to Computational Advertising - Stanford University

Graphical ads

43

Page 44: Intro to Computational Advertising - Stanford University

The central challenge decomposed

1. Representation = represent the user, the context, and the ads in an effective & efficient way

2. Definition = define the mathematical optimization problem to capture the actual marketplace constraints and goals

3. Solution = solve the optimization problem in an effective & efficient way

44

Find the "best match" between a given user in a given context and a suitable advertisement.

Page 45: Intro to Computational Advertising - Stanford University

Graphical ads – Guaranteed Delivery Two types of online graphical advertising Guaranteed delivery (GD) Performance graphical advertising (non-

guaranteed delivery, NGD) Guaranteed delivery Contract booked based on targeting attributes

of an impression: age, income, location,… Each contract has a duration and a desired

number of impressions Issues in GD Contract pricing Traffic forecasting Impression allocation to the active contracts …

45

Representation

Definition of the Optimization Problem

Page 46: Intro to Computational Advertising - Stanford University

Performance graphical ads Graphical ads can also be placed based on

performance – CPM/CPC/CPA Assume for the moement

Optimization Problem Definition = Max CTR Matching approaches:

1. Reactive: explore the placement of a particular ad on different pages; for each page observe achieved CTR; once the CTRs are learned, given page, pick the ad with highest observed CTR

2. Predictive: generate features for the ad using related ads (same advertiser), landing page, or advertiser metadata – predict performance based on page and ad features

3. Hybrid: (1) and (2) are complementary and can be combined

46

Page 47: Intro to Computational Advertising - Stanford University

Representation1. Reactive: explore the placement of a particular ad on

different pages; for each page observe achieved CTR; once the CTRs are learned, given page, pick the ad with highest observed CTR Ads represented by achieved CTR/page + weights

1. Predictive: generate features for the ad using related ads (same advertiser), landing page, or advertiser metadata – predict performance based on page and ad features Ads (pages) represented by features of ads (resp. pages) + weights

2. Hybrid: (1) and (2) are complementary and can be combined Combined representation

47

Page 48: Intro to Computational Advertising - Stanford University

Textual Ads

48

Page 49: Intro to Computational Advertising - Stanford University

Textual ads1. Ads driven by search keywords = “sponsored search”

(a.k.a. “keyword driven ads”, “paid search”, “adwords”, etc) Advertiser chooses a

“bid phrase” = query on which to display Can also subscribe to

“advanced match” = display me on related queries Needed to achieve volume Huge challenge

2. Ads driven by the content of a web page = “content match” (a.k.a. “context driven ads”, “contextual ads”, “adsense”, etc)

Textual ads are heavily related to Search and IR

49

Page 50: Intro to Computational Advertising - Stanford University

Historical view on textual advertising Late 1990s Alta Vista tried the Sponsored Search

model Rejected by the early search engine users

Goto.com (acquired later by Overture) develops a search engine for paid ads Users with commercial interest go to this engine At the peak, a billion dollar business

Google tries the Sponsored Search model again This time a success

Advertisers cannot get enough volume Content match to provide more impressions

50

Page 51: Intro to Computational Advertising - Stanford University

Textual ads anatomy

51

Page 52: Intro to Computational Advertising - Stanford University

Search Yahoo for sigir 2010

52

Page 53: Intro to Computational Advertising - Stanford University

Search Bing for sigir 2010

53

Page 54: Intro to Computational Advertising - Stanford University

Search Google for sigir 2010

54

Page 55: Intro to Computational Advertising - Stanford University

Google campaign

55

Page 56: Intro to Computational Advertising - Stanford University

Yahoo! Search

56

Page 57: Intro to Computational Advertising - Stanford University

Bing

57

Page 58: Intro to Computational Advertising - Stanford University

Facebook

58

Page 59: Intro to Computational Advertising - Stanford University

Visible and invisible parts

Title

Creative

Display URL

Bid phrase: sigir 2010Bid: $1

59

Landing URL: http://research.yahoo.com/tutorials/sigir10_compadv/

Page 60: Intro to Computational Advertising - Stanford University

Destination: the landing page

60

Page 61: Intro to Computational Advertising - Stanford University

Textual ads schema

Advertiser

Account 1

Campaign 1

Ad group 1

Creatives Bid phrases

Ad group 2 …

Campaign 2 …

Account 2 …

61

Ad

Page 62: Intro to Computational Advertising - Stanford University

Ad Selection

62

Page 63: Intro to Computational Advertising - Stanford University

Participants: Publishers, Advertisers, Users, & “Match maker”

Advertisers

Users

Publishers

Match maker

63

Page 64: Intro to Computational Advertising - Stanford University

Some more complexity: dual roles

Sponsored search: Pub = AA (Yahoo!, Google)

Content match/Graphical Ads: Pub = AA (Yahoo! content) Pub = Adv (“House Ads”)

Advertisers

Users

Publishers

Match maker

64

Page 65: Intro to Computational Advertising - Stanford University

Problem definition: Ad selection objective

Each participant has its own utility1. Advertisers wants ROI and volume2. User wants relevance3. Publisher wants revenue per impressions/search4. Ad network wants revenue and growth

Ad selection: optimize for a goal that balances the utilities of the four participants

Some tradeoffs are linked with the short term and long term business objectives: Allow for easy adjustments based on periodical

changes in objectives

65

Page 66: Intro to Computational Advertising - Stanford University

Efficiency requirements: Scale and Cost of Serving The Billions:

Billions of individual ads in sponsored search and content match

Billions of unique queries/millions of searches per hour Trillions of page impressions (content match and graphical

advertising) Billions of users

The Milliseconds: Requests served while the user ‘waits’: no more than 100ms

response time The Money:

Serving each requests require some CPU amount Data usually needs to be in memory Per-request cost needs to be lower than the serving cost Low CTR make this a challenging problem

66

Page 67: Intro to Computational Advertising - Stanford University

Textual ad selection

67

Page 68: Intro to Computational Advertising - Stanford University

In the beginning: The database approach Thinking of SS as a data base problem

SELECT adsFROM ad_tableWHERE bid_phrase = query

Implementation Sponsored search

Match the query to the ad bid phrase (some normalization performed) Advertisers cannot bid on all feasible queries (especially in the tail)

Need advanced match Advanced match translate the query into bid phrases

Very difficult to capture context, relevance, etc. Pricing is misleading – bid on original phrase has little to do with value

of AM Content match bid phrases from pages

very difficult to capture context, semantics,. relevance, etc.

68

Page 69: Intro to Computational Advertising - Stanford University

Textual ad schemaAdvertiser

Account 1

Campaign 1

Ad group 1

Creatives Bid phrases

Ad group 2 …

Campaign 2 …

Account 2 …

Ad

New Year deals on lawn & garden toolsBuy appliances on

Black Friday

Kitchen appliances

Brand name appliancesCompare prices and save moneywww.appliances-r-us.com

{ Miele, KitchenAid, Cuisinart, …}

Can be just a single bid phrase, or

thousands of bid phrases (which are

not necessarily topically coherent)

Page 70: Intro to Computational Advertising - Stanford University

New old concept: advertising as information

“I do not regard advertising as entertainment or

an art form, but as a medium of information….”

[David Ogilvy, 1985]

“Advertising as Information” [Nelson, 1974]

Irrelevant ads are annoying; relevant ads are

interesting

Vogue, Skiing, etc are mostly ads and advertorials

Finding the best textual ad is an information retrieval problem with multiple, possible contradictory utility functions

70

Page 71: Intro to Computational Advertising - Stanford University

Advertising as information, London 1657

The ad explains “The Vertue of the COFFEE drink” what coffee is, how it grows, how it cures numerous maladies, including Dropsy, Gout, and Scurvy, …

71

Page 72: Intro to Computational Advertising - Stanford University

Finding the “best ad” as an Information Retrieval (IR) problem Representation: Treat the ads as documents in IR

[Ribeiro-Neto et al. SIGIR 2005] [Broder et al. SIGIR2007] [Broder et al. CIKM2008]

Optimization/solution: Retrieve the ads by evaluating the query over the ad corpus

Details Analyze the “query” and extract query-features

Query = full context (content, user profile, environment, etc) Analyze the documents (= ads) and extract doc-features Devise a scoring function = predicates on q-features and

d-features + weights Build a search engine that produces quickly the ads that

maximize the scoring function

72

Page 73: Intro to Computational Advertising - Stanford University

Setting the ad retrieval problem Ads corpus = Textual Ads: Bid phrase(s) + Title + Creative + URL +

Landing Page + … Graphical Ads: Advertiser supplied meta data + URL +

Landing Page + Content of pages with clicks… Query features = Search Keywords + Outside Knowledge Expansion +

Context features Context features (for sponsored search) = Location + User data + Previous searches + …

Context features (for content match) = Location + User data + Page topic + Page keywords …

73

Page 74: Intro to Computational Advertising - Stanford University

74

Research questions Should

we show ads at

all?

How to select

relevant ads?

How to index the

ad corpus?

Can we generate the bid phrases

automatically?

Can we optimally

choose the landing page?

Should we use

the landing

for indexing?

What is the interplay

between the organic and sponsored

results?

Page 75: Intro to Computational Advertising - Stanford University

75

Feature generation for improved ad retrieval

(SIGIR 2007, ACM TWEB 2009)

How to select

relevant ads?

Page 76: Intro to Computational Advertising - Stanford University

76

Queries, queries, queries …

Search engine

ex560lku

Computer modem

firewall

Network protection

device

firewire

Bus interface standard

wii

Video game console

wwi

World War I

wwii

World War II

Page 77: Intro to Computational Advertising - Stanford University

77

Query classification using Web search results

Humans often find it hard to readily see what the query is about … But they can easily make sense of it once they look at

the search results… Let computers do the same thing Infer the query intent from the top algorithmic search

results (“pseudo relevance feedback”) Classify search results (either summaries or full pages) Let these results “vote” to determine the query class(es) in a

large taxonomy of commercial topics Our goal: Construct additional features to retrieve better ads Better representation

Page 78: Intro to Computational Advertising - Stanford University

78

Example: ex560lku

CATEGORIES1. Computing/Computer/ Hardware/Computer/Peri-pherals/ComputerModems

Page 79: Intro to Computational Advertising - Stanford University

79

If we know it is about actiontec usb modem then we have plenty of ads …

Page 80: Intro to Computational Advertising - Stanford University

80

Traditional approach:

Query Classifier

Our approach:

Query Search engine

Insufficient data

ClassifierUsing Web as external knowledge

Search results

Very large scale

Pre-classify all pages just

once !

Our approach

Page 81: Intro to Computational Advertising - Stanford University

81

Research questions (effectiveness)

Snippets or full pages?

Aggregation:

bundling or voting?

Number of search results to obtain

Number of classes per search result

Page 82: Intro to Computational Advertising - Stanford University

82

The effect of using Web search results

Page 83: Intro to Computational Advertising - Stanford University

83

Beyond the bag of words: matching textual ads in the enriched feature space

[SIGIR 2007, Broder et al.;CIKM 2008, Broder et al.]

Page 84: Intro to Computational Advertising - Stanford University

84

Our approach to sponsored searchQuery

Front end

Candidate ads

Revenue reorderin

g

Ad slate

Ad query generation

First pass retrieval

Relevance reordering

Ad search engine

Ad query

Miele

<Miele, appliances, kitchen,“appliances repair”, “appliance parts”,Business/Shopping/Home/Appliances>Rich query

The hidden parts of ads (bid phrases + landing pages) allow us to augment the ads (cf. query expansion)

Page 85: Intro to Computational Advertising - Stanford University

Summary

85

Page 86: Intro to Computational Advertising - Stanford University

Key messages1. Computational advertising = A principled way to find the

"best match" between a user in a context and a suitable ad.2. Key sub-problems:

I. Representation of user/context/adsII. Definition of optimization problemIII. Efficient and effective solution

3. The financial scale for computational advertising is huge Small constants matter Expect plenty of further research

4. Advertising is a form of information. Adding ads to a context is similar to the integration problem of other

types of information Finding the “best ad” is often a type of information retrieval problem with

multiple, possibly contradictory utility functions5. New application domains and new techniques are emerging

every day Good area for research + new businesses!

86

Page 87: Intro to Computational Advertising - Stanford University

Conferences where most of the action happens WWW (World Wide Web)

WSDM (Web Search and Data Mining)

SIGIR (Information Retrieval)

CIKM (Information and Knowledge Management)

EC (Electronic Commerce)

4 workshops in 2009: SIGIR, KDD, EC, WINE

First to sixth Workshop on Sponsored Search Auctions

87

Page 89: Intro to Computational Advertising - Stanford University

89

This talk is Copyright 2010.Authors retain all rights, including copyrights and

distribution rights. No publication or further distribution in full or in part permitted without

explicit written permission