prabhakar raghavan yahoo! research...yahoo! research 2 agenda • web search leading up to today...

55
1 Web Search From information retrieval to microeconomic modeling Prabhakar Raghavan Yahoo! Research

Upload: others

Post on 09-Apr-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

1

Web SearchFrom information retrieval to microeconomic modeling

Prabhakar RaghavanYahoo! Research

Page 2: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

2Yahoo! Research

Agenda

• Web search leading up to today– Two search engines in web search

• Where algorithmic search is going– The hard research problems

• Search advertising– IR meets markets

Page 3: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

3Yahoo! Research

Algorithmic results=Audience

Advertisements=Monetization

-$ +$

Page 4: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

4Yahoo! Research

“No results”

Page 5: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

5Yahoo! Research

The two engines

The Web

Ad indexes

Web Results 1 - 10 of about 7,310,000 for miele. (0.12 seconds)

Miele, Inc -- Anything else is a compromise At the heart of your home, Appliances by Miele. ... USA. to miele.com. Residential Appliances. Vacuum Cleaners. Dishwashers. Cooking Appliances. Steam Oven. Coffee System ... www.miele.com/ - 20k - Cached - Similar pages

Miele Welcome to Miele, the home of the very best appliances and kitchens in the world. www.miele.co.uk/ - 3k - Cached - Similar pages

Miele - Deutscher Hersteller von Einbaugeräten, Hausgeräten ... - [ Translate this page ] Das Portal zum Thema Essen & Geniessen online unter www.zu-tisch.de. Miele weltweit ...ein Leben lang. ... Wählen Sie die Miele Vertretung Ihres Landes. www.miele.de/ - 10k - Cached - Similar pages

Herzlich willkommen bei Miele Österreich - [ Translate this page ] Herzlich willkommen bei Miele Österreich Wenn Sie nicht automatisch weitergeleitet werden, klicken Sie bitte hier! HAUSHALTSGERÄTE ... www.miele.at/ - 3k - Cached - Similar pages

Sponsored Links

CG Appliance Express Discount Appliances (650) 756-3931Same Day Certified Installation www.cgappliance.com San Francisco-Oakland-San Jose, CA Miele Vacuum Cleaners Miele Vacuums- Complete SelectionFree Shipping! www.vacuums.com Miele Vacuum Cleaners Miele-Free Air shipping! All models. Helpful advice. www.best-vacuum.com

Web spider

Indexer

Indexes

Search

User

Page 6: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

6Yahoo! Research

Brief (non-technical) history

• Early keyword-based engines– Altavista, Excite, Infoseek, Inktomi, Lycos,

ca. 1995-1997

• Sponsored search ranking: Goto.com(morphed into Overture.com → Yahoo!)– Your search ranking depended on how much

you paid– Auction for keywords: casino was

expensive!

Page 7: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

7Yahoo! Research

Brief (non-technical) history

• 1998+: Link-based ranking pioneered by Google– Blew away all early engines except Inktomi– Great user experience in search of a business

model– Meanwhile Goto/Overture’s annual revenues were

nearing $1 billion

• Result: Google added sponsored search “ads” to the side, independent of search results– 2003: Yahoo follows suit, acquiring Overture (for

paid placement) and Inktomi (for search)

Page 8: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

8Yahoo! Research

(Personal) history

• Keynote at ACM SODA 1997:– What if AltaVista had all the computing

power in the world – could it give us better results?

• Biggest bottlenecks in modeling user cognition, not computational

• If only we could find a computationally expensive way to solve the problem– then we know how to make it go faster

Page 9: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

9Yahoo! Research

Classical user information needs

• Informational – want to learn about something (~25%)

• Navigational – want to go to that page (~40%)

• Transactional – want to do something (web-mediated) (~35%)

– Access a service

– Downloads

– Shop• Gray areas

– Find a good hub– Exploratory search “see what’s there”

Low hemoglobin

United Airlines

Mendocino weather

Mars surface images

Nikon CoolPix

Car rental Finland

Broder 2002, A Taxomony of web search

Page 10: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

10Yahoo! Research

Algorithmic search is changing

• Already, more than a list of docs• Moving towards identifying a user’s

task– (As expressed in the query box)

• Enabling task completion

Page 11: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

11Yahoo! Research

Page 12: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

12Yahoo! Research

Page 13: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

13Yahoo! Research

Page 14: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

14Yahoo! Research

Search and content supply

• Premise:– People don’t want to search– People want to get tasks done

Query stream Session analysis Intent +Attributes

pizzaG

oa pizza

pizza hut Intent= buy pizza

Geo= Goa

Page 15: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

15Yahoo! Research

User’s experience

• No longer a list of 10 documentsPizza Hut GoaAddress:Tel: 12345678 Click to call

Reviews1. Blah blah …2. …

Page 16: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

16Yahoo! Research

Net: new vista of Search

• Yahoo! Mobile OneSearch, Google Universal Search, Ask …

• Intents are satisfied by juxtapositions of objects and transactional services– Objects are people, places,

businesses, universities …– Services let you do things

Page 17: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

17Yahoo! Research

Hard problems to be solved

• What’s the data model for the web of objects?– What logic/algebra governs it?

• What objects/services are retrieved on a query/intent?

• How do you measure fulfillment for the user?

Page 18: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

18Yahoo! Research

Fundamentally inter-disciplinary

• Needs computer science to meld with– Sociology– Ethnography– Cognitive psychology– Statistics– Optimization

Page 19: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

19Yahoo! Research

Sample: Estimating user responses

• The expected number of times a query is asked is a constant– But the mass in the tail is non-trivial.

• What is the likely response of a user to a page view?– (Response prediction)

Fresh challenges in building andoptimizing computing artifacts.

Page 20: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

20Yahoo! Research

Learning in long-tailed spaces

Features Long-tailedsparsity

Domain info

Agarwal and Merugu, ACM KDD 2007

Page 21: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

21

Community content

Long-tail authorship

Page 22: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

22Yahoo! Research

Page 23: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

23Yahoo! Research

The power of social tagging

• Flickr – community phenomenon• Millions of users share and tag each

others’ photographs (why???)• The wisdom of the crowd can be used

to search• The principle is not new – anchor text

used in “standard” search• Don’t try to pass the Turing test?

Page 24: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

24Yahoo! Research

Challenges in tag-based search

• How do we use these tags better?– How do you cope with spam?– What’s the ratings and reputation system?

• The bigger challenge: where else can you exploit the power of the people?

• What are the incentive mechanisms?– Luis von Ahn, ESP Game

Page 25: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

25Yahoo! Research

Page 26: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

26Yahoo! Research

Building a knowledge base

Page 27: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

27Yahoo! Research

Social content Social capital

Page 28: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

28Yahoo! Research

Incentives

Page 29: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

29Yahoo! Research

Incentives and trust

• What assignment of incentives leads to good user behavior?– What’s “good” user behavior?– Good questions, good answers, new

questions … ?

• Whose content do you trust and why?• Interface of CS and microeconomics.

Page 30: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

30

Search advertising

Page 31: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

31Yahoo! Research

Search query

Ad

Page 32: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

32Yahoo! Research

Landing page

Page 33: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

33Yahoo! Research

How it works

Advertiser

Landing page

Sponsoredsearch engine

I want to bid $5 oncanon cameraI want to bid $2 oncannon camera

Engine decides when/where to show this ad.

Engine decides how much to charge advertiser on a click.

Ad Index

Page 34: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

34Yahoo! Research

A marketplace

• For each keyword, a market in which advertisers compete for attention– Advertisers willing to pay for placement

on ads next to search results• Market varies by keyword

– Commercial vs non-commercial intent in keywords

– Ford vs Stanford map

Page 35: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

35Yahoo! Research

Monetization

• When a searcher clicks on a displayed ad, advertiser pays the search engine– E.g., a Ford dealership may place an

ad for the search Ford– Out of all such ads, engine chooses to

display some on each Ford search– When a searcher clicks an ad, the

advertiser who placed it pays

Page 36: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

36Yahoo! Research

Engine: Three sub-problems

1. Retrieve ads matching query2. Order the ads3. Pricing on a click-through

IR

Econ

Page 37: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

37Yahoo! Research

1. Retrieve ads matching query

• Advertisers bid on keywords/phrases• Match query to “nearby” bidded keywords

– Retrieve ads for these bidded keywords

• Not a standard IR problem– Bidded keywords skewed to commerce– Advertisers and engine (trying to) co-operate– Marketplace of keywords

• Venezia hotels1. Retrieve ads

Page 38: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

38Yahoo! Research

Retrieving ads

• Standard tricks from IR: phrasing, spell-correction, ranking …– Use both ad snippet and

landing page features– Advertisers are trying to help!

1. Retrieve ads

Page 39: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

39Yahoo! Research

Net of retrieval

• Given a query, the retrieval step yields a set of candidate ads, together with an IR score for each

1. Retrieve adsRibero-Neto et al SIGIR 2005; Yih et al WWW2006

Page 40: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

40Yahoo! Research

Ads go in slots

like these

Page 41: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

41Yahoo! Research

Higher slots get more clicks

Page 42: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

42Yahoo! Research

2. Order the ads

• Most generally, composite IR+Econscore … for today’s talk, focus on Econ

• Original GoTo/Overture scheme:– Order by bid

Page 43: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

43Yahoo! Research

Bid ordering(Overture/former Yahoo!)

Page 44: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

44Yahoo! Research

Revenue ordering, simplified

• Each ad j has an expected click-through denoted CTRj

• Ad j’s bid is denoted bj

• Then, expected revenue from this ad is Rj = bj x CTRj

• Order ads by Rj

Estimated (how?)

Page 45: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

45Yahoo! Research

Revenue ordering?

Page 46: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

46Yahoo! Research

Economic ordering

• Bid and revenue ordering: two forms of ordering by Econ score

• Does revenue ordering maximize revenue?• No – advertisers react to ordering scheme,

by changing their bid behavior!• Lahaie+Pennock ACM EC 2007

– Family of schemes bridging Bid and Revenue ordering

Edelman, Ostrovsky, Schwarz 2006

Page 47: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

47Yahoo! Research

More generally

• Rank ordering determined from a machine-learned function

• Features are bids, ad text- and landing-page content, query terms …

Page 48: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

48Yahoo! Research

3. Pricing on a click-through

• For the keyword tundra, say an advertiser has a value of $10 per click.

• How much should he bid?• How much should he be charged?

The value of a slot for an advertiser,what he bids andwhat he is charged, may all be different.

3. Pricing on a click-through

Page 49: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

49Yahoo! Research

Competitive market

• We have multiple advertisers, users, and the central service;

• Each has an economic incentive in participation

• How do we set up a system of payments that works for all?– Mechanism design– Sub-field of game theory

3. Pricing on a click-through

Page 50: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

50Yahoo! Research

Mechanism design

• Before the players play in the auction, we get to set the rules

• Designing the rules = mechanism design

• These rules govern players’ behavior

3. Pricing on a click-through

Page 51: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

51Yahoo! Research

Marketplace design

• What tools do you give advertisers?– What information do they get on the

market?

• What do they get to bid on?– Laptop vs Ultralight laptop

Page 52: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

52Yahoo! Research

Google AdWords

Page 53: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

53Yahoo! Research

Sample questions

• Do you set reserve prices on bids?• What’s the effect of targeting?

– E.g., geo-targeting– Effect on market “thickness”– Effect on retrieval engine

Page 54: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

54Yahoo! Research

A new convergence

• Monetization and economic value an intrinsic part of system design– Not an afterthought– Mistakes are costly!

• Computing meets humanities like never before – sociology, economics, anthropology …

Page 55: Prabhakar Raghavan Yahoo! Research...Yahoo! Research 2 Agenda • Web search leading up to today –Two search engines in web search • Where algorithmic search is going –The hard

55

Thank you.

[email protected]

http://research.yahoo.com