question answering - cleveland state...
Post on 22-May-2020
4 Views
Preview:
TRANSCRIPT
Question Answering
What is Ques+on Answering?
Dan Jurafsky
2
Ques%on Answering
What do worms eat?
worms
eat
what
worms
eat
grass
Worms eat grass
worms
eat
grass
Grass is eaten by wormsbirds
eat
worms
Birds eat worms
horses
eat
grass
Horses with worms eat grass
with
worms
!"#$%&'( )&*#'%+,-.'$/#0$(
One of the oldest NLP tasks (punched card systems in 1961) Simmons, Klein, McConlogue. 1964. Indexing and Dependency Logic for Answering English Ques+ons. American Documenta+on 15:30, 196-‐204
Dan Jurafsky
Ques%on Answering: IBM’s Watson • Won Jeopardy on February 16, 2011!
3
WILLIAM WILKINSON’S “AN ACCOUNT OF THE PRINCIPALITIES OF
WALLACHIA AND MOLDOVIA” INSPIRED THIS AUTHOR’S
MOST FAMOUS NOVEL
Bram Stoker
Dan Jurafsky
Apple’s Siri
4
Dan Jurafsky
Wolfram Alpha
5
Dan Jurafsky
6
Types of Ques%ons in Modern Systems
• Factoid ques+ons • Who wrote “The Universal Declara4on of Human Rights”? • How many calories are there in two slices of apple pie? • What is the average age of the onset of au4sm? • Where is Apple Computer based?
• Complex (narra+ve) ques+ons: • In children with an acute febrile illness, what is the efficacy of acetaminophen in reducing fever?
• What do scholars think about Jefferson’s posi4on on dealing with pirates?
Dan Jurafsky
Commercial systems: mainly factoid ques%ons
Where is the Louvre Museum located? In Paris, France
What’s the abbrevia+on for limited partnership?
L.P.
What are the names of Odin’s ravens? Huginn and Muninn
What currency is used in China? The yuan
What kind of nuts are used in marzipan? almonds
What instrument does Max Roach play? drums
What is the telephone number for Stanford University?
650-‐723-‐2300
Dan Jurafsky
Paradigms for QA
• IR-‐based approaches • TREC; IBM Watson; Google
• Knowledge-‐based and Hybrid approaches • IBM Watson; Apple Siri; Wolfram Alpha; True Knowledge Evi
8
Dan Jurafsky
Many ques%ons can already be answered by web search
• a
9
Dan Jurafsky
IR-‐based Ques%on Answering
• a
10
Dan Jurafsky
11
IR-‐based Factoid QA
DocumentDocumentDocument
DocumentDocume
ntDocumentDocume
ntDocument
Question Processing
PassageRetrieval
Query Formulation
Answer Type Detection
Question
Passage Retrieval
Document Retrieval
Answer Processing
Answer
passages
Indexing
RelevantDocs
DocumentDocumentDocument
Dan Jurafsky
IR-‐based Factoid QA • QUESTION PROCESSING
• Detect ques+on type, answer type, focus, rela+ons • Formulate queries to send to a search engine
• PASSAGE RETRIEVAL • Retrieve ranked documents • Break into suitable passages and rerank
• ANSWER PROCESSING • Extract candidate answers • Rank candidates
• using evidence from the text and external sources
Dan Jurafsky
Knowledge-‐based approaches (Siri)
• Build a seman+c representa+on of the query • Times, dates, loca+ons, en++es, numeric quan++es
• Map from this seman+cs to query structured data or resources • Geospa+al databases • Ontologies (Wikipedia infoboxes, dbPedia, WordNet, Yago) • Restaurant review sources and reserva+on services • Scien+fic databases
13
Dan Jurafsky
Hybrid approaches (IBM Watson)
• Build a shallow seman+c representa+on of the query • Generate answer candidates using IR methods
• Augmented with ontologies and semi-‐structured data
• Score each candidate using richer knowledge sources • Geospa+al databases • Temporal reasoning • Taxonomical classifica+on
14
Question Answering
What is Ques+on Answering?
Question Answering
Answer Types and Query Formula+on
Dan Jurafsky
Factoid Q/A
17
DocumentDocumentDocument
DocumentDocume
ntDocumentDocume
ntDocument
Question Processing
PassageRetrieval
Query Formulation
Answer Type Detection
Question
Passage Retrieval
Document Retrieval
Answer Processing
Answer
passages
Indexing
RelevantDocs
DocumentDocumentDocument
Dan Jurafsky
Ques%on Processing Things to extract from the ques%on
• Answer Type Detec+on • Decide the named en%ty type (person, place) of the answer
• Query Formula+on • Choose query keywords for the IR system
• Ques+on Type classifica+on • Is this a defini+on ques+on, a math ques+on, a list ques+on?
• Focus Detec+on • Find the ques+on words that are replaced by the answer
• Rela+on Extrac+on • Find rela+ons between en++es in the ques+on 18
Dan Jurafsky
Question Processing They’re the two states you could be reentering if you’re crossing Florida’s northern border
• Answer Type: US state • Query: two states, border, Florida, north • Focus: the two states • Rela+ons: borders(Florida, ?x, north)
19
Dan Jurafsky
Answer Type Detec%on: Named En%%es
• Who founded Virgin Airlines? • PERSON
• What Canadian city has the largest popula4on? • CITY.
Dan Jurafsky
Answer Type Taxonomy
• 6 coarse classes • ABBEVIATION, ENTITY, DESCRIPTION, HUMAN, LOCATION, NUMERIC
• 50 finer classes • LOCATION: city, country, mountain… • HUMAN: group, individual, +tle, descrip+on • ENTITY: animal, body, color, currency…
21
Xin Li, Dan Roth. 2002. Learning Ques+on Classifiers. COLING'02
Dan Jurafsky
22
Part of Li & Roth’s Answer Type Taxonomy
LOCATION
NUMERIC
ENTITY HUMAN
ABBREVIATIONDESCRIPTION
country city state
datepercent
money
sizedistance
individual
title
group
food
currency
animal
definition
reason expression
abbreviation
Dan Jurafsky
23
Answer Types
Dan Jurafsky
24
More Answer Types
Dan Jurafsky
Answer types in Jeopardy
• 2500 answer types in 20,000 Jeopardy ques+on sample • The most frequent 200 answer types cover < 50% of data • The 40 most frequent Jeopardy answer types he, country, city, man, film, state, she, author, group, here, company, president, capital, star, novel, character, woman, river, island, king, song, part, series, sport, singer, actor, play, team, show, actress, animal, presiden+al, composer, musical, na+on, book, +tle, leader, game
25
Ferrucci et al. 2010. Building Watson: An Overview of the DeepQA Project. AI Magazine. Fall 2010. 59-‐79.
Dan Jurafsky
Answer Type Detec%on
• Hand-‐wrioen rules • Machine Learning • Hybrids
Dan Jurafsky
Answer Type Detec%on
• Regular expression-‐based rules can get some cases: • Who {is|was|are|were} PERSON • PERSON (YEAR – YEAR)
• Other rules use the ques%on headword: (the headword of the first noun phrase ater the wh-‐word)
• Which city in China has the largest number of foreign financial companies?
• What is the state flower of California?
Dan Jurafsky
Answer Type Detec%on
• Most oten, we treat the problem as machine learning classifica+on • Define a taxonomy of ques+on types • Annotate training data for each ques+on type • Train classifiers for each ques+on class using a rich set of features. • features include those hand-‐wrioen rules!
28
Dan Jurafsky
Features for Answer Type Detec%on
• Ques+on words and phrases • Part-‐of-‐speech tags • Parse features (headwords) • Named En++es • Seman+cally related words
29
Dan Jurafsky
Factoid Q/A
30
DocumentDocumentDocument
DocumentDocume
ntDocumentDocume
ntDocument
Question Processing
PassageRetrieval
Query Formulation
Answer Type Detection
Question
Passage Retrieval
Document Retrieval
Answer Processing
Answer
passages
Indexing
RelevantDocs
DocumentDocumentDocument
Dan Jurafsky
Keyword Selec%on Algorithm
1. Select all non-‐stop words in quota+ons 2. Select all NNP words in recognized named en++es 3. Select all complex nominals with their adjec+val modifiers 4. Select all other complex nominals 5. Select all nouns with their adjec+val modifiers 6. Select all other nouns 7. Select all verbs 8. Select all adverbs 9. Select the QFW word (skipped in all previous steps) 10. Select all other words
Dan Moldovan, Sanda Harabagiu, Marius Paca, Rada Mihalcea, Richard Goodrum, Roxana Girju and Vasile Rus. 1999. Proceedings of TREC-‐8.
Dan Jurafsky
Choosing keywords from the query
32
Who coined the term “cyberspace” in his novel “Neuromancer”?
1 1
4 4
7
cyberspace/1 Neuromancer/1 term/4 novel/4 coined/7
Slide from Mihai Surdeanu
Question Answering
Answer Types and Query Formula+on
Question Answering
Passage Retrieval and Answer Extrac+on
Dan Jurafsky
Factoid Q/A
35
DocumentDocumentDocument
DocumentDocume
ntDocumentDocume
ntDocument
Question Processing
PassageRetrieval
Query Formulation
Answer Type Detection
Question
Passage Retrieval
Document Retrieval
Answer Processing
Answer
passages
Indexing
RelevantDocs
DocumentDocumentDocument
Dan Jurafsky
36
Passage Retrieval
• Step 1: IR engine retrieves documents using query terms • Step 2: Segment the documents into shorter units
• something like paragraphs
• Step 3: Passage ranking • Use answer type to help rerank passages
Dan Jurafsky
Features for Passage Ranking
• Number of Named En++es of the right type in passage • Number of query words in passage • Number of ques+on N-‐grams also in passage • Proximity of query keywords to each other in passage • Longest sequence of ques+on words • Rank of the document containing passage
Either in rule-‐based classifiers or with supervised machine learning
Dan Jurafsky
Factoid Q/A
38
DocumentDocumentDocument
DocumentDocume
ntDocumentDocume
ntDocument
Question Processing
PassageRetrieval
Query Formulation
Answer Type Detection
Question
Passage Retrieval
Document Retrieval
Answer Processing
Answer
passages
Indexing
RelevantDocs
DocumentDocumentDocument
Dan Jurafsky
Answer Extrac%on
• Run an answer-‐type named-‐en+ty tagger on the passages • Each answer type requires a named-‐en+ty tagger that detects it • If answer type is CITY, tagger has to tag CITY • Can be full NER, simple regular expressions, or hybrid
• Return the string with the right type: • Who is the prime minister of India (PERSON) Manmohan Singh, Prime Minister of India, had told left leaders that the deal would not be renegotiated.!
• How tall is Mt. Everest? (LENGTH) The official height of Mount Everest is 29035 feet!
Dan Jurafsky
Ranking Candidate Answers
• But what if there are mul+ple candidate answers!
Q: Who was Queen Victoria’s second son?!• Answer Type: Person
• Passage: The Marie biscuit is named ater Marie Alexandrovna, the daughter of Czar Alexander II of Russia and wife of Alfred, the second son of Queen Victoria and Prince Albert
Dan Jurafsky
Ranking Candidate Answers
• But what if there are mul+ple candidate answers!
Q: Who was Queen Victoria’s second son?!• Answer Type: Person
• Passage: The Marie biscuit is named ater Marie Alexandrovna, the daughter of Czar Alexander II of Russia and wife of Alfred, the second son of Queen Victoria and Prince Albert
Dan Jurafsky
Use machine learning: Features for ranking candidate answers
Answer type match: Candidate contains a phrase with the correct answer type. PaZern match: Regular expression paoern matches the candidate. Ques%on keywords: # of ques+on keywords in the candidate. Keyword distance: Distance in words between the candidate and query keywords Novelty factor: A word in the candidate is not in the query. Apposi%on features: The candidate is an apposi+ve to ques+on terms Punctua%on loca%on: The candidate is immediately followed by a comma, period, quota+on marks, semicolon, or exclama+on mark. Sequences of ques%on terms: The length of the longest sequence of ques+on terms that occurs in the candidate answer.
Dan Jurafsky
Candidate Answer scoring in IBM Watson
• Each candidate answer gets scores from >50 components • (from unstructured text, semi-‐structured text, triple stores)
• logical form (parse) match between ques+on and candidate • passage source reliability • geospa+al loca+on • California is ”southwest of Montana”
• temporal rela+onships • taxonomic classifica+on 43
Dan Jurafsky
44
Common Evalua%on Metrics
1. Accuracy (does answer match gold-‐labeled answer?) 2. Mean Reciprocal Rank
• Return a ranked list of N candidate answers. • Score is 1/Rank of the first right answer.
MRR =
1rankii=1
N
!
N
Question Answering
Passage Retrieval and Answer Extrac+on
Question Answering
Using Knowledge in QA
Dan Jurafsky
Rela%on Extrac%on
• Answers: Databases of Rela+ons • born-‐in(“Emma Goldman”, “June 27 1869”) • author-‐of(“Cao Xue Qin”, “Dream of the Red Chamber”) • Draw from Wikipedia infoboxes, DBpedia, FreeBase, etc.
• Ques+ons: Extrac+ng Rela+ons in Ques+ons Whose granddaughter starred in E.T.?
(acted-in ?x “E.T.”)! (granddaughter-of ?x ?y)!47
Dan Jurafsky
Temporal Reasoning
• Rela+on databases • (and obituaries, biographical dic+onaries, etc.)
• IBM Watson ”In 1594 he took a job as a tax collector in Andalusia” Candidates: • Thoreau is a bad answer (born in 1817) • Cervantes is possible (was alive in 1594)
48
Dan Jurafsky
Geospa%al knowledge (containment, direc%onality, borders)
• Beijing is a good answer for ”Asian city” • California is ”southwest of Montana” • geonames.org:
49
Dan Jurafsky
Context and Conversa%on in Virtual Assistants like Siri
• Coreference helps resolve ambigui+es U: “Book a table at Il Fornaio at 7:00 with my mom” U: “Also send her an email reminder”
• Clarifica+on ques+ons: U: “Chicago pizza” S: “Did you mean pizza restaurants in Chicago or Chicago-‐style pizza?”
50
Question Answering
Using Knowledge in QA
Question Answering
Advanced: Answering Complex Ques+ons
Dan Jurafsky
Answering harder ques%ons Q: What is water spinach? A: Water spinach (ipomoea aqua+ca) is a semi-‐aqua+c leafy green plant with long hollow stems and spear-‐ or heart-‐shaped leaves, widely grown throughout Asia as a leaf vegetable. The leaves and stems are oten eaten s+r-‐fried flavored with salt or in soups. Other common names include morning glory vegetable, kangkong (Malay), rau muong (Viet.), ong choi (Cant.), and kong xin cai (Mand.). It is not related to spinach, but is closely related to sweet potato and convolvulus.
Dan Jurafsky
Answering harder ques%on Q: In children with an acute febrile illness, what is the efficacy of single medica+on therapy with acetaminophen or ibuprofen in reducing fever? A: Ibuprofen provided greater temperature decrement and longer dura+on of an+pyresis than acetaminophen when the two drugs were administered in approximately equal doses. (PubMedID: 1621668, Evidence Strength: A)
Dan Jurafsky
Answering harder ques%ons via query-‐focused summariza%on
• The (booom-‐up) snippet method • Find a set of relevant documents • Extract informa+ve sentences from the documents (using |-‐idf, MMR) • Order and modify the sentences into an answer
• The (top-‐down) informa+on extrac+on method • build specific answerers for different ques+on types: • defini+on ques+ons, • biography ques+ons, • certain medical ques+ons
Dan Jurafsky
The Informa%on Extrac%on method
• a good biography of a person contains: • a person’s birth/death, fame factor, educa%on, na%onality and so on
• a good defini%on contains: • genus or hypernym • The Hajj is a type of ritual
• a medical answer about a drug’s use contains: • the problem (the medical condi+on), • the interven%on (the drug or procedure), and • the outcome (the result of the study).
Dan Jurafsky
Informa%on that should be in the answer for 3 kinds of ques%ons
Dan Jurafsky
Document Retrieval
11 Web documents1127 total sentences
Predicate Identification
Data-Driven Analysis
383 Non-Specific Definitional sentences
Sentence clusters, Importance ordering
DefinitionCreation
9 Genus-Species SentencesThe Hajj, or pilgrimage to Makkah (Mecca), is the central duty of Islam.The Hajj is a milestone event in a Muslim's life.The hajj is one of five pillars that make up the foundation of Islam....
The Hajj, or pilgrimage to Makkah [Mecca], is the central duty of Islam. More than two million Muslims are expected to take the Hajj this year. Muslims must perform the hajj at least once in their lifetime if physically and financially able. The Hajj is a milestone event in a Muslim's life. The annual hajj begins in the twelfth month of the Islamic year (which is lunar, not solar, so that hajj and Ramadan fall sometimes in summer, sometimes in winter). The Hajj is a week-long pilgrimage that begins in the 12th month of the Islamic lunar calendar. Another ceremony, which was not connected with the rites of the Ka'ba before the rise of Islam, is the Hajj, the annual pilgrimage to 'Arafat, about two miles east of Mecca, toward Mina!
"What is the Hajj?" (Ndocs=20, Len=8)
Architecture for complex ques%on answering: defini%on ques%ons S. Blair-‐Goldensohn, K. McKeown and A. Schlaikjer. 2004.
Answering Defini+on Ques+ons: A Hyrbid Approach.
Question Answering
Advanced: Answering Complex Ques+ons
top related