SI485i : NLP
Set 8: PCFGs and the CKY Algorithm

PCFGs
• We saw how CFGs can model English (sort of)
• Probabilistic CFGs put weights on the production rules
• NP -> DET NN with probability 0.34
• NP -> NN NN with probability 0.16

PCFGs
• We still parse sentences and come up with a syntactic derivation tree
• But now we can talk about how confident we are in the tree: P(tree)!

Buffalo Example
• What is the probability of this tree?
• It's the product of the probabilities of all the rules used to build it, e.g., P(S -> NP VP) times the probabilities of the rules in the subtrees below it (as sketched below)
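
As a concrete illustration (not from the slides), here is a minimal sketch that scores a tree by multiplying rule probabilities bottom-up; the tuple tree encoding and all the numbers are made up:

```python
# Hypothetical sketch: score a parse tree as the product of its rule probabilities.
# Trees are (label, child, child, ...) tuples; leaves are plain strings.
RULE_PROBS = {
    ("S", ("NP", "VP")): 0.9,       # illustrative numbers, not from the slides
    ("NP", ("buffalo",)): 0.001,
    ("VP", ("V", "NP")): 0.5,
    ("V", ("buffalo",)): 0.0005,
}

def tree_prob(tree):
    label, *children = tree
    rhs = tuple(c if isinstance(c, str) else c[0] for c in children)
    p = RULE_PROBS[(label, rhs)]     # probability of the rule at this node
    for child in children:
        if not isinstance(child, str):
            p *= tree_prob(child)    # multiply in each subtree's probability
    return p

# P(tree) for a tiny "buffalo buffalo buffalo" analysis
print(tree_prob(("S", ("NP", "buffalo"), ("VP", ("V", "buffalo"), ("NP", "buffalo")))))
```
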
PCFG Formalized
• G = (T, N, S, R, P)
• T is a set of terminals
• N is a set of nonterminals
• For NLP, we usually distinguish a set P ⊂ N of preterminals, which always rewrite as terminals
• S is the start symbol (one of the nonterminals)
• R is a set of rules/productions of the form X → γ, where X is a nonterminal and γ is a sequence of terminals and nonterminals
• P(R) gives the probability of each rule
• A grammar G generates a language model L

Some slides adapted from Chris Manning
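
A minimal Python encoding of this five-tuple, with toy rules and probabilities invented for illustration:

```python
# A toy PCFG G = (T, N, S, R, P); all rules and probabilities are illustrative.
T = {"the", "dog", "barks"}                      # terminals
N = {"S", "NP", "VP", "DET", "NN", "V"}          # nonterminals (DET, NN, V are preterminals)
S = "S"                                          # start symbol
# R with P folded in: map each rule X -> gamma to P(X -> gamma | X)
R = {
    ("S",   ("NP", "VP")): 1.0,
    ("NP",  ("DET", "NN")): 0.34,
    ("NP",  ("NN", "NN")): 0.16,
    ("NP",  ("NN",)): 0.5,
    ("DET", ("the",)): 1.0,
    ("NN",  ("dog",)): 1.0,
    ("VP",  ("V",)): 1.0,
    ("V",   ("barks",)): 1.0,
}
```
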
Some notation
• w_1n = w_1 … w_n = the word sequence from word 1 to word n
• w_ab = the subsequence w_a … w_b
• We'll write P(Ni → ζj) to mean P(Ni → ζj | Ni)
• Take note, this is a conditional probability: for instance, the probabilities of all rules headed by NP must sum to 1!
• We'll want to calculate the best tree T: max_T P(T ⇒* w_ab)
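
Because each rule probability is conditioned on its head nonterminal, the rules sharing a left-hand side must sum to 1. A quick sanity check over the toy R dict sketched above (the check_pcfg name is ours):

```python
from collections import defaultdict

# Group rule probabilities by their left-hand-side nonterminal and
# verify each group sums to 1 (R is the toy rule dict sketched above).
def check_pcfg(R, tol=1e-9):
    totals = defaultdict(float)
    for (lhs, _rhs), p in R.items():
        totals[lhs] += p
    for lhs, total in totals.items():
        assert abs(total - 1.0) < tol, f"rules headed by {lhs} sum to {total}"
```
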
Trees and Probabilities
• P(t): the probability of a tree is the product of the probabilities of the rules used to generate it
• P(w_1n): the probability of the string is the sum of the probabilities of all possible trees that have the string as their yield
• P(w_1n) = Σ_j P(w_1n, t_j), where t_j is a parse of w_1n
          = Σ_j P(t_j)
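
Rendered directly (and inefficiently) in code, assuming a list of candidate parse trees in the encoding used by tree_prob above:

```python
# P(w_1n) = sum of P(t_j) over all parses t_j whose yield is w_1n.
def string_prob(parses):
    return sum(tree_prob(t) for t in parses)
```
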
Example PCFG
(Figures: the example PCFG and its derivations, continued over several slides.)

P(tree) computation
(Figure: worked P(tree) computation.)

Time to Parse
• Let's parse!!
• Almost ready…
• Trees must be in Chomsky Normal Form first.

Chomsky Normal Form
• All rules are Z -> X Y or Z -> w
• Transforming a grammar to CNF does not change its weak generative capacity
• Remove all unary rules and empties
• Transform n-ary rules: VP -> V NP PP becomes VP -> V @VP-V and @VP-V -> NP PP (a sketch of this step follows below)
• Why do we do this? Parsing is easier now.
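
Here is a minimal sketch of just the n-ary binarization step, using the @ naming convention from the slide. Unary and empty removal are omitted; in a PCFG, the original rule's probability would go on the first new rule, with the introduced @ rules given probability 1.0.

```python
# Binarize n-ary rules right-branching: X -> A B C ... becomes
# X -> A @X-A and @X-A -> B C ... (recursively), as in VP -> V @VP-V.
def binarize(rules):
    out = []
    for lhs, rhs in rules:
        while len(rhs) > 2:
            new_sym = f"@{lhs}-{rhs[0]}"       # intermediate symbol, e.g. @VP-V
            out.append((lhs, (rhs[0], new_sym)))
            lhs, rhs = new_sym, rhs[1:]        # continue with the remainder
        out.append((lhs, rhs))
    return out

print(binarize([("VP", ("V", "NP", "PP"))]))
# [('VP', ('V', '@VP-V')), ('@VP-V', ('NP', 'PP'))]
```
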
Converting into CNF
(Figure: worked CNF conversion example.)

The CKY Algorithm
• Cocke-Kasami-Younger (CKY)
• Dynamic programming is back!

The CKY Algorithm
• NP -> NN NNS (0.13): p = 0.13 × .0023 × .0014 ≈ 4.19 × 10^-7
• NP -> NNP NNS (0.056): p = 0.056 × .001 × .0014 = 7.84 × 10^-8
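
Each candidate entry for a cell multiplies the rule probability by the best probabilities already stored in the two child cells; the two candidates above, spelled out:

```python
# Score of a candidate: P(rule) * P(left child cell) * P(right child cell).
cand1 = 0.13  * 0.0023 * 0.0014   # NP -> NN NNS   -> ~4.19e-07
cand2 = 0.056 * 0.001  * 0.0014   # NP -> NNP NNS  -> 7.84e-08
best = max(cand1, cand2)          # keep the higher-scoring NP for this cell
```
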
The CKY Algorithm
• What is the runtime? O(n^3) in sentence length (times a grammar-size constant): there are O(n^2) cells, and each cell tries O(n) split points
• Note that each cell must check all pairs of children below it
• Binarizing the CFG rules is a must: with n-ary rules the number of child combinations per cell explodes (a sketch of the algorithm follows below)
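
A compact probabilistic CKY sketch for a grammar already in CNF; the dictionary-based grammar format here is an assumption for illustration, not the course's code:

```python
from collections import defaultdict

def cky(words, lexical, binary):
    """Probabilistic CKY for a PCFG in Chomsky Normal Form.

    lexical: dict word -> list of (TAG, prob) for rules TAG -> word
    binary:  dict (Y, Z) -> list of (X, prob) for rules X -> Y Z
    Returns chart, back: chart[(i, j)] maps each label to the best
    probability of that label spanning words[i:j]; back holds backpointers.
    """
    n = len(words)
    chart = defaultdict(dict)
    back = {}
    for i, w in enumerate(words):                # width-1 spans: preterminals
        for tag, p in lexical.get(w, []):
            chart[(i, i + 1)][tag] = p
    for width in range(2, n + 1):                # wider spans, bottom-up
        for i in range(n - width + 1):
            j = i + width
            for k in range(i + 1, j):            # try every split point k
                for y, py in chart[(i, k)].items():
                    for z, pz in chart[(k, j)].items():
                        for x, pr in binary.get((y, z), []):
                            p = pr * py * pz     # P(rule) * P(left) * P(right)
                            if p > chart[(i, j)].get(x, 0.0):
                                chart[(i, j)][x] = p
                                back[(i, j, x)] = (k, y, z)
    return chart, back
```

The three nested loops over i, k, and j are where the O(n^3) comes from; the inner loops over cell entries and matching rules contribute the grammar-size constant.
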
Evaluating CKY
• How do we know if our parser works?
• Count the number of correct labels in your table: a constituent counts as correct only if both the label and the span it dominates match the gold tree
• [ label, start, finish ]
• Most trees have an error or two!
• Count how many spans are correct vs. wrong, and compute precision and recall over these labeled spans (a sketch follows below)
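
A hedged sketch of labeled-span evaluation, assuming the gold and predicted trees have already been flattened into (label, start, finish) triples:

```python
# Labeled-constituent precision/recall over (label, start, finish) triples.
def evaluate(gold_spans, pred_spans):
    gold, pred = set(gold_spans), set(pred_spans)
    correct = len(gold & pred)                   # spans matching in label AND extent
    precision = correct / len(pred) if pred else 0.0
    recall = correct / len(gold) if gold else 0.0
    return precision, recall

gold = {("NP", 0, 2), ("VP", 2, 5), ("S", 0, 5)}
pred = {("NP", 0, 2), ("VP", 1, 5), ("S", 0, 5)}
print(evaluate(gold, pred))   # (0.666..., 0.666...): the mis-spanned VP is wrong
```
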
Probabilities?
• Where do the probabilities come from?
• P( NP -> DT NN ) = ???
• The Penn Treebank: a corpus of newspaper articles whose sentences have been manually annotated with full parse trees
• Relative-frequency (maximum likelihood) estimate: P( NP -> DT NN ) = C( NP -> DT NN ) / C( NP )
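
The counting itself is a few lines; this sketch assumes treebank trees in the tuple encoding used earlier and is illustrative rather than the course's code:

```python
from collections import Counter

# Maximum likelihood rule probabilities from a treebank:
# P(X -> gamma) = C(X -> gamma) / C(X).
def estimate_pcfg(trees):
    rule_counts, lhs_counts = Counter(), Counter()
    def visit(tree):
        label, *children = tree
        rhs = tuple(c if isinstance(c, str) else c[0] for c in children)
        rule_counts[(label, rhs)] += 1           # count this rule occurrence
        lhs_counts[label] += 1                   # count its head nonterminal
        for c in children:
            if not isinstance(c, str):
                visit(c)
    for t in trees:
        visit(t)
    return {rule: n / lhs_counts[rule[0]] for rule, n in rule_counts.items()}
```
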