i256: applied natural language processing

I256: Applied Natural Language Processing

Marti HearstSept 27, 2006

Evaluation Measures

Evaluation MeasuresPrecision:

Proportion of those you labeled X that the gold standard thinks really is X #correctly labeled by alg/ all labels assigned by alg #True Positive / (#True Positive + #False Positive)

Recall:Proportion of those items that are labeled X in the gold standard that you actually label X#correctly labeled by alg / all possible correct labels#True Positive / (#True Positive + # False Negative)

F-measure

Can “cheat” with precision scores by labeling (almost) nothing with X.Can “cheat” on recall by labeling everything with X.The better you do on precision, the worse on recall, and vice versaThe F-measure is a balance between the two.

2*precision*recall / (recall+precision)

Evaluation Measures

Accuracy:Proportion that you got right (#True Positive + #True Negative) / N

N = TP + TN + FP + FNError:

(#False Positive + #False Negative)/N

Prec/Recall vs. Accuracy/ErrorWhen to use Precision/Recall?

Useful when there are only a few positives and many many negativesAlso good for ranked ordering

– Search results rankingWhen to use Accuracy/Error

When every item has to be judged, and it’s important that every item be correct.Error is better when the differences between algorithms are very small; let’s you focus on small improvements.

– Speech recognition

Evaluating Partial Parsing

How do we evaluate it?

Evaluating Partial Parsing

Testing our Simple FuleLet’s see where we missed:

Update rules; Evaluate Again

Evaluate on More Examples

Incorrect vs. MissedAdd code to print out which were incorrect

Missed vs. Incorrect

What is a good Chunking Baseline?

The Tree Data Structure

Baseline Code (continued)

Evaluating the Baseline

Cascaded Chunking

Next Time

Summarization

i256: applied natural language processing

Documents

i256 applied natural language processing fall 2009 lecture...

1 i256 applied natural language processing fall 2009 lecture...

i256 applied natural language processing fall 2009 lecture 5...

applied signal processing

1 i256: applied natural language processing marti hearst...

1 i256: applied natural language processing marti hearst...

applied adaptive signal processing report

i256 applied natural language processing fall 2009 lecture...

1 i256: applied natural language processing marti hearst nov...

image processing applied to traffic

1 i256: applied natural language processing marti hearst nov...

applied digital signal processing

i256 applied natural language processing fall 2009 lecture 1...

1 i256 applied natural language processing fall 2009 lecture...

i256 applied natural language processing fall 2009 lecture...

1 i256: applied natural language processing marti hearst oct...

i256 applied natural language processing fall 2009 lecture 9...

the electrical engineering and applied signal processing...

1 i256 applied natural language processing fall 2009...

applied digital signal processing -...