ling 581: advanced computational linguistics lecture notes january 19th
Post on 21-Dec-2015
216 Views
Preview:
TRANSCRIPT
Administrivia• New room
– Shantz 338– (I have asked Jennifer Columbus to investigate refund: however, I’m told it may
not happen)
Marshall 480
Shantz338
Penn Treebank
• Availability– Source:• Linguistic Data Consortium (LDC)• U. of Arizona is a (fee-paying) member of this
consortium• Resources are made available to the community
through the main library• URL
– http://sabio.library.arizona.edu/search/X
tregex
• Tregex is a Tgrep2-style utility for matching patterns in trees.
writtenIn Java
run-tregex-gui.command shell script
-mx flag, the 300m default memory size will need to be increased depending on the platform
tregex• Different results from:
– @SBAR < /^WH.*-([0-9]+)$/#1%index << (@NP < (/^-NONE-/ < /^\*T\*-([0-9]+)$/#1%index))
top related