Extracting ""((Adj|Noun)+|((Adj|Noun)(Noun-Prep)?)(Adj|Noun))Noun"" from Text (Justeson & Katz, 1995)
- by ssuhan
I would like to query if it is possible to extract ((Adj|Noun)+|((Adj|Noun)(Noun-Prep)?)(Adj|Noun))Noun proposed by Justeson and Katz (1995) in R package openNLP?
That is, I would like to use this linguistic filtering to extract candidate noun phrases.
I cannot well understand its meaning.
Could you do me a favor to explain it or transform such representation into R language.
Many thanks.
Maybe we can start the sample code from:
library("openNLP")
acq <- "This paper describes a novel optical thread plug
gauge (OTPG) for internal thread inspection using machine
vision. The OTPG is composed of a rigid industrial
endoscope, a charge-coupled device camera, and a two
degree-of-freedom motion control unit. A sequence of
partial wall images of an internal thread are retrieved and
reconstructed into a 2D unwrapped image. Then, a digital
image processing and classification procedure is used to
normalize, segment, and determine the quality of the
internal thread."
acqTag <- tagPOS(acq)
acqTagSplit = strsplit(acqTag," ")