[Question] Finding adverbs with regular expressions

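Author: yuseke (大將)   2015-12-08 17:34:57
As the title says.
I've been reading up on regular expressions (the re module) these past couple of days.
According to the Python Software Foundation's documentation: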
https://docs.python.org/2/library/re.html
7.2.5.7. Finding all Adverbs and their Positions
If one wants more information about all matches of a pattern than the matched
text, finditer() is useful as it provides instances of MatchObject instead of
strings. Continuing with the previous example, if one was a writer who wanted
to find all of the adverbs and their positions in some text, he or she would
use finditer() in the following manner:
>>> import re
>>> text = "He was carefully disguised but captured quickly by police."
>>> for m in re.finditer(r"\w+ly", text):
...     print '%02d-%02d: %s' % (m.start(), m.end(), m.group(0))
07-16: carefully
40-47: quickly
About the line "for m in re.finditer(r"\w+ly", text):"
I have a question about this part:
not every adverb ends in -ly (for example "fast" or "well").
How should that case be handled?
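To make the concern concrete, here is a small sketch of what the pattern from the docs does on a different sentence. The sentence and the helper name find_ly_words are made up for illustration. The pattern skips adverbs without the -ly suffix, and it also picks up non-adverbs that happen to contain "ly".

```python
import re

def find_ly_words(text):
    """Return every substring matched by the docs pattern r"\\w+ly"."""
    return [m.group(0) for m in re.finditer(r"\w+ly", text)]

# Illustrative sentence: "fast" and "well" are adverbs, "family" is a noun.
text = "The family arrived fast and ate well while flying home."

# "fast" and "well" are missed; "family" is matched, and so is the
# "fly" inside "flying", because the pattern has no word boundary.
print(find_ly_words(text))  # ['family', 'fly']
```

Adding \b on both sides of the pattern would drop the partial match "fly", but "family" would still be reported and "fast" and "well" would still be missed.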
Author: tiefblau (tiefblau)   2015-12-08 17:39:00
It's only meant as an example......
Author: yuseke (大將)   2015-12-08 17:43:00
Then what should I do if I actually run into this problem?
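Author: uranusjr (←這人是超級笨蛋)   2015-12-08 17:45:00
You would have to use a dictionary (a word list). And because some adverbs are spelled the same as words of other parts of speech, you would probably need machine learning to tell them apart. Natural language analysis is hard.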
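A minimal sketch of the dictionary approach, assuming a tiny hand-made word list. ADVERBS and find_adverbs are hypothetical names, and a real list would be far larger. It reports positions the same way as the finditer() example from the docs.

```python
import re

# Hypothetical word list for illustration only.
ADVERBS = {"carefully", "quickly", "fast", "well", "very", "often"}

def find_adverbs(text, adverbs=ADVERBS):
    """Return (start, end, word) for each word of text that is in the list."""
    hits = []
    for m in re.finditer(r"[A-Za-z]+", text):
        if m.group(0).lower() in adverbs:
            hits.append((m.start(), m.end(), m.group(0)))
    return hits

text = "He was carefully disguised but captured quickly by police."
for start, end, word in find_adverbs(text):
    print("%02d-%02d: %s" % (start, end, word))
```

The lookup cannot see context: in "a fast car" the adjective "fast" is reported as well, which is the ambiguity mentioned above.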
Author: bibo9901 (function(){})()   2015-12-08 20:56:00
Do POS tagging with NLTK, then look the words up in WordNet (?)
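A rough sketch of the POS tagging idea, under a few assumptions: NLTK is installed, its tokenizer and tagger data have been fetched with nltk.download(), and the default tagger emits Penn Treebank tags, where RB, RBR and RBS mark adverbs. The function names are hypothetical, and the WordNet lookup is left out.

```python
ADVERB_TAGS = {"RB", "RBR", "RBS"}  # Penn Treebank adverb tags

def adverbs_from_tagged(tagged):
    """Keep the words whose tag marks them as an adverb."""
    return [word for word, tag in tagged if tag in ADVERB_TAGS]

def find_adverbs_nltk(text):
    """Tokenize and tag text with NLTK, then filter for adverbs."""
    import nltk  # imported lazily so the filter above runs without NLTK
    return adverbs_from_tagged(nltk.pos_tag(nltk.word_tokenize(text)))

# Hand-written tags for illustration, in the (word, tag) shape
# that nltk.pos_tag returns.
tagged = [("He", "PRP"), ("runs", "VBZ"), ("fast", "RB"),
          ("and", "CC"), ("sings", "VBZ"), ("well", "RB")]
print(adverbs_from_tagged(tagged))  # ['fast', 'well']
```

Because the tagger looks at the surrounding words, it has a chance of telling the adverb "fast" from the adjective "fast", which a plain word list cannot do.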
