Acronym extraction system and method of identifying acronyms and extracting corresponding expansions from text
Abstract
An acronym expansion system of the present invention receives electronic documents and extracts acronyms and their corresponding expansions. A part-of-speech tagger decomposes text into string tokens or words and tags each with its part of speech, while an acronym identifier determines whether a word is a potential acronym based on various conditions. An expansion identifier retrieves lists of the words preceding and following a potential acronym, in which the expansion is then sought. The resulting word lists are examined sequentially to identify and retrieve an expansion for the potential acronym. An expansion extractor receives the potential acronym and a processed word list, and retrieves the expansion of the potential acronym from that list. The extractor may utilize information from prior search iterations, and verifies an extracted expansion against a set of rules to remove spurious expansions.
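The pipeline in the abstract can be illustrated with a minimal Python sketch. It is not the patented method: the abstract does not state the acronym conditions, the matching procedure, or the verification rules, so everything concrete here is an assumption made for illustration. A regular-expression tokenizer stands in for the part-of-speech tagger, a token counts as a potential acronym if it is all uppercase and 2 to 10 letters long, the extractor looks for consecutive words whose initials spell the acronym, and the single verification rule rejects expansions that contain the acronym itself. Reuse of information from prior search iterations is not modeled. All function names are hypothetical.

```python
import re


def is_potential_acronym(token, min_len=2, max_len=10):
    """Assumed condition: all-uppercase alphabetic token within a length range."""
    return token.isalpha() and token.isupper() and min_len <= len(token) <= max_len


def candidate_word_lists(tokens, index, window):
    """Return the word lists preceding and following the token at `index`."""
    preceding = tokens[max(0, index - window):index]
    following = tokens[index + 1:index + 1 + window]
    return [preceding, following]


def extract_expansion(acronym, words):
    """Find consecutive words whose initials spell the acronym (assumed matcher)."""
    n = len(acronym)
    for start in range(len(words) - n + 1):
        run = words[start:start + n]
        if all(w[:1].upper() == c for w, c in zip(run, acronym)):
            return run
    return None


def is_plausible(acronym, expansion):
    """Assumed verification rule: the expansion must not contain the acronym."""
    return acronym not in expansion


def find_acronyms(text, window=8):
    """Map each potential acronym in `text` to the first verified expansion."""
    # Stand-in for the part-of-speech tagger: plain alphabetic tokenization.
    tokens = re.findall(r"[A-Za-z]+", text)
    results = {}
    for i, tok in enumerate(tokens):
        if not is_potential_acronym(tok) or tok in results:
            continue
        # Examine the preceding list first, then the following list.
        for words in candidate_word_lists(tokens, i, window):
            expansion = extract_expansion(tok, words)
            if expansion and is_plausible(tok, expansion):
                results[tok] = " ".join(expansion)
                break
    return results
```

Calling `find_acronyms` on a sentence such as "The part of speech (POS) tagger labels each word." pairs `POS` with the three words before it. Initial-letter matching is deliberately simple: it misses expansions that skip function words or draw several letters from one word, which is the kind of case a fuller rule set would need to handle.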