首页>
外国专利>
System and method for rapid development of natural language understanding using active learning
System and method for rapid development of natural language understanding using active learning
展开▼
机译:利用主动学习快速发展自然语言理解的系统和方法
展开▼
页面导航
摘要
著录项
相似文献
摘要
A method, computer program product, and data processing system for training a statistical parser by utilizing active learning techniques to reduce the size of the corpus of human-annotated training samples (e.g., sentences) needed is disclosed. According to a preferred embodiment of the present invention, the statistical parser under training is used to compare the grammatical structure of the samples according to the parser's current level of training. The samples are then divided into clusters, with each cluster representing samples having a similar structure as ascertained by the statistical parser. Uncertainty metrics are applied to the clustered samples to select samples from each cluster that reflect uncertainty in the statistical parser's grammatical model. These selected samples may then be annotated by a human trainer for training the statistical parser.
展开▼