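Synchronous and Monitoring Chinese Corpora: A New Direction for Corpus Linguistics (汉语共时语料库与追踪语料库:语料库语言学的新方向)

Published in: 《中文信息学报》 (Journal of Chinese Information Processing)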

Abstract

Advances in information technology and the spread of the Internet have offered important solutions to many classical problems in Chinese natural language processing. They have also opened up new opportunities for corpus linguistics, in particular the sustained cultivation and use of large corpora to monitor and track language phenomena, and to investigate language development in relation to the underlying social and cultural implications traditionally studied in the humanities and social sciences. Over the past 17 years, the LIVAC corpus has grown into a very large corpus of its kind, containing the results of analyzing about 400 million Chinese characters drawn from news media in seven pan-Chinese communities. This long-term effort enables LIVAC to function as a series of "time capsules" (时间锦囊), which provide a solid foundation for scientifically tracking and monitoring language change together with the associated social and cultural developments within and across pan-Chinese regions. This paper introduces how LIVAC has evolved from a Chinese synchronous corpus (共时语料库) into a monitoring corpus (追踪语料库) of Chinese communities.
