首页> 外文会议>Pacific Symposium on Biocomputing 2004; Jan 6-10, 2004; Hawaii, USA >THE COMPOSITIONAL STRUCTURE OF GENE ONTOLOGY TERMS
【24h】

THE COMPOSITIONAL STRUCTURE OF GENE ONTOLOGY TERMS

机译:基因本体论术语的组成结构

获取原文
获取原文并翻译 | 示例

摘要

An analysis of the term names in the Gene Ontology reveals the prevalence of substring relations between terms: 65.3% of all GO terms contain another GO term as a proper substring. This substring relation often coincides with a derivational relationship between the terms. For example, the term regulation of cell proliferation (G0:0042127) is derived from the term cell proliferation (00:0008283) by addition of the phrase regulation of. Further, we note that particular substrings which are not themselves GO terms (e.g. regulation of in the preceding example) recur frequently and in consistent subtrees of the ontology, and that these frequently occurring substrings often indicate interesting semantic relationships between the related terms. We describe the extent of these phenomena―substring relations between terms, and the recurrence of derivational phrases such as regulation of―and propose that these phenomena can be exploited in various ways to make the information in GO more computationally accessible, to construct a conceptually richer representation of the data encoded in the ontology, and to assist in the analysis of natural language texts.
机译:对基因本体中术语名称的分析揭示了术语之间子字符串关系的普遍性:所有GO术语中有65.3%包含另一个GO术语作为适当的子字符串。这种子串关系通常与项之间的派生关系一致。例如,术语“细胞增殖(G0:0042127)”通过添加术语“调控”而衍生自术语“细胞增殖(00:0008283)”。此外,我们注意到本身不是GO术语的特定子字符串(例如,前面示例中的reg)经常在本体的一致子树中重复出现,并且这些频繁出现的子字符串通常指示相关术语之间有趣的语义关系。我们描述了这些现象的程度(术语之间的子串关系以及诸如的调节之类的派生短语的重复出现),并提出可以以各种方式利用这些现象以使GO中的信息在计算上更易于访问,从而在概念上更丰富表示本体中编码的数据,并有助于自然语言文本的分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号