首页>
外国专利>
METHOD FOR DISAMBIGUATING BETWEEN AUTHORS WITH SAME NAME ON BASIS OF NETWORK REPRESENTATION AND SEMANTIC REPRESENTATION
METHOD FOR DISAMBIGUATING BETWEEN AUTHORS WITH SAME NAME ON BASIS OF NETWORK REPRESENTATION AND SEMANTIC REPRESENTATION
展开▼
机译:基于网络表示和语义表示的作者在作者之间消除作者的方法
展开▼
页面导航
摘要
著录项
相似文献
摘要
The present invention discloses a method for disambiguating between authors with a same name on basis of network representation and semantic representation. This method comprises: 1) extracting semantic and discrete features of each publication in a target publication library; 2) calculating a similarity between the theses based on the discrete features to obtain a relationship similarity matrix of the theses; if the publication has no common author or institution with other theses, it is added into an discrete publication set; 3) calculating a semantic similarity matrix of the theses based on the semantic features of the theses; and adding theses which do not contain the semantic features in the target publication library to the discrete publication set; 4) performing weighted summation on the relationship similarity matrix and the semantic similarity matrix to obtain a publication similarity matrix and clustering the same; adding theses which do not belong to any cluster to the publication discrete set; and 5) allocating the theses in the discrete publication set to corresponding clusters by using a method based on similarity threshold matching. The present invention enables disambiguation between the authors of the same name of theses with high accuracy.
展开▼