随着信息技术不断的发展,海量数据的处理效率成为不可逃避的问题.传统的网页分类算法在分类效果上已经相对成熟,所以在这样的背景下从传统网页分类算法中特征值权重算法的效率和代价出发,分析并提出了基于简化MD5的特征值权重算法.有效减少了特征值提取时的比对和最后一次排序的效率,从而提高了整个网页分类的效率.%With the rapid development of information technology, the process efficiency of massive data has become inescapable problem and Web page classification algorithm has matured yet, so from the efficiency and cost of eigenvalue weights algorithm in traditional web page classification algorithm is started. Eigenvalue weights algorithm based on a simplified MD5, and effective in reducing the efficiency of eigenvalue extraction time and the last sort of eigenvalue are analyzed and proposed. Improving the efficiency of the whole Web page classification is presented.
展开▼