With the rapid development of the Internet, the idea of a "user-centric, open architecture with user participation" has become popular, and Internet users have gradually shifted from passively receiving online information to actively creating it. Portals, forums, microblogs, and other Internet media have become an important platform on which people publish, spread, and obtain key information, express emotions, and voice their opinions. Capturing information about relevant social events and public opinion resources, and providing it to the relevant departments as a basis for decision-making, is an important direction in the development of public opinion monitoring systems. In this project, I was mainly responsible for the design and implementation of the public opinion information collection subsystem. Public opinion information acquisition is built on Internet search technology, so it shares many similarities with search engines in design ideas and implementation techniques, and studying search engine technology can provide valuable experience for public opinion information collection. At the beginning of the project, in order to obtain better collection breadth and accuracy, I studied the basic principles of search engines and compared existing search engine technologies, especially the key technologies of meta search engines. On that basis I determined the technical architecture of the project and drew on the indexing strengths of full-text search engines to implement the public opinion information acquisition system. For search query conversion, the query rules and page structures of the sites covered by directed and non-directed collection are analyzed in order to achieve precise acquisition. Based on a network public opinion monitoring platform, this paper studies strategies for collecting network public opinion information and designs a collection system.
The system design follows the approach of letting theoretical research guide practice. The paper first studies the structure and characteristics of network public opinion and analyzes the collection space and main sources of public opinion. Considering the current state of public opinion research at home and abroad, and addressing the widespread problems of low collection efficiency and strongly limited targets, it proposes a collection strategy based on a meta search engine in which users can set personalized public opinion topics. Keyword matching, regular expression filtering, and a domain-name-based strategy are used to ensure the topical relevance of the collected content and to filter out redundant data, which improves the running efficiency of the system. The sources of public opinion information are the places where the public expresses its views, attitudes, and inclinations: major news portals, blogs and forums, online communities, and emerging media such as microblogs. The purpose of this paper is to design an adaptive public opinion information collection system for monitoring network public opinion concerning colleges and universities, so that sensitive university-related information on the Internet can be mined in real time and more efficiently, and the acquired data can be cleaned and structured in preparation for the subsequent analysis of public opinion data, sentiment tendency, and hot events.
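
As a rough illustration of how keyword matching, regular expression filtering, and a domain-name-based strategy might be combined, the following minimal Python sketch filters a list of search results. It is not the implementation described in the paper: the function names, the keyword set, the domain list, and the noise pattern are all hypothetical values chosen for the example, and exact-URL deduplication stands in here for the removal of redundant data.

```python
import re
from urllib.parse import urlparse

# Hypothetical configuration; none of these values come from the paper.
TOPIC_KEYWORDS = {"university", "campus", "tuition"}
ALLOWED_DOMAINS = {"news.example.com", "forum.example.org"}
NOISE_PATTERN = re.compile(r"(advertis|sponsored|click here)", re.IGNORECASE)


def domain_allowed(url, allowed=ALLOWED_DOMAINS):
    """Domain-name strategy: accept a listed host or any of its subdomains."""
    host = (urlparse(url).hostname or "").lower()
    return any(host == d or host.endswith("." + d) for d in allowed)


def is_relevant(url, text):
    """Apply the domain check, the regex noise filter, then keyword matching."""
    if not domain_allowed(url):
        return False
    if NOISE_PATTERN.search(text):
        return False
    lowered = text.lower()
    return any(kw in lowered for kw in TOPIC_KEYWORDS)


def filter_results(results):
    """Keep relevant (url, text) pairs, skipping URLs already seen."""
    seen = set()
    kept = []
    for url, text in results:
        if url in seen:
            continue  # redundant entry: same URL returned more than once
        seen.add(url)
        if is_relevant(url, text):
            kept.append((url, text))
    return kept
```

In a meta search setting, `results` would be the merged output of several underlying search engines, which is why the same URL can appear more than once; in practice the keyword set would come from the user's personalized topic settings.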