User identification and session identification are two major steps in preprocessing web log data for web usage mining. This paper introduces a fast active user-based user identification algorithm with time complexity O(n). The algorithm uses both an IP address and a finite users' inactive time to identify different users in the web log. Website ontology is useful for identifying website structure and break points for browsing behavior. For session identification, we present an ontology-based method that utilizes the website structure and functionalities to identify different sessions.
展开▼