Semantic preprocessing of Web request streams for Web usage mining
- Authors
- Jung, Jason J.
- Issue Date
- 2005
- Publisher
- GRAZ UNIV TECHNOLGOY, INST INFORMATION SYSTEMS COMPUTER MEDIA-IICM
- Keywords
- Web usage mining; semantic analysis; browsing patterns
- Citation
- JOURNAL OF UNIVERSAL COMPUTER SCIENCE, v.11, no.8, pp 1383 - 1396
- Pages
- 14
- Journal Title
- JOURNAL OF UNIVERSAL COMPUTER SCIENCE
- Volume
- 11
- Number
- 8
- Start Page
- 1383
- End Page
- 1396
- URI
- https://scholarworks.bwise.kr/cau/handle/2019.sw.cau/37655
- DOI
- 10.3217/jucs-011-08-1383
- ISSN
- 0948-695X
0948-6968
- Abstract
- Efficient data preparation needs to discover the underlying knowledge from complicated Web usage data. In this paper, we have focused on two main tasks, semantic outlier detection from online Web request streams and segmentation (or sessionization) of them. We thereby exploit semantic technologies to infer the relationships among Web requests. Web ontologies such as taxonomies and directories can label each Web request as all the corresponding hierarchical topic paths. Our algorithm consists of two steps. The first step is the nested repetition of top-down partitioning for establishing a set of candidates of session boundaries, and the next step is evaluation process of bottom-up merging for reconstructing segmented sequences. In addition, we propose the hybrid approach of this method, as combining with the existing heuristics. Using synthesized dataset and real-world dataset of the access log files of IRCache, we conducted experiments and showed that semantic preprocessing method improves the performance of rule discovery algorithms. It means that we can conceptually track the behavior of users tending to easily change their intentions and interests, or simultaneously try to search various kinds of information on the Web.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - College of Software > School of Computer Science and Engineering > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.