Semantic analysis for data preparation of web usage mining
- Authors
- Jung, Jason J.; Jo, GS
- Issue Date
- 2004
- Publisher
- SPRINGER-VERLAG BERLIN
- Citation
- INNOVATIONS IN APPLIED ARTIFICIAL INTELLIGENCE, v.3029, pp 1249 - 1258
- Pages
- 10
- Journal Title
- INNOVATIONS IN APPLIED ARTIFICIAL INTELLIGENCE
- Volume
- 3029
- Start Page
- 1249
- End Page
- 1258
- URI
- https://scholarworks.bwise.kr/cau/handle/2019.sw.cau/37636
- DOI
- 10.1007/978-3-540-24677-0_128
- ISSN
- 0302-9743
1611-3349
- Abstract
- As the web usage patterns from clients are getting more complex, simple sessionizations based on time and navigation-oriented heuristics have been restricted to exploit various kinds of rule discovering methods. In this paper, we present semantic analysis approach based on semantic session reconstruction as finding out semantic outliers from web log data. Web directory service is applied to enrich semantics to web logs, categorizing them to all possible hierarchical paths. In order to detect the candidate set of session identifiers, semantic factors like semantic mean, deviation, and distance matrix are established. Eventually, each semantic session is obtained based on nested repetition of top-down partitioning and evaluation process. For experiment, we applied this ontology-oriented heuristics to sessionize the access log files for one week from IRCache. Compared with time-oriented heuristics, more than 48% of sessions were additionally detected by semantic outlier analysis. It means that we can conceptually track the behavior of users tending to easily change their intentions and interests, or simultaneously try to search various kinds of information on the web.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - College of Software > School of Computer Science and Engineering > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.