FoDoSu: Multi-document summarization exploiting semantic analysis based on social Folksonomy
- Authors
- Heu, Jee-Uk; Qasim, Iqbal; Lee, Dong-Ho
- Issue Date
- Jan-2015
- Publisher
- ELSEVIER SCI LTD
- Keywords
- Multi-document summarization; Folksonomy; Tag cluster; Semantic analysis
- Citation
- INFORMATION PROCESSING & MANAGEMENT, v.51, no.1, pp.212 - 225
- Indexed
- SCIE
SSCI
SCOPUS
- Journal Title
- INFORMATION PROCESSING & MANAGEMENT
- Volume
- 51
- Number
- 1
- Start Page
- 212
- End Page
- 225
- URI
- https://scholarworks.bwise.kr/erica/handle/2021.sw.erica/19243
- DOI
- 10.1016/j.ipm.2014.06.003
- ISSN
- 0306-4573
- Abstract
- Multi-document summarization techniques aim to reduce documents into a small set of words or paragraphs that convey the main meaning of the original document. Many approaches to multi-document summarization have used probability-based methods and machine learning techniques to simultaneously summarize multiple documents sharing a common topic. However, these techniques fail to semantically analyze proper nouns and newly-coined words because most depend on an out-of-date dictionary or thesaurus. To overcome these drawbacks, we propose a novel multi-document summarization system called FoDoSu, or Folksonomy-based Multi-Document Summarization, that employs the tag clusters used by Flickr, a Folksonomy system, for detecting key sentences from multiple documents. We first Create a word frequency table for analyzing the semantics and contributions of words using the HITS algorithm. Then, by exploiting tag clusters, we analyze the semantic relationships between words in the word frequency table. Finally, we create a summary of multiple documents by analyzing the importance of each word and its semantic relatedness to others. Experimental results from the TAC 2008 and 2009 data sets demonstrate the improvement of our proposed framework over existing summarization systems. (C) 2014 Elsevier Ltd. All rights reserved.
- Files in This Item
-
Go to Link
- Appears in
Collections - COLLEGE OF COMPUTING > DEPARTMENT OF ARTIFICIAL INTELLIGENCE > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.