Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Classifying web pages using information extraction patterns - Preliminary results and findings

Full metadata record
DC Field Value Language
dc.contributor.authorSoon, L.-K.-
dc.contributor.authorLee, S.H.-
dc.date.available2019-04-10T10:59:48Z-
dc.date.created2018-04-17-
dc.date.issued2010-
dc.identifier.isbn9780769543192-
dc.identifier.urihttp://scholarworks.bwise.kr/ssu/handle/2018.sw.ssu/33335-
dc.description.abstractWeb page classification plays an essential role in facilitating more efficient information retrieval and information processing. Conventionally, web text documents are represented by term frequency matrix for classification purpose. However, considering the limitations of representing documents using terms or keywords, we propose to represent web pages using information extraction patterns that are identified within the pages respectively. In this paper, we present the results as well as the findings obtained from our preliminary experiments. Our experimental results indicate that the existence of a word in different contexts has different impact to the classification task. Thus, the extraction patterns used to represent each document are more semantically meaningful and give better insight to web classification in comparison with keywords. © 2010 IEEE.-
dc.relation.isPartOfProceedings of the 6th International Conference on Signal Image Technology and Internet Based Systems, SITIS 2010-
dc.titleClassifying web pages using information extraction patterns - Preliminary results and findings-
dc.typeConference-
dc.identifier.doi10.1109/SITIS.2010.42-
dc.type.rimsCONF-
dc.identifier.bibliographicCitation6th International Conference on Signal Image Technology and Internet Based Systems, SITIS 2010, pp.195 - 202-
dc.description.journalClass2-
dc.identifier.scopusid2-s2.0-79952541239-
dc.citation.conferenceDate2010-12-15-
dc.citation.conferencePlaceKuala Lumpur-
dc.citation.endPage202-
dc.citation.startPage195-
dc.citation.title6th International Conference on Signal Image Technology and Internet Based Systems, SITIS 2010-
dc.contributor.affiliatedAuthorLee, S.H.-
dc.type.docTypeConference Paper-
Files in This Item
There are no files associated with this item.
Appears in
Collections
College of Information Technology > School of Software > 2. Conference Papers

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Lee, Sang Ho photo

Lee, Sang Ho
College of Information Technology (School of Software)
Read more

Altmetrics

Total Views & Downloads

BROWSE