Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Using multiple indexes for efficient subsequence matching in time-series databases

Full metadata record
DC Field Value Language
dc.contributor.authorLim, Seung Hwan-
dc.contributor.authorPark, Heejin-
dc.contributor.authorKim, Sang Wook-
dc.date.accessioned2022-12-21T05:18:30Z-
dc.date.available2022-12-21T05:18:30Z-
dc.date.created2022-08-26-
dc.date.issued2007-12-
dc.identifier.issn0020-0255-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/179305-
dc.description.abstractA time-series database is a set of data sequences, each of which is a list of changing values of an object in a given period of time. Subsequence matching is an operation that searches for such data subsequences whose changing patterns are similar to a query sequence in a time-series database. This paper addresses a performance issue of time-series subsequence matching. First, we quantitatively examine the performance degradation caused by the window size effect, and then show that the performance of subsequence matching with a single index is not satisfactory in real applications. We claim that index interpolation is a fairly effective tool to solve this problem. Index interpolation performs subsequence matching by selecting the most appropriate one from multiple indexes built on windows of their distinct sizes. For index interpolation, we need to decide the sizes of windows for multiple indexes to be built. In this paper, we solve the problem of selecting optimal window sizes from the perspective of physical database design. Given a set of pairs (length,ftequency) of query sequences to be performed in a target application and a set of window sizes for building multiple indexes, we devise a formula that estimates the overall cost of all the subsequence matchings performed in a target application. By using this formula, we propose an algorithm that determines the optimal window sizes for maximizing the performance of entire subsequence matchings. We formally prove the optimality as well as the effectiveness of the algorithm. Finally, we show the superiority of our approach by performing extensive experiments with a real-life stock data set and a large volume of synthetic data sets.-
dc.language영어-
dc.language.isoen-
dc.publisherELSEVIER SCIENCE INC-
dc.titleUsing multiple indexes for efficient subsequence matching in time-series databases-
dc.typeArticle-
dc.contributor.affiliatedAuthorPark, Heejin-
dc.contributor.affiliatedAuthorKim, Sang Wook-
dc.identifier.doi10.1016/j.ins.2007.07.004-
dc.identifier.scopusid2-s2.0-34648851357-
dc.identifier.wosid000250899400014-
dc.identifier.bibliographicCitationINFORMATION SCIENCES, v.177, no.24, pp.5691 - 5706-
dc.relation.isPartOfINFORMATION SCIENCES-
dc.citation.titleINFORMATION SCIENCES-
dc.citation.volume177-
dc.citation.number24-
dc.citation.startPage5691-
dc.citation.endPage5706-
dc.type.rimsART-
dc.type.docTypeArticle-
dc.description.journalClass1-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalWebOfScienceCategoryComputer Science, Information Systems-
dc.subject.keywordPlusALGORITHM-
dc.subject.keywordPlusRETRIEVAL-
dc.subject.keywordPlusTRANSFORM-
dc.subject.keywordAuthortime-series databases-
dc.subject.keywordAuthorsubsequence matching-
dc.subject.keywordAuthorindex interpolation-
dc.subject.keywordAuthorphysical database design-
dc.identifier.urlsciencedirect.com/science/article/pii/S0020025507003325?via%3Dihub-
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Park, Hee jin photo

Park, Hee jin
COLLEGE OF ENGINEERING (SCHOOL OF COMPUTER SCIENCE)
Read more

Altmetrics

Total Views & Downloads

BROWSE