Detailed Information

Cited 10 time in webofscience Cited 0 time in scopus
Metadata Downloads

Linearized Suffix Tree: an Efficient Index Data Structure with the Capabilities of Suffix Trees and Suffix Arrays

Authors
Kim, Dong KyueKim, MinhwanPark, Heejin
Issue Date
Nov-2008
Publisher
SPRINGER
Keywords
Suffix trees; Suffix arrays; Index data structures; String algorithms
Citation
ALGORITHMICA, v.52, no.3, pp.350 - 377
Indexed
SCIE
SCOPUS
Journal Title
ALGORITHMICA
Volume
52
Number
3
Start Page
350
End Page
377
URI
https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/171791
DOI
10.1007/s00453-007-9061-2
ISSN
0178-4617
Abstract
Suffix trees and suffix arrays are fundamental full-text index data structures to solve problems occurring in string processing. Since suffix trees and suffix arrays have different capabilities, some problems are solved more efficiently using suffix trees and others are solved more efficiently using suffix arrays. We consider efficient index data structures with the capabilities of both suffix trees and suffix arrays without requiring much space. When the size of an alphabet is small, enhanced suffix arrays are such index data structures. However, when the size of an alphabet is large, enhanced suffix arrays lose the power of suffix trees. Pattern searching in an enhanced suffix array takes O(m vertical bar Sigma vertical bar) time while pattern searching in a suffix tree takes O(m log vertical bar Sigma vertical bar) time where m is the length of a pattern and Sigma is an alphabet. In this paper, we present linearized suffix trees which are efficient index data structures with the capabilities of both suffix trees and suffix arrays even when the size of an alphabet is large. A linearized suffix tree has all the functionalities of the enhanced suffix array and supports the pattern search in O(m log vertical bar Sigma vertical bar) time. In a different point of view, it can be considered a practical implementation of the suffix tree supporting O(m log vertical bar Sigma vertical bar)-time pattern search. In addition, we also present two efficient algorithms for computing suffix links on the enhanced suffix array and the linearized suffix tree. These are the first algorithms that run in O(n) time without using the range minima query. Our experimental results show that our algorithms are faster than the previous algorithms.
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles
서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Kim, Dong Kyue photo

Kim, Dong Kyue
COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)
Read more

Altmetrics

Total Views & Downloads

BROWSE