Article

On-Line Indexing for General Alphabets via Predecessor Queries on Subsets of an Ordered List

Author:

Tsvi KopelowitzAuthors Info & Claims

FOCS '12: Proceedings of the 2012 IEEE 53rd Annual Symposium on Foundations of Computer Science

Pages 283 - 292

https://doi.org/10.1109/FOCS.2012.79

Published: 20 October 2012 Publication History

Abstract

The problem of \textit{Text Indexing} is a fundamental algorithmic problem in which one wishes to preprocess a text in order to quickly locate pattern queries within the text. In the ever evolving world of dynamic and on-line data, there is also a need for developing solutions to index texts which arrive on-line, i.e.~a character at a time, and still be able to quickly locate said patterns. In this paper, a new solution for on-line indexing is presented by providing an on-line suffix tree construction in $O(\log \log n + \log\log |\Sigma|)$ worst-case expected time per character, where $n$ is the size of the string, and $\Sigma$ is the alphabet. This improves upon all previously known on-line suffix tree constructions for general alphabets, at the cost of having the run time in expectation. The main idea is to reduce the problem of constructing a suffix tree on-line to an interesting variant of the order maintenance problem, which may be of independent interest. In the famous order maintenance problem, one wishes to maintain a dynamic list $L$ of size $n$ under insertions, deletions, and order queries. In an order query, one is given two nodes from $L$ and must determine which node precedes the other in $L$. In an extension to this problem, named the {\em Predecessor search on Dynamic Subsets of an Ordered Dynamic List problem} (POLP for short), it is also necessary to maintain dynamic subsets $S_1, \cdots, S_k\subseteq L$, such that given some $u\in L$ it will be possible to quickly locate the predecessor of $u$ in $S_i$, for any integer $1\leq i \leq k$. This paper provides an efficient data structure capable of locating the predecessor of $u$ in $S_i$ in $O(\log \log n)$ worst-case time and answering order queries on $L$ in $O(1)$ worst-case time, while allowing updates to $L$ in $O(1)$ worst-case expected time and updates to the subsets in $O(\log \log n)$ worst-case expected time. This improves over a previous data structure which may be implicitly obtained from Dietz~\cite{Dietz89}, in which the updates to the sets and $L$ are done in $O(\log \log n)$ \textit{amortized} expected time. In addition, the bounds shown here match the currently best known bounds for predecessor search in the RAM model. Furthermore, this paper improves or simplifies bounds for several additional applications, including fully-persistent arrays, the monotonic list labeling problem, and the Order-Maintenance Problem.

Cited By

View all

Kempa DKociumaka TLeonardi SGupta A(2022)Dynamic suffix array with polylogarithmic queries and updatesProceedings of the 54th Annual ACM SIGACT Symposium on Theory of Computing10.1145/3519935.3520061(1657-1670)Online publication date: 9-Jun-2022
https://dl.acm.org/doi/10.1145/3519935.3520061
Kuszmaul JAgrawal KLee I(2022)Bamboo Trimming RevisitedProceedings of the 34th ACM Symposium on Parallelism in Algorithms and Architectures10.1145/3490148.3538580(411-417)Online publication date: 11-Jul-2022
https://dl.acm.org/doi/10.1145/3490148.3538580
Bender MKuszmaul WMarx D(2021)Randomized cup game algorithms against strong adversariesProceedings of the Thirty-Second Annual ACM-SIAM Symposium on Discrete Algorithms10.5555/3458064.3458187(2059-2077)Online publication date: 10-Jan-2021
https://dl.acm.org/doi/10.5555/3458064.3458187
Show More Cited By

On-Line Indexing for General Alphabets via Predecessor Queries on Subsets of an Ordered List
1. Mathematics of computing
  1. Discrete mathematics
    1. Graph theory

Recommendations

Compressed Property Suffix Trees
DCC '11: Proceedings of the 2011 Data Compression Conference

Property matching is a biologically motivated problem where the task is to find those occurrences of an online pattern $P$ in a string text $T$ (of size $n$), such that the matched part in $T$ satisfies some conceptual property. The property of a string ...
Predecessor queries in constant time?
ESA'05: Proceedings of the 13th annual European conference on Algorithms

In this paper we design a new static data structure for batched predecessor queries. In particular, our data structure supports $O(\sqrt{{\rm log}n})$ queries in O(1) time per query and requires $O(n^{\epsilon\sqrt{{\rm log}n}})$ space for any ε > 0. ...
Optimal suffix tree construction with large alphabets
FOCS '97: Proceedings of the 38th Annual Symposium on Foundations of Computer Science

The suffix tree of a string is the fundamental data structure of combinatorial pattern matching. Weiner (1973), who introduced the data structure, gave an O(n)-time algorithm for building the suffix tree of an n-character string drawn from a constant ...

Comments

Information & Contributors

Information

Published In

FOCS '12: Proceedings of the 2012 IEEE 53rd Annual Symposium on Foundations of Computer Science

October 2012

770 pages

ISBN:9780769548746

Publisher

IEEE Computer Society

United States

Publication History

Published: 20 October 2012

Author Tags

Qualifiers

Article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

14
Total Citations
View Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 30 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

View all

Kempa DKociumaka TLeonardi SGupta A(2022)Dynamic suffix array with polylogarithmic queries and updatesProceedings of the 54th Annual ACM SIGACT Symposium on Theory of Computing10.1145/3519935.3520061(1657-1670)Online publication date: 9-Jun-2022
https://dl.acm.org/doi/10.1145/3519935.3520061
Kuszmaul JAgrawal KLee I(2022)Bamboo Trimming RevisitedProceedings of the 34th ACM Symposium on Parallelism in Algorithms and Architectures10.1145/3490148.3538580(411-417)Online publication date: 11-Jul-2022
https://dl.acm.org/doi/10.1145/3490148.3538580
Bender MKuszmaul WMarx D(2021)Randomized cup game algorithms against strong adversariesProceedings of the Thirty-Second Annual ACM-SIAM Symposium on Discrete Algorithms10.5555/3458064.3458187(2059-2077)Online publication date: 10-Jan-2021
https://dl.acm.org/doi/10.5555/3458064.3458187
Kuszmaul WKhuller SVassilevska Williams V(2021)How asymmetry helps buffer management: achieving optimal tail size in cup gamesProceedings of the 53rd Annual ACM SIGACT Symposium on Theory of Computing10.1145/3406325.3451033(1248-1261)Online publication date: 15-Jun-2021
https://dl.acm.org/doi/10.1145/3406325.3451033
Navarro GRojas-Ledesma J(2020)Predecessor SearchACM Computing Surveys10.1145/340937153:5(1-35)Online publication date: 28-Sep-2020
https://dl.acm.org/doi/10.1145/3409371
Bender MFarach-Colton MKuszmaul WCharikar MCohen E(2019)Achieving optimal backlog in multi-processor cup gamesProceedings of the 51st Annual ACM SIGACT Symposium on Theory of Computing10.1145/3313276.3316342(1148-1157)Online publication date: 23-Jun-2019
https://dl.acm.org/doi/10.1145/3313276.3316342
Gawrychowski PKarczmarz AKociumaka TŁącki JSankowski PCzumaj A(2018)Optimal dynamic stringsProceedings of the Twenty-Ninth Annual ACM-SIAM Symposium on Discrete Algorithms10.5555/3174304.3175404(1509-1528)Online publication date: 7-Jan-2018
https://dl.acm.org/doi/10.5555/3174304.3175404
Saks M(2018)Online Labeling: Algorithms, Lower Bounds and Open QuestionsComputer Science – Theory and Applications10.1007/978-3-319-90530-3_3(23-28)Online publication date: 6-Jun-2018
https://dl.acm.org/doi/10.1007/978-3-319-90530-3_3
Bender MFineman JGilbert SKopelowitz TMontes PKlein P(2017)File maintenanceProceedings of the Twenty-Eighth Annual ACM-SIAM Symposium on Discrete Algorithms10.5555/3039686.3039784(1503-1522)Online publication date: 16-Jan-2017
https://dl.acm.org/doi/10.5555/3039686.3039784
Kucherov GNekrich Y(2017)Full-Fledged Real-Time Indexing for Constant Size AlphabetsAlgorithmica10.1007/s00453-016-0199-779:2(387-400)Online publication date: 1-Oct-2017
https://dl.acm.org/doi/10.1007/s00453-016-0199-7
Show More Cited By

Abstract

Cited By

Recommendations

Compressed Property Suffix Trees

Predecessor queries in constant time?

Optimal suffix tree construction with large alphabets

Comments

Information

Published In

Publisher

Publication History

Author Tags

Qualifiers

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

View options

Share

Share this Publication link

Share on social media

Affiliations