- research-article, April 2024
Generic Non-recursive Suffix Array Construction
ACM Transactions on Algorithms (TALG), Volume 20, Issue 2, Article No.: 18, Pages 1–42, https://doi.org/10.1145/3641854
The suffix array is arguably one of the most important data structures in sequence analysis and consequently there is a multitude of suffix sorting algorithms. However, to this date the GSACA algorithm introduced in 2015 is the only known non-recursive ...
- research-article, January 2024
Reconstructing parameterized strings from parameterized suffix and LCP arrays
Theoretical Computer Science (TCSC), Volume 981, Issue C, https://doi.org/10.1016/j.tcs.2023.114230
Abstract: Reconstructing input from a data structure entails determining whether an instance of the data structure is in fact valid or not, and if valid, discovering the underlying data that it represents. In this paper we consider the parameterized suffix ...
- research-article, January 2023
String inference from longest-common-prefix array
Theoretical Computer Science (TCSC), Volume 942, Issue C, Pages 180–199, https://doi.org/10.1016/j.tcs.2022.11.032
Abstract: The suffix array, perhaps the most important data structure in modern string processing, is often augmented with the longest common prefix (LCP) array which stores the lengths of the longest common prefixes for lexicographically ...
- Article, November 2022
On the Optimisation of the GSACA Suffix Array Construction Algorithm
String Processing and Information Retrieval, Pages 99–113, https://doi.org/10.1007/978-3-031-20643-6_8
Abstract: The suffix array is arguably one of the most important data structures in sequence analysis and consequently there is a multitude of suffix sorting algorithms. However, to this date the GSACA algorithm introduced in 2015 is the only known non-...
- Article, August 2022
Safeness: Suffix Arrays Driven Materialized View Selection Framework for Large-Scale Workloads
Big Data Analytics and Knowledge Discovery, Pages 74–86, https://doi.org/10.1007/978-3-031-12670-3_7
Abstract: Materialized views (MVs) are an elegant redundant optimization technique for analytical workloads. Numerous frameworks and algorithms for selecting MVs have been propounded, and some are deployed in commercial DBMSs. The central role of MVs in ...
- research-article, June 2022
Dynamic suffix array with polylogarithmic queries and updates
STOC 2022: Proceedings of the 54th Annual ACM SIGACT Symposium on Theory of Computing, Pages 1657–1670, https://doi.org/10.1145/3519935.3520061
The suffix array SA[1..n] of a text T of length n is a permutation of {1, …, n} describing the lexicographical ordering of suffixes of T and is considered to be one of the most important data structures for string processing, with dozens of applications ...
- research-article, May 2022
Optimal in-place suffix sorting
Information and Computation (ICOM), Volume 285, Issue PB, https://doi.org/10.1016/j.ic.2021.104818
Abstract: The suffix array is a fundamental data structure for many applications that involve string searching and data compression. We obtain the first in-place suffix array construction algorithms that are optimal both in time and space for (...
- research-article, August 2021
The exact multiple pattern matching problem solved by a reference tree approach
Theoretical Computer Science (TCSC), Volume 882, Issue C, Pages 29–48, https://doi.org/10.1016/j.tcs.2021.06.003
Abstract: Given a text T and a set of r patterns P_1, P_2, …, P_r, the exact multiple pattern matching problem reports the ending positions of all occurrences of P_i in T for 1 ≤ i ≤ r. By transforming all substrings with a fixed length of T ...
- Article, October 2019
SACABench: Benchmarking Suffix Array Construction
- Johannes Bahne,
- Nico Bertram,
- Marvin Böcker,
- Jonas Bode,
- Johannes Fischer,
- Hermann Foot,
- Florian Grieskamp,
- Florian Kurpicz,
- Marvin Löbel,
- Oliver Magiera,
- Rosa Pink,
- David Piper,
- Christopher Poeplau
String Processing and Information Retrieval, Pages 407–416, https://doi.org/10.1007/978-3-030-32686-9_29
Abstract: We present a practical comparison of suffix array construction algorithms on modern hardware. The benchmark is conducted using our new benchmark framework SACABench, which allows for an easy deployment of publicly available implementations, simple ...
- Article, October 2019
Inducing the Lyndon Array
String Processing and Information Retrieval, Pages 138–151, https://doi.org/10.1007/978-3-030-32686-9_10
Abstract: In this paper we propose a variant of the induced suffix sorting algorithm by Nong (TOIS, 2013) that computes simultaneously the Lyndon array and the suffix array of a text in O(n) time using words of working space, where n is the length of the ...
- research-article, September 2019
Linking indexing data structures to de Bruijn graphs: Construction and update
Journal of Computer and System Sciences (JCSS), Volume 104, Issue C, Pages 165–183, https://doi.org/10.1016/j.jcss.2016.06.008
Abstract: DNA sequencing technologies have tremendously increased their throughput, and hence complicated DNA assembly. Numerous assembly programs use de Bruijn graphs (dBG) built from short reads to merge these into contigs, which represent ...
- article, April 2019
Fixed Block Compression Boosting in FM-Indexes: Theory and Practice
Algorithmica (ALGR), Volume 81, Issue 4, Pages 1370–1391, https://doi.org/10.1007/s00453-018-0475-9
The FM index (Ferragina and Manzini in J ACM 52(4):552–581, 2005) is a widely-used compressed data structure that stores a string T in a compressed form and also supports fast pattern matching queries. In this paper, we describe new FM-index variants ...
- article, July 2018
Fast induced sorting suffixes on a multicore machine
The Journal of Supercomputing (JSCO), Volume 74, Issue 7, Pages 3468–3485, https://doi.org/10.1007/s11227-018-2395-5
Sorting the suffixes of an input string is a fundamental task in many applications such as data compression, genome alignment, and full-text search. The induced sorting (IS) method has been successfully applied to design a number of state-of-the-art ...
- research-article, February 2017
Optimal suffix sorting and LCP array construction for constant alphabets
Information Processing Letters (IPRL), Volume 118, Issue C, Pages 30–34, https://doi.org/10.1016/j.ipl.2016.09.010
We show how the longest common prefix (LCP) array can be generated as a by-product of the suffix array construction algorithm SACA-K (Nong, 2013). Our algorithm builds on Fischer's proposal (Fischer, WADS'11), and also runs in linear time, but uses only ...
- research-article, December 2016
Binary block order Rouen Transform
- Jacqueline W. Daykin,
- Richard Groult,
- Yannick Guesnet,
- Thierry Lecroq,
- Arnaud Lefebvre,
- Martine Léonard,
- Élise Prieur-Gaston
Theoretical Computer Science (TCSC), Volume 656, Issue PB, Pages 118–134, https://doi.org/10.1016/j.tcs.2016.05.028
Novel twin binary Burrows-Wheeler type transforms are introduced. The transforms are defined for Lyndon-like B-words which apply binary block order. We call this approach the B-BWT Rouen Transform. These bijective Rouen Transforms and inverses are computed ...