Issue |
RAIRO-Theor. Inf. Appl.
Volume 43, Number 2, April-June 2009
|
|
---|---|---|
Page(s) | 281 - 297 | |
DOI | https://doi.org/10.1051/ita:2008030 | |
Published online | 04 December 2008 |
Efficient validation and construction of border arrays and validation of string matching automata
University of Rouen, LITIS EA 4108,
Avenue de l'Université,
Technopôle du Madrillet,
76801 Saint-Étienne-du-Rouvray Cedex,
France; Jean-Pierre.Duval@univ_rouen.fr;
Thierry.Lecroq@univ_rouen.fr;
Arnaud.Lefebvre@univ_rouen.fr
Received:
4
December
2007
Accepted:
17
October
2008
We present an on-line linear time and space algorithm to check if an integer array f is the border array of at least one string w built on a bounded or unbounded size alphabet Σ. First of all, we show a bijection between the border array of a string w and the skeleton of the DFA recognizing Σ*ω, called a string matching automaton (SMA). Different strings can have the same border array but the originality of the presented method is that the correspondence between a border array and a skeleton of SMA is independent from the underlying strings. This enables to design algorithms for validating and generating border arrays that outperform existing ones. The validating algorithm lowers the delay (maximal number of comparisons on one element of the array) from O(|w|) to 1 + min{|Σ|,1 + log2|ω|} compared to existing algorithms. We then give results on the numbers of distinct border arrays depending on the alphabet size. We also present an algorithm that checks if a given directed unlabeled graph G is the skeleton of a SMA on an alphabet of size s in linear time. Along the process the algorithm can build one string w for which G is the SMA skeleton.
Mathematics Subject Classification: 68R15 / 68W05
Key words: Combinatorics on words / period / border / string matching / string matching automata
© EDP Sciences, 2008
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.