The Prague Stringology Conference 2008

Kazuhiko Kusano, Wataru Matsubara, Akira Ishino and Ayumi Shinohara

Average Value of Sum of Exponents of Runs in Strings

Abstract:
A substring w[i..j] in w is called a repetition of period p if s[k] = s[k+p] for any i ≤ k ≤ j - p. Especially, a maximal repetition, which cannot be extended neither to left nor to right, is called a run. The ratio of the length of the run to its period, i.e. (j - i+1)/p, is called an exponent. The sum of exponents of runs in a string is of interest. The maximal value of the sum is still unknown, and the current upper bound is 2.9n given by Crochemore and Ilie, where n is the length of a string. In this paper we show a closed formula which exactly expresses the average value of it for any n and any alphabet size, and the limit of this value per unit length as n approaches infinity. For binary strings, the limit value is approximately 1.13103.

Download paper: Article in PostScript Article in PDF BibTeX Reference
 PostScript   PDF   BibTeX reference 
Download presentation: Presentation