https://doi.org/10.1140/epjb/e2009-00212-6
Finite size correction for fixed word length Zipf analysis
1 Department of Physics, Zanjan University, P.O. Box 45196, Zanjan, Iran
2 Sufi Institute, P.O. Box 45195, Zanjan, Iran
Corresponding author: darooneh@znu.ac.ir
Received: 14 October 2008
Revised: 4 May 2009
Published online: 23 June 2009
Zipf's original law deals with the statistics of ranked words in natural languages. It has recently been generalized to “words” defined as n-tuples of symbols derived by translating real-valued univariate time series into a literal sequence. We verify that the rank-frequency plot of these words shows, for fractional Brownian motion, the previously found power laws, but with large finite-length corrections. We verify a finite-size scaling ansatz for these corrections and, as a result, demonstrate greatly improved estimates of the (generalized) Zipf exponents. This allows us to find the correct relation between the Zipf exponent and the Hurst exponent characterizing the fractional Brownian motion.
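As an illustration of the fixed-word-length construction described above, the following minimal Python sketch turns a time series into a literal sequence and ranks its n-symbol words by frequency. The abstract does not specify the translation rule, so the two-letter up/down alphabet, the overlapping sliding window, and the names `symbolize`, `rank_frequency` and `random_walk` are assumptions made only for this example. For simplicity the input is an ordinary random walk (Hurst exponent 0.5) rather than a general fractional Brownian motion, which would need a dedicated generator.

```python
import random
from collections import Counter


def symbolize(series):
    """Assumed translation rule: 'u' if the series rises, 'd' otherwise."""
    return "".join("u" if b > a else "d" for a, b in zip(series, series[1:]))


def rank_frequency(text, n):
    """Relative frequencies of overlapping n-symbol words, most frequent first."""
    words = [text[i:i + n] for i in range(len(text) - n + 1)]
    total = len(words)
    return [(word, count / total) for word, count in Counter(words).most_common()]


def random_walk(length, seed=0):
    """Ordinary Brownian-like walk built from independent Gaussian steps."""
    rng = random.Random(seed)
    position, walk = 0.0, [0.0]
    for _ in range(length):
        position += rng.gauss(0.0, 1.0)
        walk.append(position)
    return walk


walk = random_walk(10_000)
text = symbolize(walk)
table = rank_frequency(text, n=4)

# Print rank, word, and relative frequency for the most frequent words.
for rank, (word, freq) in enumerate(table[:5], start=1):
    print(rank, word, round(freq, 4))
```

Because the increments of this walk are independent, all words of a given length are roughly equally likely and the rank-frequency curve is nearly flat. A correlated series, such as fractional Brownian motion with Hurst exponent different from 0.5, is the case where the power-law behaviour and its finite-length corrections become relevant.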
PACS: 05.45.Tp – Time series analysis / 05.40.Jc – Brownian motion / 05.10.-a – Computational methods in statistical physics and nonlinear dynamics
© EDP Sciences, Società Italiana di Fisica, Springer-Verlag, 2009