In this paper we study the limiting dynamics of a sequential process that generalizes Pólya's urn. This process has been studied also in the context of language generation, discrete choice, repeat consumption, and models for the web graph. The process we study generates future items by copying from past items. It is parameterized by a sequence of weights describing how much to prefer copying from recent versus more distant locations. We show that, if the weight sequence follows a power law with exponent α ĝ [0, 1), then the sequences generated by the model tend toward a limiting behavior in which the eventual frequency of each token in the alphabet attains a limit. Moreover, in the case α > 2, we show that the sequence converges to a token being chosen infinitely often, and each other token being chosen only constantly many times.
Asymptotic Behavior of Sequence Models / Chierichetti, F.; Kumar, R.; Tomkins, A.. - (2020), pp. 2824-2830. (Intervento presentato al convegno 29th International World Wide Web Conference, WWW 2020 tenutosi a twn) [10.1145/3366423.3380044].
Asymptotic Behavior of Sequence Models
Chierichetti F.;
2020
Abstract
In this paper we study the limiting dynamics of a sequential process that generalizes Pólya's urn. This process has been studied also in the context of language generation, discrete choice, repeat consumption, and models for the web graph. The process we study generates future items by copying from past items. It is parameterized by a sequence of weights describing how much to prefer copying from recent versus more distant locations. We show that, if the weight sequence follows a power law with exponent α ĝ [0, 1), then the sequences generated by the model tend toward a limiting behavior in which the eventual frequency of each token in the alphabet attains a limit. Moreover, in the case α > 2, we show that the sequence converges to a token being chosen infinitely often, and each other token being chosen only constantly many times.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.