A commonly used distribution on the probability simplex is the Dirichlet distribution. In this paper we present the linear exponential family as an alternative. The distribution is known in the statistics community, but we present in this paper a numerically stable method to compute its parameters. Although the Dirichlet distribution is known to be a good Bayesian prior for probabilities we believe this paper shows that the linear exponential model offers a good alternative in other contexts, such as when we want to use posterior probabilities as features for automatic speech recognition. We show how to incorporate posterior probabilities as additional features to an existing GMM, and show that the resulting model gives a 3% relative gain on a broadcast news speech recognition system.
Bibliographic reference. Olsen, Peder / Goel, Vaibhava / Micchelli, Charles / Hershey, John R. (2010): "Modeling posterior probabilities using the linear exponential family", In INTERSPEECH-2010, 2994-2997.