Meta Learning for Hyperparameter Optimization in Dialogue System

Jen-Tzung Chien, Wei Xiang Lieow


The performance of dialogue system based on deep reinforcement learning (DRL) highly depends on the selected hyperparameters in DRL algorithms. Traditionally, Gaussian process (GP) provides a probabilistic approach to Bayesian optimization for sequential search which is beneficial to select optimal hyperparameter. However, GP suffers from the expanding computation when the dimension of hyperparameters and the number of search points are increased. This paper presents a meta learning approach to carry out multifidelity Bayesian optimization where a two-level recurrent neural network (RNN) is developed for sequential learning and optimization. The search space is explored via the first-level RNN with cheap and low fidelity over a global region of hyperparameters. The optimization is then exploited and leveraged by the second-level RNN with a high fidelity on the successively small regions. The experiments on the hyperparameter optimization for dialogue system based on the deep Q network show the effectiveness and efficiency by using the proposed multifidelity Bayesian optimization.


 DOI: 10.21437/Interspeech.2019-1383

Cite as: Chien, J., Lieow, W.X. (2019) Meta Learning for Hyperparameter Optimization in Dialogue System. Proc. Interspeech 2019, 839-843, DOI: 10.21437/Interspeech.2019-1383.


@inproceedings{Chien2019,
  author={Jen-Tzung Chien and Wei Xiang Lieow},
  title={{Meta Learning for Hyperparameter Optimization in Dialogue System}},
  year=2019,
  booktitle={Proc. Interspeech 2019},
  pages={839--843},
  doi={10.21437/Interspeech.2019-1383},
  url={http://dx.doi.org/10.21437/Interspeech.2019-1383}
}