Optimality Generalization with KL Divergence for Generator