Towards Structural Risk Minimization for RL