Regret Analysis For Discounted Reinforcement Learning