smartcab-using-Reinforcement-Learning
In this project, a self-driving agent uses reinforcement learning (RL) in a simplified world built with Pygame to reach its destinations within the allotted time. It uses the Q-Learning algorithm, and the learning and exploration factors are tuned to find the configuration with which the agent reaches its destinations consistently, with positive results and without committing traffic violations (such as disobeying a traffic light or ignoring oncoming traffic at an intersection).
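
The role of the learning and exploration factors can be illustrated with a minimal tabular Q-Learning sketch. It is not the code from this repository: the `QLearner` class, the `ACTIONS` list, the state tuple, and the reward values are hypothetical names and numbers chosen for illustration. Here `alpha` is the learning factor, `epsilon` is the exploration factor, and `gamma` is an assumed discount factor.

```python
import random
from collections import defaultdict

# Hypothetical action set: stay idle (None) or move in one direction.
ACTIONS = [None, "forward", "left", "right"]


class QLearner:
    """Tabular Q-Learning with epsilon-greedy exploration (illustrative sketch)."""

    def __init__(self, alpha=0.5, epsilon=0.1, gamma=0.9, seed=None):
        self.alpha = alpha      # learning factor: how far each update moves Q
        self.epsilon = epsilon  # exploration factor: chance of a random action
        self.gamma = gamma      # discount factor (an assumption of this sketch)
        self.rng = random.Random(seed)
        # Unseen states start with a Q-value of 0.0 for every action.
        self.q = defaultdict(lambda: {a: 0.0 for a in ACTIONS})

    def choose_action(self, state):
        # Explore with probability epsilon, otherwise act greedily.
        if self.rng.random() < self.epsilon:
            return self.rng.choice(ACTIONS)
        values = self.q[state]
        best = max(values.values())
        # Break ties between equally good actions at random.
        return self.rng.choice([a for a, v in values.items() if v == best])

    def learn(self, state, action, reward, next_state):
        # Q(s, a) <- Q(s, a) + alpha * (r + gamma * max_a' Q(s', a') - Q(s, a))
        target = reward + self.gamma * max(self.q[next_state].values())
        self.q[state][action] += self.alpha * (target - self.q[state][action])
```

As a usage example under the same assumptions, a state could be a tuple such as `("red", None)` for the light colour and the oncoming traffic. Calling `learn(state, "forward", -10.0, state)` after a penalised move lowers the Q-value of `"forward"` in that state, so with `epsilon=0.0` the agent stops choosing it there.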