A reinforcement learning machine learns to perform actions which will maximize rewards (or minimize punishments).