The networks, represented as a police cruiser and a green Jeep in a video game world, were created in Unity3D. Shuchmann, in a response to the video on his YouTube page, said: Each neural network was built to use reinforcement learning to complete a specific task. In this case, the police car has to stop the Jeep from crossing a white line. If it succeeds it’s rewarded, if it fails it’s punished. The adverse is true for the Jeep. Your mind probably isn’t blown after watching that – the cars mostly sort of get lost and go in circles with the occasional brief spat of movement that makes sense. But this is an excellent representation of how reinforcement learning works. Machines, under the current AI paradigm, figure things out using brute force. Early in the video the cars just sit there motionless, later they’re almost playing the game. Given enough time, these cars would probably exhibit superhuman tactics as they continued to optimize their networks to achieve their goal. Some AI experts fear this kind of ‘open-ended’ learning situation could result in a doomsday scenario. But, for now, this is just a really cool way to gain insight into how machines ‘think’ using today’s machine learning systems. You can check out Shuchmann’s other videos here. Visit out our Artificial Intelligence section for more news and analysis from the world of AI.