Artificial Intelligence and Human Evolution by Ameet Joshi
Author:Ameet Joshi
Language: eng
Format: epub
ISBN: 9781484298077
Publisher: Apress
The camera can observe the motion of the arm. There is a central processing unit on the arm assembly, which we can call the brain of the machine. The brain is generating these actions as electric signals as well as receiving electric signals from the camera in the form of sensory information. This setup takes care of the first two steps outlined in the process of learning. The next steps need to be handled by the brain of the machine. First, we need to define a goal. Letâs say the goal is to move an object that is kept in front of the arm to the left by one foot. The brain of the system is able to sense the information from the camera and compare it with this goal and generate a label in the form of good, bad, or neutral. For example, if the arm moves the block in the right direction, it would generate a positive label, if it tries to move it in the opposite direction, it would generate a negative label, and if it does not move at all or moves in a perpendicular direction, it will generate a neutral label. It can even go further to generate a numeric value based on how good or bad the outcome is. Now, we have all the six steps in the online learning paradigm taken care of and we have a generic machine that is ready to start learning.
The brain can start producing random movements of the arm within its limitation and start capturing the labels based on the camera input. This process is also called exploration, where the system is just exploring the action space. If the same process continues for a long time, say a week, the brain will have a huge set of labeled actions recorded in its memory, but the operation of the arm would not improve at all. Now, in order to improve the operation of the arm and get closer to the desired goal of moving the object one foot to the left, we need to introduce the concept of exploitation. Our brain alternates between exploration and exploitation as it continues to record the labeled actions. Exploration means generating random actions without using any of the recorded memory, while exploitation means creating action that it knows would produce a good or positive label or feedback. As the machine starts for the first time, the memory is pretty much empty, and there is nothing to exploit. However, as it continues to explore more, the memory starts to get some actions associated with positive feedback or label. Then it can start to exploit these prerecorded actions and maybe try to make slight tweaks to them to see if the feedback gets better or worse. After such an operation continues for a sufficient amount of time, the arm would reach pretty close to the desired goal of moving the object by one foot to the left. As the machine gets more and more positive
Download
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.
Deep Learning with Python by François Chollet(14993)
The Mikado Method by Ola Ellnestam Daniel Brolund(12251)
Hello! Python by Anthony Briggs(12147)
OCA Java SE 8 Programmer I Certification Guide by Mala Gupta(11527)
Dependency Injection in .NET by Mark Seemann(11307)
A Developer's Guide to Building Resilient Cloud Applications with Azure by Hamida Rebai Trabelsi(10528)
Algorithms of the Intelligent Web by Haralambos Marmanis;Dmitry Babenko(10125)
The Well-Grounded Java Developer by Benjamin J. Evans Martijn Verburg(9776)
Grails in Action by Glen Smith Peter Ledbrook(9453)
Hit Refresh by Satya Nadella(9039)
Secrets of the JavaScript Ninja by John Resig Bear Bibeault(9001)
Sass and Compass in Action by Wynn Netherland Nathan Weizenbaum Chris Eppstein Brandon Mathis(9000)
The Kubernetes Operator Framework Book by Michael Dame(8473)
Test-Driven iOS Development with Swift 4 by Dominik Hauser(8423)
Exploring Deepfakes by Bryan Lyon and Matt Tora(8298)
Robo-Advisor with Python by Aki Ranin(8248)
Practical Computer Architecture with Python and ARM by Alan Clements(8224)
Implementing Enterprise Observability for Success by Manisha Agrawal and Karun Krishnannair(8193)
Building Low Latency Applications with C++ by Sourav Ghosh(8098)