Deep Learning with PyTorch by Eli Stevens Luca Antiga & Thomas Viehmann
Author:Eli Stevens, Luca Antiga & Thomas Viehmann [Eli Stevens, Luca Antiga & Viehmann, Thomas]
Language: eng
Format: epub
Publisher: Manning Publications Co.
Published: 0101-01-01T00:00:00+00:00
Figure 9.5 A CT scan with approximately 1,000 structures that look like tumors to the untrained eye. Exactly one has been identified as a nodule when reviewed by a human specialist. The rest are normal anatomical structures like blood vessels, lesions, and other non-problematic lumps.
You might have seen elsewhere that end-to-end approaches for detection and classi-fication of objects are very successful in general vision tasks. TorchVision includes end-to-end models like Fast R-CNN/Mask R-CNN, but these are typically trained on hundreds of thousands of images, and those datasets aren’t constrained by the number of samples from rare classes. The project architecture we will use has the benefit of working well with a more modest amount of data. So while it’s certainly theoretically possible to just throw an arbitrarily large amount of data at a neural network until it learns the specifics of the proverbial lost needle, as well as how to ignore the hay, it’s going to be practically prohibitive to collect enough data and wait for a long enough time to train the network properly. That won’t be the best approach since the results are poor, and most readers won’t have access to the compute resources to pull it off at all.
To come up with the best solution, we could investigate proven model designs that can better integrate data in an end-to-end manner.3 These complicated designs are capable of producing high-quality results, but they’re not the best because understanding the design decisions behind them requires having mastered fundamental concepts first. That makes these advanced models poor candidates to use while teaching those same fundamentals!
That’s not to say that our multistep design is the best approach, either, but that’s because “best” is only relative to the criteria we chose to evaluate approaches. There are many “best” approaches, just as there are many goals we could have in mind as we work on a project. Our self-contained, multistep approach has some disadvantages as well.
Recall the GAN game from chapter 2. There, we had two networks cooperating to produce convincing forgeries of old master artists. The artist would produce a candidate work, and the scholar would critique it, giving the artist feedback on how to improve. Put in technical terms, the structure of the model allowed gradients to backpropagate from the final classifier (fake or real) to the earliest parts of the project (the artist).
Our approach for solving the problem won’t use end-to-end gradient backpropagation to directly optimize for our end goal. Instead, we’ll optimize discrete chunks of the problem individually, since our segmentation model and classification model won’t be trained in tandem with each other. That might limit the top-end effectiveness of our solution, but we feel that this will make for a much better learning experience.
We feel that being able to focus on a single step at a time allows us to zoom in and concentrate on the smaller number of new skills we’re learning. Each of our two models will be focused on performing exactly one task. Similar to a
Download
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.
Computer Vision & Pattern Recognition | Expert Systems |
Intelligence & Semantics | Machine Theory |
Natural Language Processing | Neural Networks |
Algorithms of the Intelligent Web by Haralambos Marmanis;Dmitry Babenko(7851)
Hadoop in Practice by Alex Holmes(5660)
Jquery UI in Action : Master the concepts Of Jquery UI: A Step By Step Approach by ANMOL GOYAL(5512)
Life 3.0: Being Human in the Age of Artificial Intelligence by Tegmark Max(4507)
Functional Programming in JavaScript by Mantyla Dan(3723)
The Age of Surveillance Capitalism by Shoshana Zuboff(3422)
Big Data Analysis with Python by Ivan Marin(3006)
Blockchain Basics by Daniel Drescher(2890)
The Rosie Effect by Graeme Simsion(2708)
WordPress Plugin Development Cookbook by Yannick Lefebvre(2602)
Hands-On Machine Learning for Algorithmic Trading by Stefan Jansen(2517)
Applied Predictive Modeling by Max Kuhn & Kjell Johnson(2478)
Dawn of the New Everything by Jaron Lanier(2438)
Test-Driven Development with Java by Alan Mellor(2371)
The Art Of Deception by Kevin Mitnick(2297)
Data Augmentation with Python by Duc Haba(2218)
Rapid Viz: A New Method for the Rapid Visualization of Ideas by Kurt Hanks & Larry Belliston(2195)
Human Dynamics Research in Smart and Connected Communities by Shih-Lung Shaw & Daniel Sui(2178)
The Infinite Retina by Robert Scoble Irena Cronin(2172)