Building Machine Learning Powered Applications by Emmanuel Ameisen
Author:Emmanuel Ameisen
Language: eng
Format: epub
Publisher: O'Reilly Media
Published: 2019-06-15T16:00:00+00:00
Dimensionality Reduction for Errors
We described vectorization and dimensionality reduction techniques for data exploration in “Vectorizing” and “Dimensionality reduction”. Let’s see how the same techniques can be used to make error analysis more efficient.
When we first covered how to use dimensionality reduction methods to visualize data, we colored each point in a dataset by its class to observe the topology of labels. When analyzing model errors, we can use different color schemes to identify errors.
To identify error trends, color each data point by whether a model’s prediction was correct or not. This will allow you to identify types of similar data points a model performs poorly on. Once you identify a region in which a model performs poorly, visualize a few data points in it. Visualizing hard examples is a great way to generate features represented in these examples to help a model fit them better.
To help surface trends in hard examples, you can also use the clustering methods from “Clustering”. After clustering data, measure model performance on each cluster and identify clusters where the model performs worst. Inspect data points in these clusters to help you generate more features.
Dimensionality reduction techniques are one way of surfacing challenging examples. To do so, we can also directly use a model’s confidence score.
Download
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.
Computer Vision & Pattern Recognition | Expert Systems |
Intelligence & Semantics | Machine Theory |
Natural Language Processing | Neural Networks |
Algorithms of the Intelligent Web by Haralambos Marmanis;Dmitry Babenko(8301)
Test-Driven Development with Java by Alan Mellor(6728)
Data Augmentation with Python by Duc Haba(6641)
Principles of Data Fabric by Sonia Mezzetta(6392)
Learn Blender Simulations the Right Way by Stephen Pearson(6292)
Microservices with Spring Boot 3 and Spring Cloud by Magnus Larsson(6165)
Hadoop in Practice by Alex Holmes(5958)
Jquery UI in Action : Master the concepts Of Jquery UI: A Step By Step Approach by ANMOL GOYAL(5807)
RPA Solution Architect's Handbook by Sachin Sahgal(5561)
Big Data Analysis with Python by Ivan Marin(5367)
The Infinite Retina by Robert Scoble Irena Cronin(5252)
Life 3.0: Being Human in the Age of Artificial Intelligence by Tegmark Max(5147)
Pretrain Vision and Large Language Models in Python by Emily Webber(4330)
Infrastructure as Code for Beginners by Russ McKendrick(4091)
Functional Programming in JavaScript by Mantyla Dan(4038)
The Age of Surveillance Capitalism by Shoshana Zuboff(3955)
WordPress Plugin Development Cookbook by Yannick Lefebvre(3805)
Embracing Microservices Design by Ovais Mehboob Ahmed Khan Nabil Siddiqui and Timothy Oleson(3609)
Applied Machine Learning for Healthcare and Life Sciences Using AWS by Ujjwal Ratan(3580)
