Data Science with Java by Michael R. Brzustowicz PhD

Author:Michael R. Brzustowicz, PhD
Language: eng
Format: epub
Publisher: O'Reilly Media
Published: 2017-06-15T04:00:00+00:00


Unit normal scaling

Unit normal scaling, also known as the z-score, rescales every data point in a column by subtracting the column mean and dividing by the column standard deviation: z = (value - mean) / std. Each column then has a mean of zero and a standard deviation of one. If the data are roughly normally distributed, most of the scaled values will have a magnitude smaller than 1, although this is not guaranteed because the scaled values are unbounded.

This can be implemented as follows:

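    @Override
    public double visit(int row, int column, double value) {
        // mss supplies the per-column statistics, indexed by column
        double mean = mss.getMean()[column];
        double std = mss.getStandardDeviation()[column];
        // center about the mean, then scale by the standard deviation
        return (value - mean) / std;
    }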
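
The visitor above relies on an `mss` object that is defined elsewhere. As a rough, dependency-free sketch of the same calculation, the following computes the column statistics directly on a plain `double[][]`. The class name `ZScoreSketch` and the method `scale` are hypothetical and not from the book. Two assumptions are made here: the sample (n - 1) standard deviation is used, and a column with zero spread is mapped to 0.0 instead of dividing by zero.

```java
import java.util.Arrays;

// Hypothetical helper, not from the book: z-scores each column of a matrix.
final class ZScoreSketch {

    // Returns a new matrix in which every column has mean 0 and,
    // when the column is not constant, sample standard deviation 1.
    static double[][] scale(double[][] data) {
        int rows = data.length;
        int cols = rows == 0 ? 0 : data[0].length;
        double[] mean = new double[cols];
        double[] std = new double[cols];

        // First pass: column means.
        for (double[] row : data) {
            for (int j = 0; j < cols; j++) {
                mean[j] += row[j] / rows;
            }
        }

        // Second pass: sample standard deviations (assumes the n - 1 divisor).
        for (double[] row : data) {
            for (int j = 0; j < cols; j++) {
                double d = row[j] - mean[j];
                std[j] += d * d;
            }
        }
        for (int j = 0; j < cols; j++) {
            std[j] = rows > 1 ? Math.sqrt(std[j] / (rows - 1)) : 0.0;
        }

        // Third pass: center and scale each value.
        double[][] scaled = new double[rows][cols];
        for (int i = 0; i < rows; i++) {
            for (int j = 0; j < cols; j++) {
                // Assumption: a constant column (std == 0) is mapped to 0.0.
                scaled[i][j] = std[j] == 0.0 ? 0.0 : (data[i][j] - mean[j]) / std[j];
            }
        }
        return scaled;
    }

    public static void main(String[] args) {
        double[][] data = {{1.0, 10.0}, {2.0, 20.0}, {3.0, 30.0}};
        for (double[] row : scale(data)) {
            System.out.println(Arrays.toString(row));
        }
    }
}
```

For this small input both columns scale to -1.0, 0.0, and 1.0, which shows that columns on very different scales end up directly comparable. The explicit zero check is a design choice: the visitor as written would return NaN or an infinity for a constant column.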


