Spark for Python Developers by Amit Nandi
Author:Amit Nandi [Nandi, Amit]
Language: eng
Format: epub, pdf
Publisher: Packt Publishing
Published: 2015-12-23T23:00:00+00:00
Supervised and unsupervised learning
We delve more deeply here in to the traditional machine learning algorithms offered by Spark MLlib. We distinguish between supervised and unsupervised learning depending on whether the data is labeled. We distinguish between categorical or continuous depending on whether the data is discrete or continuous.
The following diagram explains the Spark MLlib supervised and unsupervised machine learning algorithms and preprocessing techniques:
The following supervised and unsupervised MLlib algorithms and preprocessing techniques are currently available in Spark:
Clustering: This is an unsupervised machine learning technique where the data is not labeled. The aim is to extract structure from the data:K-Means: This partitions the data in K distinct clusters
Gaussian Mixture: Clusters are assigned based on the maximum posterior probability of the component
Power Iteration Clustering (PIC): This groups vertices of a graph based on pairwise edge similarities
Latent Dirichlet Allocation (LDA): This is used to group collections of text documents into topics
Streaming K-Means: This means clusters dynamically streaming data using a windowing function on the incoming data
Download
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.
The Mikado Method by Ola Ellnestam Daniel Brolund(20603)
Hello! Python by Anthony Briggs(19899)
Secrets of the JavaScript Ninja by John Resig Bear Bibeault(18208)
Dependency Injection in .NET by Mark Seemann(18108)
The Well-Grounded Java Developer by Benjamin J. Evans Martijn Verburg(17575)
OCA Java SE 8 Programmer I Certification Guide by Mala Gupta(17422)
Kotlin in Action by Dmitry Jemerov(17185)
Adobe Camera Raw For Digital Photographers Only by Rob Sheppard(16930)
Algorithms of the Intelligent Web by Haralambos Marmanis;Dmitry Babenko(16235)
Grails in Action by Glen Smith Peter Ledbrook(15390)
Sass and Compass in Action by Wynn Netherland Nathan Weizenbaum Chris Eppstein Brandon Mathis(13266)
Secrets of the JavaScript Ninja by John Resig & Bear Bibeault(11381)
A Developer's Guide to Building Resilient Cloud Applications with Azure by Hamida Rebai Trabelsi(10579)
Test-Driven iOS Development with Swift 4 by Dominik Hauser(10393)
Jquery UI in Action : Master the concepts Of Jquery UI: A Step By Step Approach by ANMOL GOYAL(9387)
Hit Refresh by Satya Nadella(9083)
The Kubernetes Operator Framework Book by Michael Dame(8521)
Exploring Deepfakes by Bryan Lyon and Matt Tora(8348)
Robo-Advisor with Python by Aki Ranin(8294)