Real-Time Big Data Analytics by Sumit Gupta & Saxena Shilpi
Author:Sumit Gupta & Saxena Shilpi
Language: eng
Format: mobi, epub
Publisher: Packt Publishing
Published: 2016-02-25T22:00:00+00:00
The following diagram shows the high-level components and the master-worker view of Spark:
The preceding diagram depicts the various components involved in setting up the Spark cluster, and the same components are also responsible for the execution of the Spark job.
Although all the components are important, let's briefly discuss the cluster/resource manager, as it defines the deployment model and allocation of resources to our submitted jobs.
Spark enables and provides flexibility to choose our resource manager. As of Spark 1.5.1, the following are the resource managers or deployment models that are supported by Spark:
Apache Mesos: Apache Mesos (http://mesos.apache.org/) is a cluster manager that provides efficient resource isolation and sharing across distributed applications or frameworks. It can run Hadoop, MPI, Hypertable, Spark, and other frameworks on a dynamically shared pool of nodes. Apache Mesos and Spark are closely related to each other (but they are not the same). The story started way back in 2009 when Mesos was ready and there were talks going on about the ideas/frameworks that can be developed on top of Mesos, and that's exactly how Spark was born.
Download
Real-Time Big Data Analytics by Sumit Gupta & Saxena Shilpi.epub
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.
Algorithms of the Intelligent Web by Haralambos Marmanis;Dmitry Babenko(8309)
Azure Data and AI Architect Handbook by Olivier Mertens & Breght Van Baelen(6807)
Building Statistical Models in Python by Huy Hoang Nguyen & Paul N Adams & Stuart J Miller(6783)
Serverless Machine Learning with Amazon Redshift ML by Debu Panda & Phil Bates & Bhanu Pittampally & Sumeet Joshi(6670)
Data Wrangling on AWS by Navnit Shukla | Sankar M | Sam Palani(6456)
Driving Data Quality with Data Contracts by Andrew Jones(6400)
Machine Learning Model Serving Patterns and Best Practices by Md Johirul Islam(6158)
Learning SQL by Alan Beaulieu(6004)
Weapons of Math Destruction by Cathy O'Neil(5797)
Big Data Analysis with Python by Ivan Marin(5396)
Data Engineering with dbt by Roberto Zagni(4402)
Solidity Programming Essentials by Ritesh Modi(4050)
Time Series Analysis with Python Cookbook by Tarek A. Atwan(3909)
Pandas Cookbook by Theodore Petrou(3613)
Blockchain Basics by Daniel Drescher(3306)
Hands-On Machine Learning for Algorithmic Trading by Stefan Jansen(2914)
Feature Store for Machine Learning by Jayanth Kumar M J(2820)
Learn T-SQL Querying by Pam Lahoud & Pedro Lopes(2803)
Mastering Python for Finance by Unknown(2748)
