The Artificial Intelligence Infrastructure Workshop by Chinmay Arankalle, Gareth Dwyer, Bas Geerdink, Kunal Gera, Kevin Liao, and Anand N.S.

Author: Chinmay Arankalle, Gareth Dwyer, Bas Geerdink, Kunal Gera, Kevin Liao, and Anand N.S.
Language: eng
Format: epub
Publisher: Packt Publishing Pvt. Ltd.
Published: 2020-08-14T00:00:00+00:00


Apache Spark and Databricks

Databricks provides the most popular integrated platform for learning and using Apache Spark. It offers five times the performance of vanilla Apache Spark on the cloud, together with integrated Jupyter notebooks, in a secure cloud-enabled platform. The core team that developed Apache Spark while at Berkeley is part of Databricks. We will get into the details of the core operations of Spark, namely transformations (which lazily describe a new dataset) and actions (which trigger the computation and return a result). We will write the code for these in the integrated Jupyter notebooks in Databricks. Databricks enables us to spin up Spark clusters on the cloud and connect to them from those notebooks. So, in the next section, let's set up the Databricks environment and learn to create and use a Jupyter notebook.
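Before moving to a real cluster, the difference between transformations and actions can be sketched in plain Python. This is only an illustration under stated assumptions: the `LazyDataset` class is hypothetical and is not part of Spark or Databricks. Its method names mirror the RDD operations `map`, `filter`, `collect`, and `count`, but nothing here is distributed or runs on a cluster.

```python
from typing import Callable, Iterable, List


class LazyDataset:
    """Hypothetical stand-in for a Spark RDD; not part of any Spark API."""

    def __init__(self, source: Callable[[], Iterable]):
        # Store a recipe for producing the data rather than the data itself.
        self._source = source

    # Transformations: return a new LazyDataset and do no work yet.
    def map(self, fn: Callable) -> "LazyDataset":
        return LazyDataset(lambda: (fn(x) for x in self._source()))

    def filter(self, pred: Callable) -> "LazyDataset":
        return LazyDataset(lambda: (x for x in self._source() if pred(x)))

    # Actions: run the whole chain and return a concrete result.
    def collect(self) -> List:
        return list(self._source())

    def count(self) -> int:
        return sum(1 for _ in self._source())


calls = []  # records every element the map function actually touches


def square(x):
    calls.append(x)
    return x * x


numbers = LazyDataset(lambda: range(1, 6))
evens_squared = numbers.map(square).filter(lambda x: x % 2 == 0)

calls_before_action = len(calls)  # nothing has been computed at this point
result = evens_squared.collect()  # the action triggers the computation
```

Chaining `map` and `filter` only builds up a description of the work. `square` is not called until `collect` runs, which is the same laziness that lets Spark plan a whole chain of transformations before executing it.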


