Mastering Spark with R by Javier Luraschi

Mastering Spark with R by Javier Luraschi

Author:Javier Luraschi
Language: eng
Format: epub
Publisher: O'Reilly Media
Published: 2019-06-23T16:00:00+00:00


Figure 8-1. Spark processing raw data from a data lakes, databases, and data warehouses

To support a broad variety of data sources, Spark needs to be able to read and write data in several different file formats (CSV, JSON, Parquet, and others), and access them while stored in several file systems (HDFS, S3, DBFS, and more) and, potentially, interoperate with other storage systems (databases, data warehouses, etc.). We will get to all of that, but first, we will start by presenting how to read, write, and copy data using Spark.



Download



Copyright Disclaimer:
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.