HDInsight Essentials, Second Edition (2015)

Author:2015
Language: eng
Format: epub
Publisher: Packt Publishing


The following screenshot shows you the preceding steps:

With the preceding steps, we have uploaded the six .csv files to Azure Blob storage using CloudXplorer.

Using Sqoop to move data from RDBMS to Data Lake

Sqoop enables us to transfer data between relational databases and Hadoop. You can import data into HDInsight from any relational database that has a JDBC driver, such as SQL Server, MySQL, Oracle, or Teradata.
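As a rough illustration of what such an import looks like, the following Python sketch assembles a basic `sqoop import` command line. The flags (`--connect`, `--username`, `--password-file`, `--table`, `--target-dir`) are standard Sqoop import arguments. The server name, database, table, and paths are hypothetical placeholders, not values from this book.

```python
import shlex


def build_sqoop_import(jdbc_url, username, password_file, table, target_dir):
    """Return the argument list for a basic `sqoop import` of one table."""
    return [
        "sqoop", "import",
        "--connect", jdbc_url,              # JDBC connection string of the source database
        "--username", username,
        "--password-file", password_file,   # file holding the password, kept off the command line
        "--table", table,                   # source table to import
        "--target-dir", target_dir,         # destination directory in the cluster's storage
    ]


# All values below are hypothetical and only illustrate the shape of the command.
cmd = build_sqoop_import(
    jdbc_url="jdbc:sqlserver://example-server:1433;databaseName=sales",
    username="sqoop_user",
    password_file="/user/sqoop/sqlserver.password",
    table="orders",
    target_dir="/data/raw/orders",
)

# Print a shell-safe version of the command for review before running it on the cluster.
print(shlex.join(cmd))
```

Building the command as a list rather than a single string keeps each argument separate, so values containing characters such as `;` in the JDBC URL are quoted correctly when the command is printed or passed to a process launcher.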

Key benefits

The major benefits of using Sqoop to move data are as follows:

It leverages RDBMS metadata to determine the column data types

It is simple to script and uses SQL

It can be used to handle change data capture by importing daily transactional data to HDInsight

It uses MapReduce for both import and export, which enables parallel and efficient data movement
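The last two benefits can be sketched together. The Python snippet below builds a `sqoop import` command that picks up only new rows (`--incremental append` with `--check-column` and `--last-value`) and spreads the work across several map tasks (`--split-by` and `--num-mappers`). These are standard Sqoop arguments, but the helper function, table, column names, and values are hypothetical. The book may handle daily loads differently, so treat this as one possible approach.

```python
import shlex


def build_incremental_import(jdbc_url, username, password_file, table,
                             target_dir, check_column, last_value,
                             split_by, num_mappers=4):
    """Return the argument list for an incremental, parallel `sqoop import`."""
    if num_mappers < 1:
        raise ValueError("num_mappers must be at least 1")
    return [
        "sqoop", "import",
        "--connect", jdbc_url,
        "--username", username,
        "--password-file", password_file,
        "--table", table,
        "--target-dir", target_dir,
        "--incremental", "append",           # import only rows added since the last run
        "--check-column", check_column,      # column Sqoop inspects to find new rows
        "--last-value", str(last_value),     # highest value seen by the previous import
        "--split-by", split_by,              # column used to divide rows among map tasks
        "--num-mappers", str(num_mappers),   # number of parallel map tasks
    ]


# Hypothetical daily load: fetch orders with order_id greater than 1000.
daily_cmd = build_incremental_import(
    jdbc_url="jdbc:sqlserver://example-server:1433;databaseName=sales",
    username="sqoop_user",
    password_file="/user/sqoop/sqlserver.password",
    table="orders",
    target_dir="/data/raw/orders",
    check_column="order_id",
    last_value=1000,
    split_by="order_id",
)

print(shlex.join(daily_cmd))
```

In a scheduled job, the value passed as `last_value` would come from the previous run, so each execution imports only the rows added since then.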
