Data Engineering with Google Cloud Platform by Adi Wijaya

Data Engineering with Google Cloud Platform by Adi Wijaya

Author:Adi Wijaya
Language: eng
Format: epub
Publisher: packt Publishing Pvt Ltd
Published: 2022-02-18T00:00:00+00:00


Remember that in the ephemeral model, your cluster literally will be deleted. The data in the cluster machines will also be gone after each job. In the preceding case, using a permanent Dataproc approach is the only option.

If we decide to use the ephemeral cluster approach, then a question may come up – how do we sync the cluster creation and deletion with a job?

There are many ways; you can do it manually and automatically. One automatic option is using the Dataproc workflow template. Let's practice using it to understand the next section.



Download



Copyright Disclaimer:
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.