makeporngreatagain.pro
yeahporn.top
hd xxx

Practice Test 1 | Google Cloud Certified Professional Data Engineer | Dumps | Mock Test

4,795

Data analysts are switching to use Apache Spark to perform experiments on the data before applying the changes to production. Those experiments are not critical, but they will be conducted on big data sets. As a data engineer, the head of data asked you to prepare the tech stack required to be used by data analysts to run their Spark scripts and experiment on with taking into consideration the cost of the stack used.

Which of the following tech stack is suggested?

A. Launch a Dataproc cluster in high-availability mode with using high-memory worker machine types.
B. Launch a Dataproc cluster in standard mode with using high-CPU worker machine types.
C. Launch a Dataproc cluster in standard mode with using high-memory worker machine types.
D. Advice the data analysts to use Dataprep for their data manipulation.

Answer: C.

Answer C is correct: The data sets are big in size and hence high memory machine is the choice.

Answer A is incorrect: Since the scenario states non-critical experiments will be conducted by data analysts, Dataproc cluster used can be in standard mode.

Answer B is incorrect: Since the scenario states non-critical experiments, there is no need for high-CPU worker machine types.

Answer D is incorrect: Dataprep does not provide Apache Spark job transformation. Dataprep is best for visual exploration and manual cleaning and preparation of data for analysis and machine learning.

Source(s):

Cloud Dataprep: https://cloud.google.com/dataproc

Comments are closed, but trackbacks and pingbacks are open.

baseofporn.com https://www.opoptube.com
Ads Blocker Image Powered by Code Help Pro

Ads Blocker Detected!!!

We have detected that you are using extensions to block ads. Please support us by disabling these ads blocker.