Practice Test 1 | Google Cloud Certified Professional Data Engineer | Dumps | Mock Test
You are building a data pipeline using Google Dataflow SDK. This pipeline is going to perform operations on data using conditional and for loops creating a branch pipeline.
Which of the following concepts should be used to achieve this?
A. ParDo
B. PCollection
C. Transform
D. Pipeline
Answer: C.
A transform represents a processing operation that transforms data. A transform takes one or more PCollections as input, performs an operation that you specify on each element in that collection, and produces one or more PCollections as output. A transform can perform nearly any kind of processing operation, including performing mathematical computations on data, converting data from one format to another, grouping data together, reading and writing data, filtering data to output only the elements you want, or combining data elements into single values.
Source(s):
Dataflow – Programming Model for Apache Beam: https://cloud.google.com/dataflow/docs/concepts/beam-programming-model
Comments are closed, but trackbacks and pingbacks are open.