Stay ahead with 100% free Google Cloud Certified Professional Data Engineer (Professional-Data-Engineer) practice questions
Your company is currently setting up data pipelines for its campaign. For all the Google Cloud Pub/Sub streaming data, one of the important business requirements is to be able to periodically identify the inputs and their timings during the campaign. Engineers have decided to use windowing and transformation in Google Cloud Dataflow for this purpose. However, when testing this feature, they find that the Cloud Dataflow job fails for all streaming inserts. What is the most likely cause of this problem?
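For context on what windowing means here (a minimal sketch, not tied to any specific answer choice, with illustrative window sizes and timestamps): Dataflow groups unbounded streaming elements into windows before aggregating them. Conceptually, fixed windowing assigns each element's timestamp to a bucket like this:

```python
# Minimal sketch of fixed-window assignment, as used conceptually by
# Dataflow/Apache Beam when windowing streaming Pub/Sub data.
# The 60-second window size and the timestamps are illustrative.

def fixed_window_start(timestamp_s: int, window_size_s: int) -> int:
    """Return the start of the fixed window containing the timestamp."""
    return timestamp_s - (timestamp_s % window_size_s)

# Group some example event timestamps into 60-second windows.
events = [3, 59, 60, 125, 180]
windows = {}
for ts in events:
    windows.setdefault(fixed_window_start(ts, 60), []).append(ts)

print(windows)  # {0: [3, 59], 60: [60], 120: [125], 180: [180]}
```

Without a non-global window (or triggers) applied, an unbounded stream has no point at which aggregations can complete, which is the kind of configuration issue this question probes.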
You have a BigQuery dataset named "customers". All tables will be tagged by using a Data Catalog tag template named "gdpr". The template contains one mandatory field, "has sensitive data", with a boolean value. All employees must be able to do a simple search and find tables in the dataset that have either true or false in the "has sensitive data" field. However, only the Human Resources (HR) group should be able to see the data inside the tables for which "has sensitive data" is true. You give the all-employees group the bigquery.metadataViewer and bigquery.connectionUser roles on the dataset. You want to minimize configuration overhead. What should you do next?
You are developing a data pipeline that will run several data transformation programs on Compute Engine virtual machines. You do not want to use your own credentials for authenticating and authorizing these programs, and you want to follow Google Cloud recommended practices. How would you authenticate and authorize the data transformation programs?
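For background (a sketch under assumptions, not a statement of the exam's answer): the recommended practice is to attach a dedicated service account to the VM, so programs obtain short-lived credentials from the metadata server rather than using a user's credentials. The endpoint path and required header below follow the documented metadata-server conventions; the sketch only constructs the request and makes no network call:

```python
# Sketch: how a program on a Compute Engine VM would request an access
# token for the VM's attached service account from the metadata server.
# This builds the request object only; it does not actually contact the server.
import urllib.request

TOKEN_URL = (
    "http://metadata.google.internal/computeMetadata/v1/"
    "instance/service-accounts/default/token"
)

def build_token_request() -> urllib.request.Request:
    # The "Metadata-Flavor: Google" header is required by the metadata server
    # to reject accidental or spoofed requests.
    return urllib.request.Request(
        TOKEN_URL, headers={"Metadata-Flavor": "Google"}
    )

req = build_token_request()
print(req.full_url)
```

In practice, client libraries (Application Default Credentials) perform this lookup automatically when no explicit credentials are supplied.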
You want to schedule a number of sequential load and transformation jobs. Data files will be added to a Cloud Storage bucket by an upstream process; there is no fixed schedule for when the new data arrives. Next, a Dataproc job is triggered to perform some transformations and write the data to BigQuery. You then need to run additional transformation jobs in BigQuery. The transformation jobs are different for every table, and these jobs might take hours to complete. You need to determine the most efficient and maintainable workflow to process hundreds of tables and provide the freshest data to your end users. What should you do?
How would you query specific partitions in a BigQuery table?
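One common approach (table and date values below are hypothetical): filter on the table's partitioning column, or on the `_PARTITIONDATE`/`_PARTITIONTIME` pseudo-columns for ingestion-time partitioned tables, so BigQuery prunes the partitions it scans. A small sketch that builds such a query string:

```python
# Sketch: building a query that restricts a scan to specific partitions of
# an ingestion-time partitioned BigQuery table. Table and dates are
# hypothetical placeholders.

def partition_query(table: str, start_date: str, end_date: str) -> str:
    # _PARTITIONDATE is BigQuery's pseudo-column on ingestion-time
    # partitioned tables; filtering on it limits which partitions are read.
    return (
        f"SELECT * FROM `{table}` "
        f"WHERE _PARTITIONDATE BETWEEN '{start_date}' AND '{end_date}'"
    )

sql = partition_query("my_project.my_dataset.events", "2024-01-01", "2024-01-07")
print(sql)
```

For column-partitioned tables, the same idea applies with the partitioning column (e.g. a `DATE` column) in place of the pseudo-column.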
© Copyrights TheExamsLab 2025. All Rights Reserved