Microsoft DP-203 Quiz 2 Topic 3 Questions 6-10

Question: 1

You implement an enterprise data warehouse in Azure Synapse Analytics.

You have a large fact table that is 10 terabytes (TB) in size.

Incoming queries use the primary key SaleKey column to retrieve data as displayed in the following table:

q1_DP-203

You need to distribute the large fact table across multiple nodes to optimize performance of the table.

Which technology should you use?

Ahash distributed table with clustered index

Bhash distributed table with clustered Columnstore index

Cround robin distributed table with clustered index

Dround robin distributed table with clustered Columnstore index

Eheap table with distribution replicate

Show Answer

Question: 2

Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.

After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.

You have an Azure Data Lake Storage account that contains a staging zone.

You need to design a daily process to ingest incremental data from the staging zone, transform the data by executing an R script, and then insert the transformed data into a data warehouse in Azure Synapse Analytics.

Solution: You schedule an Azure Databricks job that executes an R notebook, and then inserts the data into the data warehouse.

Does this meet the goal?

AYes

BNo

Show Answer

Question: 3

You have a Microsoft Purview account. The Lineage view of a CSV file is shown in the following exhibit.

q3_DP-203

How is the data for the lineage populated?

Amanually

Bby scanning data stores

Cby executing a Data Factory pipeline

Show Answer

Question: 4

You are designing a fact table named FactPurchase in an Azure Synaps Analytics dedicated SQL pool. The table contains purchases from suppliers for a retail store. FactPurchase will contain the following columns.

q4_DP-203

FactPurchase will have 1 million rows of data added daily and will contain three years of data.

Transact-SQL queries similar to the following query will be executed daily.

q4_DP-203

Around-robin

Breplicated

Chash-distributed on DateKey

Dhash-distributed on PurchaseKey

Show Answer

Question: 5

You are designing an inventory updates table in an Azure Synapse Analytics dedicated SQL pool. The table will have a clustered columnstore index and will include the following columns:

* EventDate: 1 million per day

* EventTypelD: 10 million per event type

* WarehouselD: 100 million per warehouse

* ProductCategoryTypeiD: 25 million per product category type

You identify the following usage patterns:

Analyst will most commonly analyze transactions for a warehouse.

Queries will summarize by product category type, date, and/or inventory event type.

You need to recommend a partition strategy for the table to minimize query times.

On which column should you recommend partitioning the table?

AProductCategoryTypeID

BEventDate

CWarehouseID

DEventTypeID

Show Answer

Microsoft DP-203 Quiz:2 Topic:3 Questions:6-10