Databricks Certified Data Engineer Associate | Exam Preparation- Part 4 & 5

Databricks Certified Data Engineer Associate | Exam Preparation- Part 4 & 5

37.937 Lượt nghe
Databricks Certified Data Engineer Associate | Exam Preparation- Part 4 & 5
Udemy Practice Exam: Databricks Certified Data Engineer Associate Course Link: https://www.udemy.com/course/practice-exam-databricks-certified-data-engineer-associate/?referralCode=2A37198988726F6FB42E ☕ Buy me a coffee: https://buymeacoffee.com/navalyemul Follow me on LinkedIn: https://www.linkedin.com/in/naval-yemul-a5803523/ This video helps you to crack the Databricks Certified Data Engineer Associate Exam V2/ V3. These are all real questions. Databricks Certification and Badging- Databricks https://www.databricks.com/learn/certification Databricks Certifications and Badging https://www.youtube.com/watch?v=Cz6vfGF4FBE&list=PL7S7dD8r4QdVzOYRzIG2UJdCaCasqBv1F&index=7&t=89s All About Delta Lake | Lakehouse https://www.youtube.com/watch?v=agtUI25LxuA&list=PL7S7dD8r4QdVzOYRzIG2UJdCaCasqBv1F&index=38 Internals of Delta Lake | Databricks | Lakehouse https://www.youtube.com/watch?v=DIY8M-rqprc&list=PL7S7dD8r4QdVzOYRzIG2UJdCaCasqBv1F&index=41 Link For Databricks Playlist: https://www.youtube.com/playlistlist=PL7S7dD8r4QdVzOYRzIG2UJdCaCasqBv1F #lakehouse #databricks #azuredatabricks #dataengineering #certification #azure #learnazuredatabricks #azuredatabrickscourse #azuredatabricksforbeginners Link for Azure Data Factory (ADF) Playlist: https://www.youtube.com/playlist?list=PL7S7dD8r4QdUwL115KJ1dYanJVzoYZuS7 Link for Databricks: https://www.youtube.com/playlist?list=PL7S7dD8r4QdVzOYRzIG2UJdCaCasqBv1F Link for SQL Playlist: https://www.youtube.com/playlist?list=PL7S7dD8r4QdXJpTDRf2poxNElICrEOIeJ Link for Power BI Playlist: https://www.youtube.com/playlist?list=PL7S7dD8r4QdXaSEobGb50YiXixE57csJ9 Link for Python Playlist: https://www.youtube.com/playlist?list=PL7S7dD8r4QdWJRrp8wnI5ZhjUP0c8Mi0o Link for Azure Cloud Playlist: https://www.youtube.com/playlist?list=PL7S7dD8r4QdUr7txKDXL2shU-Qu392qqS Link for Big Data: PySpark: https://www.youtube.com/playlist?list=PL7S7dD8r4QdWUXL4zOC6NYnxDD8OmIVLr 1:30 1. A data engineer has three tables in a Delta Live Tables (DLT) pipeline. They have configured the pipeline to drop invalid records at each table. They notice that some data is being dropped due to quality concerns at some point in the DLT pipeline. They would like to determine at which table in their pipeline the data is being dropped. Which of the following approaches can the data engineer take to identify the table that is dropping the records? 5:43 2. A data engineer has a single-task Job that runs each morning before they begin working. After identifying an upstream data issue, they need to set up another task to run a new notebook prior to the original task. Which of the following approaches can the data engineer use to set up the new task? 10:02 3. An engineering manager wants to monitor the performance of a recent project using a Databricks SQL query. For the first week following the project’s release, the manager wants the query results to be updated every minute. However, the manager is concerned that the computing resources used for the query will be left running and cost the organization a lot of money beyond the first week of the project’s release. 15:22 4. A data analysis team has noticed that their Databricks SQL queries are running too slowly when connected to their always-on SQL endpoint. They claim that this issue is present when many members of the team are running small queries simultaneously. 20:16 5. A data engineer wants to schedule their Databricks SQL dashboard to refresh once per day, but they only want the associated SQL endpoint to be running when it is necessary. Which of the following approaches can the data engineer use to minimize the total running time of the SQL endpoint used in the refresh schedule of their dashboard? 22:13 6. A data engineer wants to schedule their Databricks SQL dashboard to refresh once per day, but they only want the associated SQL endpoint to be running when it is necessary. 25:39 7. A single Job runs two notebooks as two separate tasks. A data engineer has noticed that one of the notebooks is running slowly in the Job’s current run. The data engineer asks a tech lead for help in identifying why this might be the case. 28:24 2. A data engineer has a Job with multiple tasks that run nightly. Each of the tasks runs slowly because the clusters take a long time to start. Which of the following actions can the data engineer perform to improve the start-up time for the clusters used for the Job? Data Governance 33:27 1. Which part of the Databricks Platform can a data engineer use to revoke permissions from users on tables? 35:03 2. A new data engineering team. has been assigned to an ELT project. The new data engineering team will need full privileges on the database customers to fully manage the project. 37:03 3. A new data engineering team has been assigned to work on a project. The team will need access to database customers in order to see what tables already exist. The team has its own group team.