Databricks Quiz — Advanced Performance & Optimization

1. Which operation in Spark triggers a shuffle and can impact performance if not optimized?

2. Which Delta Lake feature allows concurrent reads and writes without conflicts?

3. What is the primary benefit of Databricks Auto Scaling for clusters?

4. In Spark Structured Streaming, what is a common method to achieve exactly-once semantics?

5. Which of the following improves performance when writing large Delta Lake tables?

6. Which Databricks tool helps you manage ML model training, experiment tracking, and deployment?

7. Which of the following is a recommended practice for ETL pipelines in Databricks?

8. What is the advantage of Delta Lake Z-Ordering?

9. Which type of join is generally more efficient in Spark for small lookup tables?

10. In Databricks, which practice is recommended for large streaming workloads?