#optimization

Questions tagged optimization

Explain how you would design a partition strategy for a large dataset in HDFS.

Spark/Big Data | hard

Explain how you would implement real-time analytics using a streaming platform like Kafka or Kinesis.

Spark/Big Data | hard

Explain how you would use Kafka Connect to ingest data from a relational database into Kafka while ensuring minimal latency and exactly-once semantics.

Spark/Big Data | hard

Explain job execution in Spark: stages, tasks, and the Catalyst Optimizer.

Spark/Big Data | hard

Explain read and write modes in Spark.

Spark/Big Data | hard

Explain repartition vs. coalesce. Which one would you use to reduce shuffle operations?

Spark/Big Data | hard

Explain the DAG in Spark and how it plays a role in execution.

Spark/Big Data | hard

Explain the Medallion architecture and its benefits in data engineering.

Spark/Big Data | hard



