Pertanyaan dan Jawaban Wawancara Paling Populer & Tes Online
Platform edukasi untuk persiapan wawancara, tes online, tutorial, dan latihan langsung

Bangun keterampilan dengan jalur belajar terfokus, tes simulasi, dan konten siap wawancara.

WithoutBook menghadirkan pertanyaan wawancara per subjek, tes latihan online, tutorial, dan panduan perbandingan dalam satu ruang belajar yang responsif.

Prepare Interview

Apache Spark Pertanyaan dan Jawaban Wawancara

Ques 6. Explain the concept of partitions in Apache Spark.

Partitions are basic units of parallelism in Spark. They represent the logical division of data across the nodes in a cluster, and each partition is processed independently.

Example:

val inputRDD = sc.parallelize(Seq(1, 2, 3, 4, 5), 2)

Apakah ini membantu? Add Comment View Comments
 

Ques 7. What is a Spark Executor and what role does it play in Spark applications?

A Spark Executor is a process responsible for executing tasks on a worker node. Executors are launched at the beginning of a Spark application and run tasks until the application completes or encounters an error.

Example:

spark-submit --master yarn --deploy-mode client --num-executors 3 mySparkApp.jar

Apakah ini membantu? Add Comment View Comments
 

Ques 8. How does Spark handle fault tolerance in RDDs?

Spark achieves fault tolerance through lineage information (DAG) and recomputing lost data from the original source. If a partition of an RDD is lost, Spark can recompute it using the lineage information.

Example:

val resilientRDD = originalRDD.filter(x => x > 0)

Apakah ini membantu? Add Comment View Comments
 

Ques 9. What is the Broadcast variable in Spark and when is it used?

A Broadcast variable is a read-only variable cached on each worker node. It is used to efficiently distribute large read-only data structures, such as lookup tables, to all tasks in a Spark job.

Example:

val broadcastVar = sc.broadcast(Array(1, 2, 3))

Apakah ini membantu? Add Comment View Comments
 

Ques 10. Explain the role of the Spark Driver in a Spark application.

The Spark Driver is the program that runs the main() function and creates the SparkContext. It coordinates the execution of tasks on the Spark Executors and collects results from them.

Example:

object MyApp {
  def main(args: Array[String]): Unit = {
    val sc = new SparkContext("local", "MyApp")
  }
}

Apakah ini membantu? Add Comment View Comments
 

Most helpful rated by users:

Hak Cipta © 2026, WithoutBook.