Hadoop Interview questions


Total available count: 27
Subject - Apache
Subsubject - Hadoop

What is wide Transformations?

Wide transformations are the result of groupByKey and reduceByKey. The data required to compute the records in a single partition may reside in many partitions of the parent RDD(Resilient Distributed Datasets). All of the tuples with the same key must end up in the same partition, processed by the same task. To satisfy these operations, Spark must execute RDD(Resilient Distributed Datasets shuffle/shamble, which transfers data across clusters and results in a new stage with a new set of partitions. 




Next 5 interview question(s)

1
What is Narrow Transformations?
2
How many type of transformations exist?
3
What is Preferred Locations?
4
How do you define actions?
5
What is the transformation?