Hadoop Interview questions

Total available count: 27
Subject - Apache
Subsubject - Hadoop

What is Narrow Transformations?

Narrow transformations are the result of map, filter, and such that is from the data from a single partition only, i.e. it is self-sustained. An output RDD(Resilient Distributed Datasets) has partitions with records that originate from a single partition in the parent RDD(Resilient Distributed Datasets). Only a limited subset of partitions used to calculate the result. Spark groups narrow transformations as a stage.

Next 5 interview question(s)

How many type of transformations exist?
What is Preferred Locations?
How do you define actions?
What is the transformation?
What are the possible operations on RDD?