Which HDFS command uploads a local file X into an existing HDFS directory Y?
Identify the MapReduce v2 (MRv2 / YARN) daemon responsible for launching application containers and monitoring application resource usage?
In a MapReduce job, the reducer receives all values associated with same key. Which statement best describes the ordering of these values?
Note:
* Input to theReduceris the sorted output of the mappers.
* The framework calls the application's Reduce function once for each unique key in the sorted order.
* Example:
For the given sample input the first map emits:
< Hello, 1>
< World, 1>
< Bye, 1>
< World, 1>
The second map emits:
< Hello, 1>
< Hadoop, 1>
< Goodbye, 1>
< Hadoop, 1>
You have just executed a MapReduce job. Where is intermediate data written to after being emitted from the Mapper's map method?
The mapper output (intermediate data) is stored on the Local file system (NOT HDFS) of each individual mapper nodes. This is typically a temporary directory location which can be setup in config by the hadoop administrator. The intermediate data is cleaned up after the Hadoop Job completes.
When can a reduce class also serve as a combiner without affecting the output of a MapReduce program?
You can use your reducer code as a combiner if the operation performed is commutative and associative.