Hadoop - Interview Questions
Explain "Distributed Cache" in a "MapReduce Framework"
Distributed Cache is a facility provided by the MapReduce framework to cache files needed by applications. Once you have cached a file for your job, the Hadoop framework makes it available on every data node where your map/reduce tasks are running. You can then access the cached file as a local file in your Mapper or Reducer.
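As a rough illustration of the "access it as a local file" part, here is a minimal sketch of what a Mapper's setup step might do with a small cached lookup file. It uses only the JDK so that it runs on its own. The class name `CacheLookupSketch`, the method `loadLookup`, and the tab-separated file format are assumptions made for this example and do not come from Hadoop. In a real job the driver would typically register the file with `job.addCacheFile(uri)` and the task would locate it through `context.getCacheFiles()`; here a temporary file stands in for the copy Hadoop would place on the node.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Arrays;
import java.util.HashMap;
import java.util.Map;

class CacheLookupSketch {

    // Hypothetical helper: reads a locally available cache file into memory,
    // the way a Mapper or Reducer might do once in setup().
    // Assumed format: key<TAB>value, one pair per line.
    static Map<String, String> loadLookup(Path localFile) throws IOException {
        Map<String, String> lookup = new HashMap<>();
        try (BufferedReader reader =
                 Files.newBufferedReader(localFile, StandardCharsets.UTF_8)) {
            String line;
            while ((line = reader.readLine()) != null) {
                String[] parts = line.split("\t", 2);
                if (parts.length == 2) {
                    lookup.put(parts[0], parts[1]);
                }
                // Lines without a tab are skipped rather than treated as errors.
            }
        }
        return lookup;
    }

    public static void main(String[] args) throws IOException {
        // Stand-in for the file the framework would have copied to this node.
        Path cached = Files.createTempFile("countries", ".tsv");
        Files.write(cached,
                Arrays.asList("US\tUnited States", "IN\tIndia"),
                StandardCharsets.UTF_8);

        Map<String, String> lookup = loadLookup(cached);
        System.out.println(lookup.get("IN"));

        Files.delete(cached);
    }
}
```

Loading the file once into a map and reusing it for every record is the usual reason to cache small reference data this way, since each task reads from its own local disk instead of going back to HDFS for every lookup.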