The Ultimate Cheat Sheet On Hadoop

This Hadoop cheat sheet collects the top 20 frequently asked questions to test your Hadoop knowledge. Try working out your own answers first, then compare them with the answers given here.

Question #1 

You have written a MapReduce job that will process 500 million input records and generate 500 million key-value pairs. The data is not uniformly distributed. Your MapReduce job will create a significant amount of intermediate data that it needs to transfer between mappers and reducers, which is a potential bottleneck. A custom implementation of which of the following interfaces is most likely to reduce the amount of intermediate data transferred across the network?

A. Writable
B. WritableComparable
C. InputFormat
D. OutputFormat
E. Combiner
F. Partitioner
Ans: E
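
To see why a combiner helps, here is a minimal sketch in plain Python that simulates the idea without using Hadoop itself. The function names (`map_phase`, `combine`, `reduce_phase`) and the sample splits are hypothetical and chosen only for illustration. In a real job the combiner would be a Reducer class registered on the job, and it only gives correct results when the aggregation is commutative and associative, as a sum is.

```python
from collections import Counter

def map_phase(records):
    """Emit one (key, 1) pair per input record, as a counting mapper would."""
    return [(key, 1) for key in records]

def combine(pairs):
    """Sum values per key on the mapper side, before data crosses the network."""
    totals = Counter()
    for key, value in pairs:
        totals[key] += value
    return list(totals.items())

def reduce_phase(shuffled):
    """Sum values per key across everything received from all mappers."""
    totals = Counter()
    for key, value in shuffled:
        totals[key] += value
    return dict(totals)

# Two hypothetical input splits with a skewed key distribution.
splits = [["a"] * 6 + ["b"] * 2, ["a"] * 5 + ["c"]]

# Pairs that would be shuffled to the reducers in each setup.
without_combiner = [pair for split in splits for pair in map_phase(split)]
with_combiner = [pair for split in splits for pair in combine(map_phase(split))]

print("pairs shuffled without combiner:", len(without_combiner))
print("pairs shuffled with combiner:", len(with_combiner))
print("final counts:", reduce_phase(with_combiner))
```

The final counts are the same either way; only the number of intermediate pairs sent to the reducers changes. The saving grows with the number of repeated keys per mapper, which is why skewed data benefits most.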

Question #2 

Where is the Hive metastore stored by default?

B. On the client machine, in the form of a flat file.
C. On the client machine, in a Derby database.
D. In the lib directory of HADOOP_HOME, and requires HADOOP_CLASSPATH to be modified.
Ans: C


An effective methodology for BI Implementation

The approach below can help ensure an effective implementation of any BI project.

Steps in a Business Intelligence Project

  1. Requirements elicitation is the most important activity in a BI exercise. We need to understand clearly the motivation behind management's need for analytics and its short-term, medium-term and long-term objectives.
  2. Conduct a workshop with all stakeholders to understand these requirements. Identifying and involving every stakeholder in the requirements discussion is essential to gaining buy-in for the BI initiative throughout the enterprise.
  3. Most stakeholders want to see a sample of the analytics and experiment with it to get a sense of its benefit and impact. Developing a quick prototype and showcasing it to all stakeholders therefore helps build the necessary impetus for the initiative.
  4. CAVEAT: Ensure that prototype development takes no more than two weeks.



