Featured post

The Ultimate Cheat Sheet On Hadoop

Top 20 frequently asked questions to test your Hadoop knowledge given in the below Hadoop cheat sheet. Try finding your own answers and match the answers given here.

Question #1 

You have written a MapReduce job that will process 500 million input records and generate 500 million key-value pairs. The data is not uniformly distributed. Your MapReduce job will create a significant amount of intermediate data that it needs to transfer between mappers and reducers which is a potential bottleneck. A custom implementation of which of the following interfaces is most likely to reduce the amount of intermediate data transferred across the network?

A. Writable
B. WritableComparable
C. InputFormat
D. OutputFormat
E. Combiner
F. Partitioner
Ans: e

Question #2 

Where is Hive metastore stored by default ?

B. In client machine in the form of a flat file.
C. In client machine in a derby database
D. In lib directory of HADOOP_HOME, and requires HADOOP_CLASSPATH to be modified.
Ans: c


New Wave in Data Analytics in 2014

Now that we’re in the swing of a new year, we’ve taken stock of the data analytics trends that are brewing and developed a list of the Top 5 trends we believe are going to dominate the industry this year. Even if some of them don’t realize their full potential in 2014, it promises to be an important year in which consumer trends and technology innovation will further shape a future in which companies make data-driven decisions.
1. Data Visualization Goes Mainstream
In the mid-90s, e-mail introduced the Internet to consumers, made it more accessible, and catalyzed user adoption. Similarly, data visualization will make data analytics more accessible in 2014. Visual analytics allows business users to ask interactive questions of their prepared data sets and get immediate visual responses, which makes the whole process engaging.
This trend will democratize access to data and foster a strong data analysis culture where business users will look for data and perform visual analysis before making decisions. The quick wins that data visualization provides will lead to a changed mindset that will allow for future forays into more advanced analytics that uses math, statistics, and complex data sets. In 2014, we could see some further innovation around collaboration of business users in answering business questions. Soon, the business utility and future of a dashboard could be determined by how many “likes,” “shares,” and comments it receives from business users.
Read more atHERE


Popular posts from this blog

Hadoop fs (File System) Commands List

Hyperledger Fabric: 20 Real Interview Questions

AWS Vs Azure Load Balancers Top Insights

4 Important Skills You Need for Data Scientists