
The Ultimate Cheat Sheet On Hadoop

The cheat sheet below collects the top 20 frequently asked questions to test your Hadoop knowledge. Try answering each question yourself, then compare your answer with the one given here.

Question #1 

You have written a MapReduce job that will process 500 million input records and generate 500 million key-value pairs. The data is not uniformly distributed. Your MapReduce job will create a significant amount of intermediate data that it needs to transfer between mappers and reducers, which is a potential bottleneck. A custom implementation of which of the following interfaces is most likely to reduce the amount of intermediate data transferred across the network?

A. Writable
B. WritableComparable
C. InputFormat
D. OutputFormat
E. Combiner
F. Partitioner
Ans: E
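
To see why a combiner helps here, the following is a minimal sketch in plain Python that simulates the idea rather than using Hadoop itself. The function names (`map_words`, `combine`, `reduce_counts`) and the sample input splits are hypothetical and chosen only for illustration. It counts how many key-value pairs would cross the network with and without map-side combining.

```python
from collections import Counter, defaultdict

def map_words(record):
    """Mapper: emit one (word, 1) pair per word in the record."""
    return [(word, 1) for word in record.split()]

def combine(pairs):
    """Combiner: pre-aggregate counts per key on the mapper's side."""
    totals = Counter()
    for key, value in pairs:
        totals[key] += value
    return list(totals.items())

def reduce_counts(pairs):
    """Reducer: sum all values received for each key."""
    totals = defaultdict(int)
    for key, value in pairs:
        totals[key] += value
    return dict(totals)

# Hypothetical input: one list of records per mapper, skewed toward "error".
splits = [
    ["error error warn", "error info error"],
    ["error error error", "warn error"],
]

without_combiner = []  # pairs shuffled when mappers send raw output
with_combiner = []     # pairs shuffled when each mapper combines first
for split in splits:
    mapped = [pair for record in split for pair in map_words(record)]
    without_combiner.extend(mapped)
    with_combiner.extend(combine(mapped))

print("pairs shuffled without combiner:", len(without_combiner))
print("pairs shuffled with combiner:", len(with_combiner))
print("same result:", reduce_counts(without_combiner) == reduce_counts(with_combiner))
```

The final result is unchanged because summing is associative and commutative, so partial sums can be merged in any order. In Hadoop itself, a combiner is supplied as a `Reducer` subclass through `Job.setCombinerClass`, and the framework may run it zero or more times, so the job must stay correct without it.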

Question #2 

Where is the Hive metastore stored by default?

B. On the client machine, in the form of a flat file.
C. On the client machine, in a Derby database.
D. In the lib directory of HADOOP_HOME, and requires HADOOP_CLASSPATH to be modified.
Ans: C


4 Key features to know in Industrial IoT

The Industrial Internet is somewhat similar to the Internet of Things (IoT) and is sometimes even referred to as the “Industrial Internet of Things”.

The general idea behind the IoT

The idea is that many devices, from hi-tech equipment to simple electronics, become interconnected through M2M (Machine-to-Machine) protocols. They can then be controlled remotely and relay data to each other without any manual input, and may even be equipped with ambient intelligence.

Role of Smart Phones

This idea is already applied in smart homes, and many are seeking wider applications for it. Its industrial counterpart is viewed more from a resource and manufacturing perspective.


We can currently see such trends in mining, with automated trains, robotic hauling trucks, excavators, drills, and mineral processing plants, including remote fault and malfunction control, calibration, and configuration for increased efficiency.


Despite the current and constant need for “on-site” human workers and operators, the trend is toward reducing the human factor to the required minimum, so that a single central operation control room could remotely operate and supervise several operation sites. Human operators could also be issued remote ATLAS-like humanoid substitutes if necessary.

