Featured post

The Ultimate Cheat Sheet On Hadoop

Top 20 frequently asked questions to test your Hadoop knowledge given in the below Hadoop cheat sheet. Try finding your own answers and match the answers given here.




Question #1 

You have written a MapReduce job that will process 500 million input records and generate 500 million key-value pairs. The data is not uniformly distributed. Your MapReduce job will create a significant amount of intermediate data that it needs to transfer between mappers and reducers which is a potential bottleneck. A custom implementation of which of the following interfaces is most likely to reduce the amount of intermediate data transferred across the network?



A. Writable
B. WritableComparable
C. InputFormat
D. OutputFormat
E. Combiner
F. Partitioner
Ans: e




Question #2 

Where is Hive metastore stored by default ?


A. In HDFS
B. In client machine in the form of a flat file.
C. In client machine in a derby database
D. In lib directory of HADOOP_HOME, and requires HADOOP_CLASSPATH to be modified.
Ans: c




Question…

Major Players in Cloud Computing

As of now the following are major players in cloud computing. Amazon.com is a web retailer and has the world's largest public cloud.

  • Google operates a computing cloud built upon open source software which is optimized for Internet search.
  •  Hewlett-Packard provides business printers with the capability to scan and store information within pods in cloud computing systems that combine servers, data storage, and management software in a single integrated package.
  • IBM employs a hybrid commercial and open source cloud strategy developed from prototype projects with client companies and government agencies.
  • Microsoft has a commercial software centric infrastructure for delivering cloud computing services.
  • Oracle markets an engineered systems approach combining hardware and software it promotes as providing superior performance and security.
  • NetSuite provides financial and resource planning functions.
  • Salesforce.com sells cloud-based e-mail, computer storage, and customer management and customer management software; it also has acquired other companies to offer social enterprise tools.
  • Other major technology suppliers which have cloud-related hardware and software products include Cisco and Dell.

Comments

Popular posts from this blog

Hadoop fs (File System) Commands List

Hyperledger Fabric: 20 Real Interview Questions

AWS Vs Azure Load Balancers Top Insights

4 Important Skills You Need for Data Scientists