Featured post

The Ultimate Cheat Sheet On Hadoop

The Hadoop cheat sheet below gives the top 20 frequently asked questions to test your Hadoop knowledge. Try answering each one yourself, then compare your answers with the ones given here.




Question #1 

You have written a MapReduce job that will process 500 million input records and generate 500 million key-value pairs. The data is not uniformly distributed. Your MapReduce job will create a significant amount of intermediate data that it needs to transfer between mappers and reducers which is a potential bottleneck. A custom implementation of which of the following interfaces is most likely to reduce the amount of intermediate data transferred across the network?



A. Writable
B. WritableComparable
C. InputFormat
D. OutputFormat
E. Combiner
F. Partitioner
Ans: e
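
To see why a combiner cuts the data sent from mappers to reducers, here is a minimal sketch in plain Python. It is a simulation, not Hadoop API code: the function names and the sample input splits are assumptions made for illustration. In a real job the combiner is registered with `Job.setCombinerClass` and is usually a Reducer implementation.

```python
from collections import Counter, defaultdict

def map_phase(split):
    """Emit one (word, 1) pair per word, as a word-count mapper would."""
    return [(word, 1) for record in split for word in record.split()]

def combine(pairs):
    """Sum the values per key on the mapper side, before anything is shuffled."""
    totals = Counter()
    for key, value in pairs:
        totals[key] += value
    return list(totals.items())

def reduce_phase(shuffled_pairs):
    """Sum the values per key on the reducer side."""
    totals = defaultdict(int)
    for key, value in shuffled_pairs:
        totals[key] += value
    return dict(totals)

# Hypothetical skewed input: two splits (one per mapper), dominated by key "a".
splits = [
    ["a a a b", "a a c"],
    ["a b b a", "a a a a"],
]

# Pairs that would cross the network without and with a combiner.
without_combiner = [pair for split in splits for pair in map_phase(split)]
with_combiner = [pair for split in splits for pair in combine(map_phase(split))]

print(len(without_combiner), "pairs shuffled without a combiner")
print(len(with_combiner), "pairs shuffled with a combiner")
print(reduce_phase(with_combiner))
```

Both paths produce the same final counts, but the combined path shuffles far fewer pairs. This only works when the reduce operation is commutative and associative (such as a sum), and Hadoop does not guarantee how many times the combiner runs, so the job must stay correct without it.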




Question #2 

Where is the Hive metastore stored by default?


A. In HDFS
B. On the client machine, in the form of a flat file.
C. On the client machine, in a Derby database.
D. In lib directory of HADOOP_HOME, and requires HADOOP_CLASSPATH to be modified.
Ans: c




Question…

Overview Of Cloud Standards

Cloud computing is slowly becoming a reality, so it has to address concerns such as security, interoperability, portability, and governance at the earliest opportunity.

Compliance with guidelines and standards that cloud providers define by consensus can accelerate this.

Until these concerns are addressed, users will be wary of treading this path, in spite of its powerful economic model for business computing.

Interoperability/integration

Interoperability enables products/software components to work with or integrate with each other seamlessly, in order to achieve the desired result.

Thus, it provides flexibility and the choice to use multiple products to meet our needs. This is enabled either by integrating through standard interfaces or by means of a broker that converts one product interface to another.
Cloud computing depends on compliance standards.
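
The broker approach described above can be sketched as a small adapter. This is only an illustration under stated assumptions: `ObjectStore`, `VendorBlobClient`, `VendorBlobBroker`, and `archive_report` are hypothetical names, not part of any real cloud SDK or standard.

```python
from typing import Protocol

class ObjectStore(Protocol):
    """Hypothetical standard interface that client code is written against."""
    def put(self, key: str, data: bytes) -> None: ...
    def get(self, key: str) -> bytes: ...

class VendorBlobClient:
    """Hypothetical vendor product with its own, non-standard interface."""
    def __init__(self) -> None:
        self._blobs = {}  # in-memory stand-in for the vendor's storage

    def upload_blob(self, container: str, name: str, payload: bytes) -> None:
        self._blobs[f"{container}/{name}"] = payload

    def download_blob(self, container: str, name: str) -> bytes:
        return self._blobs[f"{container}/{name}"]

class VendorBlobBroker:
    """Broker that converts ObjectStore calls into VendorBlobClient calls."""
    def __init__(self, client: VendorBlobClient, container: str) -> None:
        self._client = client
        self._container = container

    def put(self, key: str, data: bytes) -> None:
        self._client.upload_blob(self._container, key, data)

    def get(self, key: str) -> bytes:
        return self._client.download_blob(self._container, key)

def archive_report(store: ObjectStore, name: str, text: str) -> bytes:
    """Client code that knows only the standard interface."""
    store.put(name, text.encode("utf-8"))
    return store.get(name)

broker = VendorBlobBroker(VendorBlobClient(), container="reports")
result = archive_report(broker, "q1.txt", "ok")
print(result)
```

Because `archive_report` depends only on the standard interface, moving to another vendor means writing a new broker class and leaving the client code unchanged.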

Security

Security involves the protection of information assets through various policies, procedures, and technologies, which need to adhere to standards and best practices in order to achieve the desired level of security.

For example, the Payment Card Industry Data Security Standard (PCI DSS) from the PCI Security Standards Council (PCI SSC) defines ways to secure credit card data to avoid fraud. It applies to all organizations that hold, process, or pass cardholder information.

Portability


Software is said to be portable when the cost of porting it from the platform for which it was originally developed to a new platform is less than the cost of rewriting it for the new platform. Software with good portability thus avoids vendor lock-in.

This is typically achieved by adhering to standard interfaces defined between the software component and vendor platforms. For example, Java programs are said to be portable across operating systems (OS) that adhere to the standard interfaces defined between the Java runtime environment and the OS.
Portability and governance are the key factors in Cloud computing.

Governance

Governance, Risk Management, and Compliance (GRC): governance focuses on ensuring that the enterprise adheres to defined policies and processes. Risk management puts controls in place to manage and mitigate risks as defined by the enterprise.

Compliance ensures that the enterprise adheres to legal and legislative requirements as well as internal policies. Standards have been defined to help IT systems comply with industry and legal requirements such as the Sarbanes-Oxley Act (SOX) and the Health Insurance Portability and Accountability Act (HIPAA).
