Featured post

Best Machine Learning Book for Beginners

You need a mixof different technologies for Data Science projects. Instead of learning many skills, just learn a few. The four main steps of any project are extracting the data, model development, artificial intelligence, and presentation. Attending interviews with many skills is not so easy. So keep the skills short.
A person with many skills can't perform all the work. You had better learn a few skills like Python, MATLAB, Tableau, and RDBMS. So that you can get a job quickly in the data-science project.
Out of Data Science skills, Machine learning is a new concept. Why because you can learn Python, like any other language. Tableau also the same. Here is the area that needs your 60% effort is Machine learning.  Machine Learning best book to start.

Related Posts How to write multiple IF-conditions in Python Simplified

Big Data: Top Hadoop Interview Questions (3 of 5)

1) What are daemons in Hadoop?

Big Data: Top Hadoop Interview Questions
#Big Data: Top Hadoop Interview Questions:
In reality running Hadoop means, running daemons of resident programs in multiple servers of your network. This kind of architecture is called fully configured cluster.

2) How daemons run in Hadoop architecture?

Some daemons run in only one server, and others run in more than one server

3) What are the 5 daemons of Hadoop?

-Name node
-Secondary name node
-Data Node
-Job tracker
-Task tracker

4) How many levels do we classify Hadoop broadly?

Broadly we can classify as, it is combination of distributed storage and distributed computation.
Also, as Master/Slave architecture

5) Who is the master of HDFS?

Name node is the master of HDFS

6) What are the functions of Name node?

-Master of HDFS
-Directs slave node i.e., Data nodes
-Book keeping for HDFS
-Monitor overall health of HDFS

7) What is data node?

Each slave machine will have Data node daemon.It performs grunt work of distributed file system

8) What are the functions of Data node?

-Main functionality is read or write HDFS file blocks to local system
-Data node communicates to name node about data blocks. Name node in turn communcates about data block and Data nodes to client.
-Data nodes can communicate each other
-Every change of data in Data node will communicate to Name node

9) How many replicas of data blocks stored in different Data nodes?

3

10) What is Secondary Data node(SNN)?
  • SNN is an assistant to Name node. It also monitors the state of HDFS cluster
  • Like Name node each cluster has one SNN, and it typically resides on its own machine
  • Data nodes and Task trackers run on multiple servers.
It does not record any changes, but time to time it suggests Name node to take SNAP shots of HDFS metadata

11) What will happen if Name node fails?

Then , human interventions is required. That time SNN acts as Name node.

12) What is the role of Job tracker?

This is mediator between client and Task tracker
- Prepares execution plan
-Assign works to task trackers
-Assign nodes to different tasks
-Monitors all tasks are running fine or not

13) What is the role of Task tracker?

Manages execution of individual tasks on each slave node
Single task tracker for each slave node
A task tracker can spread multiple JVMs in a single slave node, to process parallel

Comments

Popular posts from this blog

Hyperledger Fabric: 20 Real Interview Questions

Python IF Statements Multiple Conditions Examples

Best Machine Learning Book for Beginners