Featured Post

Python map() and lambda() Use Cases and Examples

Image
 In Python, map() and lambda functions are often used together for functional programming. Here are some examples to illustrate how they work. Python map and lambda top use cases 1. Using map() with lambda The map() function applies a given function to all items in an iterable (like a list) and returns a map object (which can be converted to a list). Example: Doubling Numbers numbers = [ 1 , 2 , 3 , 4 , 5 ] doubled = list ( map ( lambda x: x * 2 , numbers)) print (doubled) # Output: [2, 4, 6, 8, 10] 2. Using map() to Convert Data Types Example: Converting Strings to Integers string_numbers = [ "1" , "2" , "3" , "4" , "5" ] integers = list ( map ( lambda x: int (x), string_numbers)) print (integers) # Output: [1, 2, 3, 4, 5] 3. Using map() with Multiple Iterables You can also use map() with more than one iterable. The lambda function can take multiple arguments. Example: Adding Two Lists Element-wise list1 = [ 1 , 2 , 3 ]

How to Setup Hadoop Cluster Top Ideas

Hadoop cluster setup in Centos Operating System explained in this post. So you can install CentOs either in your Laptop or in Virtual Machine.
Hadoop cluster setup proces
Hadoop Cluster Setup Process

9 Steps Process to Setup Hadoop Cluster

Step 1:  Installing Sun Java on Linux. Commands to execute for the same:
sudo apt-add-repository ppa:flexiondotorg/java
sudo apt-get update
sudo apt-get install sun-java6-jre sun-java6-plugin
sudo update-java-alternatives -s java-6-sun

Step 2:  Create Hadoop User. Commands to execute for the same:
$sudo addgroup hadoop
$sudo adduser —ingroup hadoop hduser

Step 3:  Install SSH Server if not already present. Commands are:
$ sudo apt-get install openssh-server
$ su - hduser
$ ssh-keygen -t rsa -P ""
$ cat $HOME/.ssh/id_rsa.pub >> $HOME/.ssh/authorized_keys

Step 4:  Installing Hadoop. Commands for the same are:
$wget http://www.eng.lsu.edu/mirrors/apache/hadoop/core/hadoop-0.22.0/hadoop-0.22.0.tar.gz
$ cd /home/hduser
$ tar xzf hadoop-0.22.2.tar.gz
$ mv hadoop-0.22.2 hadoop

Step 5:  Edit .bashrc. Commands:
# Set Hadoop-related environment variables
export HADOOP_HOME=/home/hduser/hadoop
# Set JAVA_HOME (we will also configure JAVA_HOME directly for Hadoop later on)
export JAVA_HOME=/usr/lib/jvm/java-6-sun

Step 6:  Update hadoop-env.sh
Here we need to only update the JAVA_HOME variable, we just need to open the file from text editor.
$vi /home/hduser/hadoop/conf/hadoop-env.sh
And update the following:
export JAVA_HOME=/usr/lib/jvm/java-6-sun

Step 7:  Update the files residing in the conf folder.  The three main files are:
Core-site.xml
Mapred-site.xml
Hdfs-site.xml
Here we need to change the value of namenode and jobtracker to where we want to run.

Step 8:  Start the hadoop cluster, using command.
Start-all.sh

Step 9:  Check if all the processes are up and running, using command:
Jps
That’s it. The cluster should be up and running with the following 5 processes:
  • NameNode
  • SecondaryNameNode
  • DataNode
  • JobTracker
  • TaskTracker
If we see all these processes, it means our installation is successful and the cluster is up and running.

Comments

Popular posts from this blog

How to Fix datetime Import Error in Python Quickly

SQL Query: 3 Methods for Calculating Cumulative SUM

Python placeholder '_' Perfect Way to Use it