Posts

Showing posts matching the search for Hadoop

This post lists the Hadoop HDFS file system commands. They are useful for your projects and interviews.

HDFS File System Commands

Usage: hadoop fs -cmd <args>, where cmd is the specific command and args are its arguments.

The List of Commands

cat
hadoop fs -cat FILE [FILE …]
Displays the files' content. To read compressed files, use the text command instead.

chgrp
hadoop fs -chgrp [-R] GROUP PATH [PATH …]
Changes the group association for files and directories. The -R option applies the change recursively. The user must be the files' owner or a superuser.

chmod
hadoop fs -chmod [-R] MODE[,MODE …] PATH [PATH …]
Changes the permissions of files and directories. Like its Unix equivalent, MODE can be a 3-digit octal mode, or {augo}+/-{rwxX}. The -R option applies the change recursively. The user must be the files' owner or a superuser.

chown
hadoop fs -chown [-R] [OWNER][:[GROUP]] PATH [PATH…]
Changes the ownership of files and di...
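The chmod entry says MODE works like its Unix equivalent, so the two MODE forms can be tried locally before touching a cluster. The following is a minimal sketch that uses the ordinary Unix chmod on a temporary file as a stand-in for HDFS. The HDFS paths in the closing comments are hypothetical and are not taken from the post.

```shell
# Local sketch: plain Unix chmod on a temporary file, standing in for HDFS.
f=$(mktemp)

# Octal form: rwx for the owner, r-x for the group, nothing for others.
chmod 750 "$f"
mode_octal=$(ls -l "$f" | cut -c1-10)

# Symbolic form: reset to rw------- first, then add bits with {augo}+{rwx}.
chmod 600 "$f"
chmod u+x,g+rx "$f"
mode_symbolic=$(ls -l "$f" | cut -c1-10)

echo "octal:    $mode_octal"
echo "symbolic: $mode_symbolic"
rm -f "$f"

# The HDFS counterparts would look like this (paths are hypothetical):
#   hadoop fs -chmod 750 /user/hduser/report.txt
#   hadoop fs -chmod -R u+x,g+rx /user/hduser/reports
```

Both lines print -rwxr-x---, which shows that the octal mode 750 and the symbolic changes applied on top of 600 arrive at the same permissions.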

How to Setup Hadoop Cluster Top Ideas

- September 13, 2015

This post explains how to set up a Hadoop cluster on the CentOS operating system. You can install CentOS either on your laptop or in a virtual machine.

Hadoop Cluster Setup Process

A 9-step process to set up a Hadoop cluster:

Step 1: Install Sun Java on Linux. Commands:
$ sudo apt-add-repository ppa:flexiondotorg/java
$ sudo apt-get update
$ sudo apt-get install sun-java6-jre sun-java6-plugin
$ sudo update-java-alternatives -s java-6-sun

Step 2: Create the Hadoop user. Commands:
$ sudo addgroup hadoop
$ sudo adduser --ingroup hadoop hduser

Step 3: Install the SSH server if it is not already present. Commands:
$ sudo apt-get install openssh-server
$ su - hduser
$ ssh-keygen -t rsa -P ""
$ cat $HOME/.ssh/id_rsa.pub >> $HOME/.ssh/authorized_keys

Step 4: Install Hadoop. Commands:
$ wget http://www.eng.lsu.edu/mirrors/apache/hadoop/core/hadoop-0.22.0/hadoop-0.22.0.tar.gz
$ cd /home/hduser
$ tar xzf ...
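Before working through the steps, it can help to check which of the tools they rely on are already installed. The following is a small sketch, not part of the original post: the helper name missing_tools is hypothetical, and the tool list is simply read off the commands shown in steps 1 to 4.

```shell
# Report which of the given commands are not on PATH.
# (missing_tools is a hypothetical helper name used for illustration.)
missing_tools() {
  missing=""
  for tool in "$@"; do
    command -v "$tool" >/dev/null 2>&1 || missing="$missing $tool"
  done
  # Trim the leading space before printing the result.
  echo "${missing# }"
}

# Tools used by the steps above.
result=$(missing_tools java ssh ssh-keygen wget tar)
if [ -z "$result" ]; then
  echo "all prerequisites found"
else
  echo "missing: $result"
fi
```

Anything reported as missing can then be installed with the package commands from the corresponding step.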

30 High Paying Tech Jobs,$110,000 Plus Salary

- June 09, 2015

There is a growing demand for software developers across the globe, and these 30 high-paying IT jobs are well worth a look.

PaaS, or "Platform as a Service", is a type of cloud computing technology. It hosts everything that a developer needs to write an app. Once written, these apps live on the PaaS cloud.

Cassandra is a free and open source NoSQL database. It can store and handle data of different types and sizes, and it's increasingly the go-to database for mobile and cloud applications. Several IT companies, including Apple and Netflix, use Cassandra.

MapReduce has been called "the heart of Hadoop." MapReduce is the method that allows Hadoop to process all kinds of data stored across many low-cost computer servers. To get meaningful data out of Hadoop, a programmer writes software programs (often in the popular language Java) for MapReduce.

Cloudera is a company that ma...

Big Data:Top Hadoop Interview Questions (2 of 5)

- June 29, 2015

Frequently asked Hadoop interview questions.

1. What is Hadoop?
Hadoop is a framework that gives users the power of distributed computing.

2. What is the difference between SQL and Hadoop?
SQL works with structured data and is best suited to legacy technologies. Hadoop is suitable for unstructured data and is well suited to modern technologies.

3. What is the Hadoop framework?
It is a distributed network of commodity servers (a cluster is made up of multiple nodes, each of which is a server).

4. What are the 4 properties of Hadoop?
Accessible: Hadoop runs on large clusters of commodity machines.
Robust: Hadoop assumes that low-cost commodity machines fail often, and it handles such failures gracefully.
Scalable: Hadoop scales linearly to handle larger data by adding more nodes to the cluster.
Simple: Hadoop allows users to quickly write efficient parallel code.

5. What kind of data does Hadoop need?
Traditional RDBMS having re...
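The "Simple" property above refers to the map, shuffle, and reduce pattern that Hadoop jobs follow. As a rough local analogy, and not Hadoop code, the same three stages can be sketched as a Unix pipeline that counts words. The function name wordcount and the sample input are made up for illustration.

```shell
# Word count as a local pipeline, mirroring map -> shuffle -> reduce.
# (wordcount is a hypothetical name; this runs on one machine, not on Hadoop.)
wordcount() {
  tr -s '[:space:]' '\n' |       # map: emit one word per line
    sed '/^$/d' |                # drop empty lines from leading whitespace
    sort |                       # shuffle: bring identical words together
    uniq -c |                    # reduce: count each group of identical words
    awk '{print $2 "\t" $1}'     # format as word<TAB>count
}

printf 'big data big cluster\n' | wordcount
```

For the sample input this prints big 2, cluster 1, and data 1, one tab-separated pair per line. On a real cluster the map and reduce stages run in parallel on many nodes, which is what the pipeline cannot show.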

Here's Quick Guide on Hadoop Security

- August 08, 2015

This post covers security and security tools in Hadoop: the things everyone needs to take care of while working with a Hadoop cluster.

Security

We live in a very insecure world. Everything needs to be secured, from your home's front door to your passwords, the all-important virtual keys. The same applies to Big Data systems, where humongous amounts of data are processed, transformed, and stored: that data needs to be secured.

Imagine that your company spent a couple of million dollars installing a Hadoop cluster to gather and analyze your customers' spending habits for a product category using a Big Data solution. Here, a lack of data security leads to customer apprehension.

Security Concerns

Because that solution was not secure, your competitor got access to that data, and your sales dropped 20% for that product category. How did the system allow unauthorized access to data? Wasn't there any authentication mechanism in place? Why were there no alerts? Th...

Big data: Quiz-1 Hadoop Top Interview Questions

- July 25, 2015

In this post, I have given a quiz on Big Data with answers. This is the part-1 set of questions for your quick reference.

Q.1) How does Hadoop achieve scaling in terms of storage?
A. By increasing the hard disk capacity of the machine
B. By increasing the RAM capacity of the machine
C. By increasing both the hard disk and RAM capacity of the machine
D. By increasing the hard disk capacity of the machine and by adding more machines

Q.2) How is fault tolerance with respect to data achieved in Hadoop?
A. By breaking the data into smaller blocks and distributing these smaller blocks across several machines
B. By adding extra nodes
C. By breaking the data into smaller blocks, copying each block several times, and distributing these replicas across several machines. By doing this, Hadoop makes sure that even if some machines fail, a replica is present on another machine
D. None of these

Q.3) On which parameters does Hadoop scale up?
A. Storage only
B. Performan...
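The block-and-replica idea behind Q.2 can be made concrete with a little arithmetic. The sketch below assumes a 128 MB block size and a replication factor of 3. Both values are assumptions chosen for illustration, since the quiz itself states neither, and the function names are hypothetical.

```shell
# Number of blocks for a file: ceiling of file size / block size.
blocks_needed() {
  size_mb=$1
  block_mb=$2
  echo $(( (size_mb + block_mb - 1) / block_mb ))
}

# Raw disk space used across the cluster once every block is replicated.
raw_storage_mb() {
  size_mb=$1
  replication=$2
  echo $(( size_mb * replication ))
}

# Assumed values: a 1000 MB file, 128 MB blocks, 3 replicas per block.
blocks_needed 1000 128    # prints 8
raw_storage_mb 1000 3     # prints 3000
```

With three copies of each of the 8 blocks spread over different machines, losing a single machine still leaves at least two copies of every block, which is the behaviour option C describes.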

Hadoop Skills Free Video Training

- March 31, 2015

Are you interested in the world of Big Data technologies, but find it a little cryptic and see the whole thing as a big puzzle? This free Hadoop video training is really useful for learning quickly. Are you looking to understand how Big Data impacts large and small businesses and people like you and me? Do you feel that many people talk about Big Data and Hadoop without even knowing the basics, like the history of Hadoop and its major players and vendors? Then this is the course just for you! This course builds an essential, fundamental understanding of Big Data problems and of Hadoop as a solution.

This course takes you through:
- Understanding Big Data problems with easy-to-understand examples.
- The history and advent of Hadoop, right from when Hadoop wasn't even named Hadoop.
- The Hadoop magic that makes it so unique and powerful.
- Understanding the difference between data science and data engineering, which is one of the big confusions in selecting a career or understanding a job role.
- And mos...

ApplyBigAnalytics
