Posts

Featured Post

SQL Interview Success: Unlocking the Top 5 Frequently Asked Queries

Image
 Here are the five top commonly asked SQL queries in the interviews. These you can expect in Data Analyst, or, Data Engineer interviews. Top SQL Queries for Interviews 01. Joins The commonly asked question pertains to providing two tables, determining the number of rows that will return on various join types, and the resultant. Table1 -------- id ---- 1 1 2 3 Table2 -------- id ---- 1 3 1 NULL Output ------- Inner join --------------- 5 rows will return The result will be: =============== 1  1 1   1 1   1 1    1 3    3 02. Substring and Concat Here, we need to write an SQL query to make the upper case of the first letter and the small case of the remaining letter. Table1 ------ ename ===== raJu venKat kRIshna Solution: ========== SELECT CONCAT(UPPER(SUBSTRING(name, 1, 1)), LOWER(SUBSTRING(name, 2))) AS capitalized_name FROM Table1; 03. Case statement SQL Query ========= SELECT Code1, Code2,      CASE         WHEN Code1 = 'A' AND Code2 = 'AA' THEN "A" | "A

SPARK is Replacement for MapReduce in Bigdata Real Analytics!

Image
Apache Spark is among the Hadoop ecosystem technologies acting as catalysts for broader adoption of big data infrastructure. Now, Looker -- a vendor of business intelligence software -- has announced support for Spark and other Hadoop technologies. The goal? To speed up access to the data that fuels business decision making. SPARK Jobs Hadoop's arrival on the scene 10 years ago may have started the big data revolution, but only recently did adoption of this technology begin spreading to a wider audience. Apache Spark is one of the catalysts for the growing adoption rates. Spark can be used as a replacement for MapReduce, a component of Hadoop implementations, to speed up the processing and analytics of big data by 100x in memory, according to the Apache Software Foundation. In today's business environment, in which real-time analytics is the goal and organizations don't want to wait for data warehouses and analysts to provide batch intelligence back to business u

Hot Skills: Spark Self Study Materials

Image
Spark: With job postings up 120% year-over-year on Dice, demand for this open-source cluster-computing framework is broad-based. Government contractors and financial-services firms are just a few of the groups eager to find candidates with this skillset. 2015 Average Salary: $113,214 Related: SPARK Self Study Materials Spark Big Data and Cloud:  As companies expand their tech infrastructures, they need cloud and Big Data services such as Azure (#2), Hive (#8), and Cassandra (#9) for data storage, analysis, and security. Big Data and cloud-related skills dominated the Highest-Paid Skills list on Dice’s salary survey for the second straight year.  2015 Average Salary: Big Data—$121,328 Azure — $110,207 Salesforce: This customer-service platform serves as the bedrock for many companies’ customer service departments. Demand for Salesforce professionals seems unlikely to decline anytime soon. Employers are even willing to offer telecommuting options to lure Salesforce talent. 2

How to Use Chaid Useful for Data Science Developers

Image
The Chaid is one of the most asked skills for Data Science engineers. The CHAID Analysis (Chi-Square Automatic Interaction Detection) is a form of analysis that determines how variables best combine to explain the outcome in a given dependent variable. Chaid Model The model can be used in cases of market penetration, predicting and interpreting responses, or a multitude of other research problems. CHAID analysis is especially useful for data expressing categorized values instead of continuous values. For this kind of data, some common statistical tools such as regression are not applicable and CHAID analysis is a perfect tool to discover the relationship between variables.  One of the outstanding advantages of CHAID analysis is that it can visualize the relationship between the target (dependent) variable and the related factors with a tree 1. CHAID Analysis for Surveys Analysis Most survey answers have categorized values instead of continuous values.  Finding out the statistical re

The best solution Ceph Data Storage for big data

Image
#The best solution Ceph Data Storage for big data: The power of Ceph can transform your organization’s IT infrastructure and your ability to manage vast amounts of data. If your organization runs applications with different storage interface needs, Ceph is for you! Ceph’s foundation is the Reliable Autonomic Distributed Object Store (RADOS), which provides your applications with object, block, and file system storage in a single unified storage cluster—making Ceph flexible, highly reliable and easy for you to manage. Ceph’s RADOS provides you with extraordinary data storage scalability—thousands of client hosts or KVMs accessing petabytes to exabytes of data. Each one of your applications can use the object, block or file system interfaces to the same RADOS cluster simultaneously, which means your Ceph storage system serves as a flexible foundation for all of your data storage needs. You can use Ceph for free, and deploy it on economical commodity hardware. Ceph is a better way

OpenStack Private Cloud, What IT Developers Should Learn

Image
An example of OpenStack Usage: The second largest car manufacturer in the world, Volkswagen Group, will use the open-source cloud computing platform OpenStack to build a private cloud that will host websites for its brands VW, Audi, and Porsche, and will be a platform for innovating automotive technology, the company announced Wednesday. Photo Credit: Srini For the past two years, VW officials at the company’s Wolfsburg, Germany, headquarters debated what platform to use. VW decided to first build out a private cloud based on OpenStack that will eventually span thousands of physical nodes across multiple data centers in the U.S., Europe, and Asia. Eventually, VW hopes to incorporate public cloud resources to create a hybrid cloud, said officials with VW’s consultant, Mirantis. When fully built out, VW’s private cloud could be one of the top five or 10 largest OpenStack-based clouds in production, said Mirantis co-founder and chief marketing officer Boris Renski. According to t

3 top IT Skills every new IT Professionals learn to progress in software career

Image
What are the skills needed by the new IT professionals or job seekers who help the  organisation  transition to IT-as-a-Service.  In order  to lead their  organisations  to the cloud, IT professionals must focus on three fundamental areas: Core  Virtualisation  Skill Sets IT professionals must think and operate in the virtual world. No longer can they be tied to the old paradigm of physical assets dedicated to specific users or applications. They must think in terms of “services” riding on top of a fully virtualized infrastructure, and how applications will take advantage of shared resources with both servers and storage. This requires comprehensive skills in both server and storage virtualization technology, and enough experience as a practitioner to understand the intricacies and critical elements of managing virtual platforms. Rules of Old IT and New IT Cross-training Competency Leaders of IT innovation cannot be completely siloed and hyper-focused. Although there will

Linux Must Read Course Contents

Image
The complete syllabus for the Linux certification course you need to know before start preparation for the test. List of Course Contents The Linux community and a career in open source Finding your way on a Linux system The power of the command line The Linux operating system Security and file permissions Topic 1: The Linux Community and a Career in Open Source (weight: 7) 1.1 Linux Evolution and Popular Operating Systems Weight: 2 Description: Knowledge of Linux development and major distributions. Key Knowledge Areas: Open Source Philosophy Distributions Embedded Systems The following is a partial list of the used files, terms and utilities: Android Debian, Ubuntu (LTS) CentOS, openSUSE, Red Hat Linux Mint, Scientific Linux 1.2 Major Open Source Applications Weight: 2 Description: Awareness of major applications as well as their uses and development. Key Knowledge Areas: Desktop Applications Server Applications Development Languages Package Management