Posts

Showing posts from November, 2015

Featured Post

SQL Interview Success: Unlocking the Top 5 Frequently Asked Queries

Here are the five most commonly asked SQL queries in interviews. You can expect them in Data Analyst or Data Engineer interviews.

Top SQL Queries for Interviews

01. Joins

A commonly asked question provides two tables and asks how many rows each join type returns, and what the result set looks like.

Table1
--------
id
----
1
1
2
3

Table2
--------
id
----
1
3
1
NULL

Output
-------
Inner join
---------------
5 rows are returned. Each of the two 1s in Table1 matches both 1s in Table2 (4 rows), 3 matches once (1 row), and 2 and NULL match nothing.

The result will be:
===============
1  1
1  1
1  1
1  1
3  3

02. Substring and Concat

Here, we need to write an SQL query that converts the first letter of each name to upper case and the remaining letters to lower case.

Table1
------
ename
=====
raJu
venKat
kRIshna

Solution:
==========
SELECT CONCAT(UPPER(SUBSTRING(ename, 1, 1)), LOWER(SUBSTRING(ename, 2))) AS capitalized_name FROM Table1;

03. Case statement

SQL Query
=========
SELECT Code1, Code2,      CASE         WHEN Code1 = 'A' AND Code2 = 'AA' THEN "A" | "A
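The inner join row count and the capitalization query from the excerpt above can be checked with a small script. This is a minimal sketch, assuming Python's built-in sqlite3 module as a stand-in database; SQLite is an assumption here, not the engine the post targets, so the sketch uses SQLite's || operator and substr() in place of CONCAT and SUBSTRING. The Names table is a hypothetical name introduced to hold the ename column, and the LEFT JOIN count is an extra illustration that is not in the post.

```python
import sqlite3

# In-memory SQLite database, used only as a convenient stand-in engine.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Table1 (id INTEGER)")
conn.execute("CREATE TABLE Table2 (id INTEGER)")
conn.executemany("INSERT INTO Table1 VALUES (?)", [(1,), (1,), (2,), (3,)])
conn.executemany("INSERT INTO Table2 VALUES (?)", [(1,), (3,), (1,), (None,)])

# INNER JOIN: only rows whose ids are equal survive; NULL never equals anything.
inner_rows = conn.execute(
    "SELECT t1.id, t2.id FROM Table1 t1 "
    "INNER JOIN Table2 t2 ON t1.id = t2.id ORDER BY t1.id"
).fetchall()

# LEFT JOIN: every Table1 row is kept, with NULL where Table2 has no match.
left_rows = conn.execute(
    "SELECT t1.id, t2.id FROM Table1 t1 "
    "LEFT JOIN Table2 t2 ON t1.id = t2.id ORDER BY t1.id"
).fetchall()

# Hypothetical Names table holding the ename values from question 02.
conn.execute("CREATE TABLE Names (ename TEXT)")
conn.executemany("INSERT INTO Names VALUES (?)", [("raJu",), ("venKat",), ("kRIshna",)])

# SQLite spelling of the capitalization query (|| and substr are SQLite's forms).
capitalized = [
    row[0]
    for row in conn.execute(
        "SELECT upper(substr(ename, 1, 1)) || lower(substr(ename, 2)) "
        "FROM Names ORDER BY rowid"
    )
]

print(len(inner_rows), inner_rows)
print(len(left_rows))
print(capitalized)
```

Running the joins against real data like this is a quick way to practise predicting row counts for duplicate and NULL keys before an interview.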

The awesome points to learn from DB2 NoSQL GraphStore

#db2 graphstore: A useful starting point, before looking at the RDF format for the graph data model: if the graph data model is the model the semantic web uses to store data, RDF is the format in which it is written.

Summary of DB2 Graph Store:

DB2-RDF support is officially called "NoSQL Graph Support".
The API extends the Jena API (Graph layer). Developers familiar with Jena TDB will have the Model layer capabilities they are accustomed to.
Although the DB2-RDF functionality is being released with DB2 LUW 10.1, it is also compatible with DB2 9.7.
Full support for SPARQL 1.0 and a subset of SPARQL 1.1. Full support for SPARQL 1.1 (which is still a W3C working draft) will be forthcoming.
While RDBMS implementations of RDF graphs have typically been non-performant, that is not the case here*. Some very impressive and innovative work has been put into optimization capabilities. Out-of-the box performance is compara

The Ultimate Cheat Sheet On Hadoop

Below are the top 20 frequently asked questions to test your Hadoop knowledge, collected in this Hadoop cheat sheet. Try to work out your own answers and compare them with the answers given here.

Question #1
You have written a MapReduce job that will process 500 million input records and generate 500 million key-value pairs. The data is not uniformly distributed. Your MapReduce job will create a significant amount of intermediate data that it needs to transfer between mappers and reducers, which is a potential bottleneck. A custom implementation of which of the following interfaces is most likely to reduce the amount of intermediate data transferred across the network?
A. Writable
B. WritableComparable
C. InputFormat
D. OutputFormat
E. Combiner
F. Partitioner
Ans: E

Question #2
Where is the Hive metastore stored by default?
A. In HDFS
B. In client machine in the form of a flat file.
C. In client machine in a derby database
D. In lib directory of HADOOP_HOME, and requires HADOOP_CLASSPATH to be modified.
Ans
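Question #1 above turns on what a combiner does: it aggregates a mapper's output locally, so fewer key-value pairs have to cross the network to the reducers. The following is a minimal sketch in plain Python, not Hadoop code; the function names map_phase and combine and the sample input are hypothetical and exist only to illustrate the idea.

```python
from collections import Counter

def map_phase(records):
    """Emit one (key, 1) pair per input record, as a simple counting mapper would."""
    return [(record, 1) for record in records]

def combine(pairs):
    """Sum the values per key on the mapper side, before anything is shuffled."""
    totals = Counter()
    for key, value in pairs:
        totals[key] += value
    return sorted(totals.items())

# Skewed input seen by a single mapper: a few keys dominate.
mapper_input = ["a"] * 6 + ["b"] * 3 + ["c"]

raw_pairs = map_phase(mapper_input)   # what would be shuffled without a combiner
combined_pairs = combine(raw_pairs)   # what is shuffled with a combiner

print(len(raw_pairs), "pairs without a combiner")
print(len(combined_pairs), "pairs with a combiner:", combined_pairs)
```

The saving grows with how often keys repeat within one mapper's output, which is why a combiner helps most on skewed data. This kind of local aggregation is only safe when the reduce operation is associative and commutative, as a sum is.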

How to learn Tableau best way with self Study tutorials

#Tips for Mastering Tableau: Self-Study Video Tutorials

The Tableau training website offers a multitude of resources for Tableau users, with many of the videos being brief and addressing specific topics. This self-study syllabus organizes those videos to help you quickly find the training you need on specific topics.

Tableau 9 for Data Science Engineers.

To learn any software tool, you need to follow these steps:
Tutorials - either online or classroom
Books - read the theory from scratch
Hands-on training - practice what you have learnt
Materials - written by experienced developers
Blogs/Websites/Forums - give you many insights

The link below contains valuable video tutorials. You can learn Tableau quickly, in just a few days. Take Video Lessons Here

The 12 best QlikView Interview Questions with answers

1) What is QlikView?
QlikView is a program that makes it possible to retrieve and assimilate data from different sources. Once loaded into the program, the data is presented in a way that is easy to understand and work with.

2) In how many flavors is QlikView available?
QlikView comes in three flavors called QlikView Enterprise, QlikView Professional and QlikView Analyzer. If you are running QlikView Enterprise, all parts of this tutorial will be relevant for you. If you are running QlikView Professional, only the first part “Working with QlikView” is relevant. For those running QlikView Analyzer, only the very first lesson may be relevant.

3) How do you start QlikView?
You start QlikView by double-clicking the QlikView icon in the QlikView group (created during the installation procedure). You will also find QlikView on the Start menu, under Programs. It is also possible to start QlikView by double-clicking the icon of a QlikView file. After Qlik

Scrum Vs Agile Methodology best explained with more details

#Life cycle of Scrum in more detail:

Scrum is part of the Agile movement. Agile is a response to the failure of the dominant software development project management paradigms (including waterfall) and borrows many principles from lean manufacturing. In 2001, 17 pioneers of similar methods met at the Snowbird Ski Resort in Utah and wrote the Agile Manifesto, a declaration of four values and twelve principles. These values and principles stand in stark contrast to the traditional Project Management Body of Knowledge (PMBOK). The Agile Manifesto placed a new emphasis on communication and collaboration, functioning software, team self-organization, and the flexibility to adapt to emerging business realities.

How Does Scrum Fit With Agile?

The Agile Manifesto doesn’t provide concrete steps. Organizations usually seek more specific methods within the Agile movement. These include Crystal Clear,

2 Top Tableau Unique Features

Tableau is one of the most popular tools in data analysis. Learning Tableau opens up many options in a data analysis career. You can download Tableau Software free version here. Get a complete understanding document on how Tableau works here. Read this post for advancing in your Tableau Career.

Unique functionality in Tableau

Tableau Software was founded on the idea that analysis and visualization should not be isolated activities but must be synergistically integrated into a visual analysis process. Visual analysis means specifically:

1). Data Exploration

Visual analysis is designed to support analytical reasoning. The goal of visual analysis is to answer important questions using data and facts. In order to support analysis, it is not enough to only access and report on the data. Analysis requires computational support throughout the process. Typical steps in the analysis include such operations as filtering to focus on items of interest, sorting to rank

Industrial IoT what GE says to improve Productivity

GE was once a top company in heavy engineering, that is, in equipment for thermal power plants, turbines, and their maintenance. GE had always believed that since it knew the materials and the physics of its jet engines and medical scanners, no one could best it in understanding those machines.

GE Industrial Internet

The aim is that it should not share its data with third parties.
GE has set up its own IoT center.
GE is in an IoT mood.
GE can improve operational efficiency by studying data from its machines, for example those located in India and Russia.

GE is targeting predictive maintenance

It improves industrial productivity.
Based on criticality, productivity will rise sharply if maintenance is carried out in time.

The Linux Commands List That Give Deep Insights on Usage

Here is a list of Linux commands that covers all the daily-use commands. Each command is shown with an example so you can get deep insights quickly.

List of Linux commands

1. What is a command shell?
A program that interprets commands.
Allows a user to execute commands by typing them manually at a terminal, or automatically in programs called shell scripts.
A shell is not an operating system. It is a way to interface with the operating system and run commands.

2. What is BASH?
BASH = Bourne Again Shell
Bash is a shell written as a free replacement for the standard Bourne Shell (/bin/sh), originally written by Steve Bourne for UNIX systems. It has all of the features of the original Bourne Shell, plus additions that make it easier to program with and use from the command line. Since it is Free Software, it has been adopted as the default shell on most Linux systems.

3. How is BASH different from the DOS command prompt?
Case Sensitivity: In Linux/UNIX, commands and filenames are

Limitations of Mobile Computing

What is Mobile Computing?

Mobile computing: the ability to use technology to wirelessly connect to and use centrally located information and/or application software through the application of small, portable, and wireless computing and communication devices and voice, data and multimedia communication standards.

Limitations

Resource constraints: battery
Interference: the quality of service (QoS)
Bandwidth: connection latency
Dynamic changes in communication environment: variations in signal power within a region, and thus link delays and connection losses
Network issues: discovery of the connection-service to destination and connection stability
Interoperability issues: the varying protocol standards
Security constraints: protocols conserving privacy of communication

Benefits of having Certified SAS Base Programmer

Why is certification beneficial?

Professionals in data management, data warehousing or a business intelligence role would find the certification ideal. In addition, recent college graduates who are inclined to solve problems logically and want to enter the data analysis field will find the certification beneficial to kick-start their careers.

Base SAS Programmer

This course is also ideal if you are a working professional or a recent graduate who is:
Aspiring to be in a fast-growing career
Looking for a more challenging position
Aiming to get into a more skillful role
Aspiring to be one of the coolest scientists of the 21st century

What is Base SAS?

It's the foundation for all SAS software. Along with an easy-to-learn, flexible programming language, you get a web-based programming interface; ready-to-use programs for data manipulation, information storage and retrieval, descriptive statistics and reporting; a centralized metadata repository; and a mac

1 Best Group on IoT Career and Jobs

All freshers and experienced software developers who wish to build their career on the Internet of Things (IoT) can join this group.

IT JOBS on the Internet of Things

Join today to get the benefit. Imagine a world where billions of objects can sense, communicate and share information, all interconnected over public or private Internet Protocol (IP) networks. These interconnected objects have data regularly collected, analyzed and used to initiate action, providing a wealth of intelligence for planning, management and decision making. This is the world of the Internet of Things (IoT). Join the group today.

The IoT concept was coined by a member of the Radio Frequency Identification (RFID) development community in 1999, and it has recently become more relevant to the practical world largely because of the growth of mobile devices, embedded and ubiquitous communication, cloud computing and data analytics. Since then, many visionaries

Top Differences Read Today Agile vs Waterfall model

Agile and Waterfall are both popular models in software development. The Agile model is far more flexible than the Waterfall model. The top differences between Waterfall and Agile give you a clear understanding of both processes.

Waterfall Model

Waterfall is the traditional model.
It has less flexibility.
It is an expensive and time-consuming model.
It is less scalable in meeting changing customer requirements.
The approach is top-down.
Starting from requirements, one has to finish all the stages through deployment to complete one cycle.
Even with a small change in requirements, one has to go through all the stages again through deployment.
The waterfall model creates idleness in resource management.

Agile Model

The agile model is excellent for the rapid deployment of small changes.
Requirements are split into small pieces and delivered in short iterations called sprints.
Less idleness in resource management.
Scope for complete team involvement.
Faster delivery makes clients happy.
You can deploy changes related to compliance or re

Career Vs Your Attitude Do These Now

75% of the American workforce is actively looking for a job at any given point in time, and 69% of those job seekers are currently employed.

1. Networking

Tapping into your network, creating and maximizing your personal brand, and researching companies in your field of interest will help you find jobs that suit your interests. Trying to find a job at random will get you nowhere. PREPARE, PLAN, and RESEARCH, and you will find a job that is meant for you.

If you have attended a get-together of friends or a social gathering of late, chances are that you came across at least half a dozen people talking about their jobs: how they are unhappy with them and looking for a change, or how difficult it seems to get one in the first place.

Job Search.

Searching for a job has become one of the most time-consuming tasks in recent times. Almost as difficult as getting one, a job search means researching, identifying, and applying for jobs and going through the functions of interviews, di

Netezza tool real usage speeds up data analytics

The IBM Netezza data warehouse appliance is easy to use and dramatically accelerates the entire analytic process. The programming interfaces and parallelization options make it straightforward to move a majority of analytics inside the appliance, regardless of whether they are being performed using tools from such vendors as IBM SPSS, SAS, or Revolution Analytics, or written in languages such as Java, Lua, Perl, Python, R or Fortran. Additionally, IBM Netezza data warehouse appliances are delivered with a built-in library of parallelized analytic functions, purpose-built for large data volumes, to kick-start and accelerate any analytic application development and deployment. The simplicity and ease of development are what truly set IBM Netezza apart. It is the first appliance of its kind – packing the power and scalability of hundreds of processing cores in an architecture ideally suited for parallel analytics. Instead of a fragmented analytics infrastructure with multiple systems

Here are 5 Skills You need to Become SAS Data Analyst

Want to know what will happen in the future? Find the most lucrative opportunities? Get insights into impending outcomes? No problem. With our SAS data mining software, you can:

Simplify data preparation. Interact with your data quickly and intuitively using dynamic charts and graphs to understand key relationships.
Quickly and easily create better models. Take the guesswork out of building models that are both stable and accurate using proven techniques and a drag-and-drop interface that's both easy to use and powerful.
Put your best models into service. Fast. Spend less time and effort scoring new data using automated, interactive processes that work in both batch and real-time environments.

The requirements vary from company to company. Here are the basic skills you need as a SAS data analyst:
Experience in SAS or R analytics
Scripting languages such as Python/JavaScript/VB Script
SQL and PL/SQL
Database knowledge in Oracle, DB2, SQL Server
Hadoop and Big

Information Architect Key Responsibilities

Gartner defines "enterprise information architecture" (EIA) as that part of the enterprise architecture process that describes, through a set of requirements, principles and models, the current state, future state, and guidance necessary to flexibly share and exchange information assets to achieve effective enterprise change.

Information Architecture

The transition from information that is isolated within applications to a flexible, comprehensive enterprise information architecture will require changes in technology, process, organizational structure and orientation. In particular, EA practitioners comfortable with technical architecture must now devote time to understanding this emerging discipline. Gartner projects that EA teams will be forced by the business to spend as much time on information architecture as they currently spend on technical architecture. Changes will also impact a range of disciplines across the organization and will

10 Hot IT Skills great demand Beyond 2020

Below are the most in-demand and highest-paying IT jobs in 2016.

Wireless Network Engineers can expect a 9.7 percent increase in pay, ranging from $108,750-$150,750, which is the highest increase in starting salary of all IT jobs in our guide! What’s going on here? The general push toward wireless offices and the growth of mobile application development, which requires testing mobile apps in a wireless environment, are just a few reasons for the increased starting pay.

CRM Business Analyst
- 2015 salary: $84,500-$116,750
- 2016 salary (est): $87,500-$126,000
- Percent increase (est): 6.1%

Data Modeler
- 2015 salary: $101,750-$145,250
- 2016 salary (est): $106,750-$155,500
- Percent increase (est): 6.2%

Business Intelligence Analyst
- 2015 salary: $108,500-$153,000
- 2016 salary (est): $113,750-$164,000
- Percent increase (avg): 6.2%

Data Scientists and Big Data Engineers are both projected to see an 8.9 percent increase in starting pay over 2015, ranging from $109,000-$153,
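The percent increases quoted for the three roles with both 2015 and 2016 ranges can be reproduced from the salary figures. The sketch below assumes the increase is measured between the midpoints of the two ranges; the post does not say how its guide computes the figure, so the midpoint method and the helper names midpoint and percent_increase are assumptions made for illustration.

```python
def midpoint(low, high):
    """Middle of a salary range."""
    return (low + high) / 2

def percent_increase(old_range, new_range):
    """Percent change between the midpoints of two salary ranges (assumed method)."""
    old_mid = midpoint(*old_range)
    new_mid = midpoint(*new_range)
    return (new_mid - old_mid) / old_mid * 100

# (2015 range, 2016 estimated range), copied from the list above.
salaries = {
    "CRM Business Analyst": ((84_500, 116_750), (87_500, 126_000)),
    "Data Modeler": ((101_750, 145_250), (106_750, 155_500)),
    "Business Intelligence Analyst": ((108_500, 153_000), (113_750, 164_000)),
}

increases = {
    role: round(percent_increase(old, new), 1)
    for role, (old, new) in salaries.items()
}

for role, pct in increases.items():
    print(f"{role}: {pct}%")
```

Under the midpoint assumption the three results round to 6.1, 6.2 and 6.2 percent, which agrees with the listed figures.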

Big data: Quiz-2 Hadoop Top Interview Questions

I hope you enjoyed my previous post. This is the second set of questions, exclusively for Big Data engineers. Read QUIZ-1.

QUESTION 1
You have submitted a job on an input file which has 400 input splits in HDFS. How many map tasks will run?
A. At most 400.
B. At least 400
C. Between 400 and 1200.
D. Between 100 and 400.
Ans: c

QUESTION 2
What is not true about LocalJobRunner mode? Choose two
A. It requires JobTracker up and running.
B. It runs Mapper and Reducer in one single process
C. It stores output in local file system
D. It allows use of Distributed Cache.
Ans: d,a

QUESTION 3
What is the command you will use to run a driver named “SalesAnalysis” whose compiled code is available in a jar file “SalesAnalytics.jar” with input data in directory “/sales/data” and output in a directory “/sales/analytics”?
A. hadoopfs  –jar  SalesAnalytics.jar  SalesAnalysis  -input  /sales/data  -output /sales/analysis
B. hadoopfs  jar  SalesAnalytics.jar