Featured post

The Ultimate Cheat Sheet On Hadoop

Top 20 frequently asked questions to test your Hadoop knowledge given in the below Hadoop cheat sheet. Try finding your own answers and match the answers given here.




Question #1 

You have written a MapReduce job that will process 500 million input records and generate 500 million key-value pairs. The data is not uniformly distributed. Your MapReduce job will create a significant amount of intermediate data that it needs to transfer between mappers and reducers which is a potential bottleneck. A custom implementation of which of the following interfaces is most likely to reduce the amount of intermediate data transferred across the network?



A. Writable
B. WritableComparable
C. InputFormat
D. OutputFormat
E. Combiner
F. Partitioner
Ans: e




Question #2 

Where is Hive metastore stored by default ?


A. In HDFS
B. In client machine in the form of a flat file.
C. In client machine in a derby database
D. In lib directory of HADOOP_HOME, and requires HADOOP_CLASSPATH to be modified.
Ans: c




Question…

SAS career these are different job roles

What is SAS?
Originating in 1966 at North Carolina State University, SAS is a proprietary 4th generation programming language specifically designed for data analysis. Data analysis includes data management, ETL, descriptive statistics, plots and graphics, inferential statistics, data mining, forecasting, etc.. In addition to the SAS programming language, SAS Institute, its parent company, also sells solutions built on top of the SAS language.

SAS career
Many people assumptions about SAS: Some people confuse SAS with the common database query language called “SQL”, thinking that both languages manage data in some way. While SAS may be used for database querying (reading/writing data to databases), this is only a tiny fraction of SAS’s capabilities. To give some scope, the entire SQL language is available in SAS as but one “procedure” (think of a procedure as a bundle of functionality).
SKILL Set: What It Is: Pronounced “sass,” this software helps workers perform a variety of tasks, including business forecasting, project management and statistical analysis. 
Different roles in SAS Jobs:
  1. Business analyst
  2. Clinical data programmer
  3. Data Analyst
  4. Data Quality Steward
  5. Data Scientist
  6. Data warehouse architect
  7. Database administrator
  8. Database programmer
  9. Developer
  10. ETL specialist
  11. Financial analyst
  12. IT Manager
  13. Marketing analyst
  14. Platform Administrator
  15. Programmer
  16. Project Manager
  17. Quality analyst
  18. Report programmer
  19. Reporting Analyst
  20. Securities analyst
  21. Statistician
  22. Statistics programmer
  23. Systems/network programmer

Comments

Popular posts from this blog

Hadoop fs (File System) Commands List

Hyperledger Fabric: 20 Real Interview Questions

AWS Vs Azure Load Balancers Top Insights

4 Important Skills You Need for Data Scientists