Posts

Showing posts with the label asked interview questions

Featured Post

8 Ways to Optimize AWS Glue Jobs in a Nutshell

Image
  Improving the performance of AWS Glue jobs involves several strategies that target different aspects of the ETL (Extract, Transform, Load) process. Here are some key practices. 1. Optimize Job Scripts Partitioning : Ensure your data is properly partitioned. Partitioning divides your data into manageable chunks, allowing parallel processing and reducing the amount of data scanned. Filtering : Apply pushdown predicates to filter data early in the ETL process, reducing the amount of data processed downstream. Compression : Use compressed file formats (e.g., Parquet, ORC) for your data sources and sinks. These formats not only reduce storage costs but also improve I/O performance. Optimize Transformations : Minimize the number of transformations and actions in your script. Combine transformations where possible and use DataFrame APIs which are optimized for performance. 2. Use Appropriate Data Formats Parquet and ORC : These columnar formats are efficient for storage and querying, signif

5 AWS Tricky Interview Questions

Image
Below are the top tricky questions asked in AWS interviews 1) How do you store objects in Reduced Redundancy Storage in S3? Specify REDUCED_REDUNDANCY tags in the HTML code. Specify REDUCED_MIRRORING on a PUT request using an optional header. Specify LOW_REPLICATION on a PUT request using an optional header. Specify LOW_REPLICATION on a GET request for objects. Specify REDUCED_REDUNDANCY on a PUT request using an optional header. 2) Which is required to log on to the AWS management console on a mobile device? SSO alias SiteMinder Gateway address IAM token AWS account 3) How do you receive notifications based on the value of a metric being above or below a stated threshold in Amazon CloudWatch? Set Amazon SmartAlert. Log on through the Amazon console. Create an Amazon CloudWatch threshold dashboard. Create an Amazon CloudWatch alarm. Create a DescribeAlarmHistory API. 4) Scenario-You are testing Oracle, SQL Server, and Mongo DB databases in your AWS account to determine which database t

12 Top Hadoop Security Interview Questions

Image
Here are the interview questions on Hadoop security. Useful to learn for your data science project and for interviews.  12  Hadoop Security Interview Questions How does Hadoop security work? How do you enforce access control to your data? How can you control who is authorized to access, modify, and stop Hadoop MapReduce jobs? How do you get your (insert application here) to integrate with Hadoop security controls? How do you enforce authentication for users on all types of Hadoop clients (for example, web consoles and processes)? How can you ensure that rogue services don't impersonate real services (for example, rogue Task Trackers and tasks, unauthorized processes presenting block IDs to Data Nodes to get access to data blocks, and so on)? Can you tie in your organization's Lightweight Directory Access Protocol (LDAP) directory and user groups to Hadoop's permissions structure? Can you encrypt data in transit in Hadoop? Can your data be encrypted at rest on HDFS? How can

5 top data warehousing questions to read before interview

Image
Hey, welcome to data analytics. Here I have given a few questions and answers. The below five questions helps you to understand very basics on data analytics. These are very basic questions. But, for laymen, they can understand easily and also good for interviews. Stocksnap.io Questions on Business Intelligence What is Business Intelligence?  It is a well-established process in the business world whereby decision makers integrate strategic thinking with IT  What is Web Analytics?  It is the collection, analysis, and reporting of Web site usage by visitors and customers of a website in order to better understand the effectiveness of online initiatives and other changes to the website in an objective, the scientific way through experimentation, testing, and measurement.  What is academic analytics?  This is the area analytics applied in academia  What is action analytics?  This is the area of analytics to produce actionable intelligence  What is the process of