Posts

Showing posts with the label logic

Featured Post

8 Ways to Optimize AWS Glue Jobs in a Nutshell

Improving the performance of AWS Glue jobs involves several strategies that target different aspects of the ETL (Extract, Transform, Load) process. Here are some key practices.

1. Optimize Job Scripts

Partitioning: Ensure your data is properly partitioned. Partitioning divides your data into manageable chunks, allowing parallel processing and reducing the amount of data scanned.

Filtering: Apply pushdown predicates to filter data early in the ETL process, reducing the amount of data processed downstream.

Compression: Use compressed file formats (e.g., Parquet, ORC) for your data sources and sinks. These formats not only reduce storage costs but also improve I/O performance.

Optimize Transformations: Minimize the number of transformations and actions in your script. Combine transformations where possible and use DataFrame APIs, which are optimized for performance.

2. Use Appropriate Data Formats

Parquet and ORC: These columnar formats are efficient for storage and querying, signif
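To make the partitioning and filtering idea concrete, here is a minimal plain-Python sketch of partition pruning. It is not AWS Glue code: the Hive-style paths and the helper names `parse_partition` and `prune_partitions` are hypothetical and only illustrate why a filter on partition keys reduces the number of files that need to be read.

```python
def parse_partition(path):
    """Extract key=value segments from a Hive-style path (hypothetical layout)."""
    return dict(seg.split("=", 1) for seg in path.split("/") if "=" in seg)


def prune_partitions(paths, **wanted):
    """Keep only paths whose partition values match every requested key."""
    return [
        p for p in paths
        if all(parse_partition(p).get(k) == v for k, v in wanted.items())
    ]


# Hypothetical file listing for a table partitioned by year and month
paths = [
    "sales/year=2023/month=12/part-0.parquet",
    "sales/year=2024/month=01/part-0.parquet",
    "sales/year=2024/month=02/part-0.parquet",
]

# Only matching partitions would be scanned; the others are skipped entirely
selected = prune_partitions(paths, year="2024", month="01")
print(selected)
```

In a real job the engine performs this pruning itself when the filter is expressed on the partition columns; the sketch only shows the effect.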

How to Write ETL Logic in Python: Sample Code to Practice

Here's example Python code that uses the mysql-connector library to connect to a MySQL database, extract data from a table, transform it, and load it as a JSON file.

Python ETL Sample Code

    import mysql.connector
    import json

    # Connect to the MySQL database
    cnx = mysql.connector.connect(user='username', password='password',
                                  host='localhost',
                                  database='database_name')

    # Define a cursor to execute SQL queries
    cursor = cnx.cursor()

    # Define the SQL query to extract data
    query = ("SELECT column1, column2, column3 FROM table_name")

    # Execute the SQL query
    cursor.execute(query)

    # Fetch all rows from the result set
    rows = cursor.fetchall()

    # Transform the rows into a list of dictionaries
    result = []
    for row in rows:
        result.append({'column1': row[0], 'column2': row[1], 'column3': row[2]})

    # Save the result as a JSON file
    with open('ou
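For practice without a MySQL server, the same extract, transform, and load flow can be sketched with Python's built-in sqlite3 module. This is a swapped-in stand-in, not the post's MySQL code: the in-memory table, its sample rows, the `run_etl` helper, and the output file name `etl_sketch.json` are all assumptions made for illustration.

```python
import json
import os
import sqlite3
import tempfile


def run_etl(conn, out_path):
    """Extract rows, turn them into dictionaries, and write them as JSON."""
    cursor = conn.cursor()
    # Extract: same query shape as in the post
    cursor.execute("SELECT column1, column2, column3 FROM table_name")
    # Transform: pair each value with its column name
    columns = [desc[0] for desc in cursor.description]
    records = [dict(zip(columns, row)) for row in cursor.fetchall()]
    # Load: save the records as a JSON file
    with open(out_path, "w") as f:
        json.dump(records, f, indent=2)
    return records


# Hypothetical sample data in an in-memory SQLite database
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE table_name (column1 INTEGER, column2 TEXT, column3 REAL)")
conn.executemany(
    "INSERT INTO table_name VALUES (?, ?, ?)",
    [(1, "a", 1.5), (2, "b", 2.5)],
)

out_path = os.path.join(tempfile.gettempdir(), "etl_sketch.json")
records = run_etl(conn, out_path)
conn.close()
print(records)
```

Reading the column names from `cursor.description` avoids hard-coding index positions such as `row[0]`, so the transform step keeps working if the SELECT list changes.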

Python: How to Write Logic to Print Triangle

Here's an example of printing a number triangle in Python. This is a common interview question, so it is worth practicing.

How to Print Triangle

Logic to Print Triangle

    max_rows = 8
    for x in range(1, max_rows + 1):
        # Print the numbers 1..x on one line
        for y in range(1, x + 1):
            print(y, end=" ")
        # End the current row
        print()

Result

    1
    1 2
    1 2 3
    1 2 3 4
    1 2 3 4 5
    1 2 3 4 5 6
    1 2 3 4 5 6 7
    1 2 3 4 5 6 7 8

Also read: Python while loop, Python for loop example
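As a variation for practice, the triangle logic can be wrapped in a function that returns the rows instead of printing them directly, which makes it easier to test. This is a minimal Python 3 sketch; the function name `triangle_rows` is an assumption and does not come from the original post.

```python
def triangle_rows(height):
    """Return the triangle as a list of strings, one per row."""
    return [
        " ".join(str(n) for n in range(1, row + 1))
        for row in range(1, height + 1)
    ]


rows = triangle_rows(4)
print("\n".join(rows))
```

Separating the row-building from the printing keeps the nested-loop idea intact while letting you check the result with a simple comparison.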