Featured Post

8 Ways to Optimize AWS Glue Jobs in a Nutshell

Image
  Improving the performance of AWS Glue jobs involves several strategies that target different aspects of the ETL (Extract, Transform, Load) process. Here are some key practices. 1. Optimize Job Scripts Partitioning : Ensure your data is properly partitioned. Partitioning divides your data into manageable chunks, allowing parallel processing and reducing the amount of data scanned. Filtering : Apply pushdown predicates to filter data early in the ETL process, reducing the amount of data processed downstream. Compression : Use compressed file formats (e.g., Parquet, ORC) for your data sources and sinks. These formats not only reduce storage costs but also improve I/O performance. Optimize Transformations : Minimize the number of transformations and actions in your script. Combine transformations where possible and use DataFrame APIs which are optimized for performance. 2. Use Appropriate Data Formats Parquet and ORC : These columnar formats are efficient for storage and querying, signif

How to Search for Single CHAR in Python Using Regular-expression

Here is the logic for searching single CHAR using regular expression(Regex). For instance, we use wildcards to search for anything on our computers. The Regex in Python works similarly.

Regular Expression


Regular expression

People use asterisk * for searching any document. For instance, if you type *.pdf, it returns all the pdfs available in the location (where you are conducting your search). Similar way, in Python, you can search using regular expressions.

Import Regex 

The first thing you need to do is import 're' if you want to work with regular expressions.


Example program: search for single CHAR

To match any single character, you can use [….]. Below, you will find an example to search for: 'l' or 'a' or 'b'

import re
pattern = r'[lab]' sequence = 'we love python' obj = re.search(pattern,sequence) if obj: print("We found a match here @",obj.group()) else: print("Sorry no match found.")

The word 'love' has the letter 'l'. Hence, a match is found.

Comments

Popular posts from this blog

How to Fix datetime Import Error in Python Quickly

Explained Ideal Structure of Python Class

How to Check Kafka Available Brokers