
How to Write Regular Expressions Quickly in Python, with Examples

The purpose of a regular expression is to find a matching string inside another string. In the simplest case the answer is a yes or a no: the pattern either matches or it does not. I am not going to explain the theory of how to play tennis here. My intention is that, if you just follow these ideas, you can play tennis today.

Python supports regular expressions through a dedicated library. I have shared the best examples here for quick reference.
 

Python Regular Expressions

  1. What is a regular expression
  2. How Python supports them
  3. Best examples


1. What is a regular expression


>>> haystack = 'My phone number is 213-867-5309.' 
>>> '213-867-5309' in haystack
True


This plain substring check is the most fundamental kind of matching, and it only finds one exact number. The real use of regular expressions starts here: finding out whether the main string contains any valid phone number.
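As a rough illustration of that idea, the sketch below searches for anything shaped like a phone number instead of one specific number. The pattern (three digits, three digits, four digits, separated by hyphens) and the helper name contains_phone_number are assumptions made for this example; real phone formats vary.

```python
import re

# Assumed format for this sketch: NNN-NNN-NNNN.
PHONE_PATTERN = re.compile(r'\d{3}-\d{3}-\d{4}')

def contains_phone_number(text):
    """Return True if text contains something shaped like a phone number."""
    return PHONE_PATTERN.search(text) is not None

haystack = 'My phone number is 213-867-5309.'
print(contains_phone_number(haystack))           # True
print(contains_phone_number('No number here.'))  # False
```

Unlike the in check, this matches any number that fits the shape, not only 213-867-5309.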


Regular expressions are also called regexes.

2. Why do we need regexes

  1. Data mining - to find out whether the required data is present or not, and to pull it out
  2. Data validation - to check whether a received string is valid or not
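The two uses above might look something like this in practice. This is only a sketch: the NNN-NNN-NNNN format, the sample text, and the function names is_valid_phone and extract_phones are made up for illustration.

```python
import re

# Hypothetical format for this sketch: NNN-NNN-NNNN.
PHONE = re.compile(r'\d{3}-\d{3}-\d{4}')

def is_valid_phone(value):
    """Data validation: the whole string must match the pattern."""
    return PHONE.fullmatch(value) is not None

def extract_phones(text):
    """Data mining: collect every phone-shaped substring in the text."""
    return PHONE.findall(text)

print(is_valid_phone('213-867-5309'))
print(is_valid_phone('call 213-867-5309'))
print(extract_phones('Home: 213-867-5309, work: 310-555-0142.'))
```

Validation uses fullmatch so that extra text around the number counts as invalid, while mining uses findall so that every occurrence is returned.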

Python support


Python has its own regular expression library, called re. All you need to do is import it.

>>> import re


When data matches and when it does not

  1. If a match is found, re.search returns a match object that holds the matched string
  2. If there is no match, it returns None
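In Python, re.search returns a match object on success and None otherwise, so code normally checks the result before using it. A minimal sketch of that check follows; the helper name first_match is made up for this example.

```python
import re

def first_match(pattern, text):
    """Return the matched text, or None when the pattern is not found."""
    match = re.search(pattern, text)
    if match is None:
        return None
    return match.group()

sentence = 'The quick brown fox jumped...'
print(first_match(r'fox', sentence))  # fox
print(first_match(r'cat', sentence))  # None
```

Calling .group() directly on a failed search would raise AttributeError, which is why the None check comes first.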


Example for regex


>>> import re
>>> re.search(r'fox', 'The quick brown fox jumped...')
<re.Match object; span=(16, 19), match='fox'>

Note: The result is a match object. It shows the matched string, 'fox', and its position (span) in the text.


Matching string


>>> match = re.search(r'fox', 'The quick brown fox jumped...')
>>> match.group()
'fox'

Note: match.group() returns the matched string, 'fox'.



Multiple matches

>>> import re
>>> re.findall(r'o', 'The quick brown fox jumped...')
['o', 'o']

Note: re.findall returns a list of all the matching strings.
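When the position of each match matters as well as its text, re.finditer is one option: it yields a match object for every hit. The sketch below reuses the same sentence; the helper name find_positions is an assumption for this example.

```python
import re

def find_positions(pattern, text):
    """Return (matched_text, start_index) for every match in text."""
    return [(m.group(), m.start()) for m in re.finditer(pattern, text)]

text = 'The quick brown fox jumped...'
print(find_positions(r'o', text))  # [('o', 12), ('o', 17)]
```

The two results correspond to the 'o' in 'brown' and the 'o' in 'fox'.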
