Hadoop 2.x vs 3.x: Top Differences

In many interviews, the first question for Hadoop developers is: what are the differences between Hadoop 2 and Hadoop 3? You already know that Hadoop has moved on from version 1.

Hadoop features


The list below summarizes the differences. I have presented the Hadoop details as questions and answers so that beginners can follow them.

Hadoop 2.x Vs 3.x


[Image: Hadoop 2 vs. 3 comparison]
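The major change in Hadoop 3 is reduced storage overhead: erasure coding can take the place of 3x replication, cutting the extra storage from 200% to about 50%. So, you may be curious about how Hadoop 3 manages storage.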
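To make the storage difference concrete, here is a rough calculation. It is only a sketch under two assumptions: Hadoop 2 uses its default replication factor of 3, and Hadoop 3 uses erasure coding with a Reed-Solomon layout of 6 data blocks plus 3 parity blocks. The function name `storage_overhead` is made up for this illustration and is not part of Hadoop.

```python
def storage_overhead(raw_units: int, stored_units: int) -> float:
    """Return the extra storage as a fraction of the raw data size."""
    return (stored_units - raw_units) / raw_units


# Assumption: Hadoop 2 default, every block is stored 3 times.
replication = storage_overhead(raw_units=1, stored_units=3)

# Assumption: Hadoop 3 erasure coding with 6 data + 3 parity blocks.
erasure_coding = storage_overhead(raw_units=6, stored_units=9)

print(f"3x replication overhead: {replication:.0%}")      # 200%
print(f"Erasure coding overhead:  {erasure_coding:.0%}")  # 50%
```

So, under these assumptions, 6 blocks of data occupy 18 blocks with replication but only 9 blocks with erasure coding.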

My suggestion is that you first go through the list of differences and then check the references section to learn more about Hadoop storage management.

References
