Featured Post

Scraping Website: How to Write a Script in Python

Image
Here's a python script that you can use as a model to scrape a website. Python script The below logic uses BeautifulSoup Package for web scraping. import requests from bs4 import BeautifulSoup url = 'https://www.example.com' response = requests.get(url) soup = BeautifulSoup(response.text, 'html.parser') # Print the title of the webpage print(soup.title.text) # Print all the links in the webpage for link in soup.find_all('a'):     print(link.get('href')) In this script, we first import the Requests and Beautiful Soup libraries. We then define the URL we want to scrape and use the Requests library to send a GET request to that URL. We then pass the response text to Beautiful Soup to parse the HTML contents of the webpage. We then use Beautiful Soup to extract the title of the webpage and print it to the console. We also use a for loop to find all the links in the webpage and print their href attributes to the console. This is just a basic example, but

These Lovely Skills You Need to Enter as Data Architect

We can analyze data quickly now. The data can be any size. The basic skills you need are R language skills and Machine learning for analyzing the data.

MapReduce Techniques

The MapReduce techniques and parallel processing in Hadoop allow us to cheaply and efficiently implement MapReduce on Internet scale problems. We use SQL like tools Pig and Hive.

NoSQL

We analyze so-called NoSQL storage solutions exemplified by HBase for their critical features: speed of reads and writes, data consistency, and ability to scale to extreme volumes.

Data Architect Requirements

  1. Data architects develop, analyze and administer data for businesses, schools and other organizations.
  2. They need sophisticated design and development skills to collect data and translate it for use in computer applications and systems. Most data architects have a bachelor's degree in information technology (IT), as well as years of experience in the IT field.
  3. Experience with programs such as Microsoft Excel, Access, SQL, Business Objects software, XML, SOA, ETL, BI, ERP and Agile; computer programming and data modeling skills*
  4. Degree in Engineering/Computer Science is desirable. Read more.

Comments

Popular posts from this blog

7 AWS Interview Questions asked in Infosys, TCS

How to Decode TLV Quickly