Data Lake Repository: What You Need to Know About It

In a data lake, data is stored internally in a repository. You can think of the stored format as a blob. The data in a data lake does not have a particular schema or format.

Data lake repository example
Photo Credit: Srini

SQL Database

  • Take a traditional database: its design and schema must be defined before you enter any data. A data lake imposes no such format. It works like a raw dump.
  • You can send this dump to a Hadoop repository for data analysis. The repository can grow incrementally, so you can build up a large database over time.
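The contrast between the two bullets above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `orders` table, the sample records, and the in-memory `lake` list (a stand-in for a real blob store) are all hypothetical and not part of any particular data lake product.

```python
import json
import sqlite3

# Traditional database: the schema must exist before any row is inserted.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders ("
    "id INTEGER PRIMARY KEY, customer TEXT NOT NULL, amount REAL NOT NULL)"
)
conn.execute(
    "INSERT INTO orders (id, customer, amount) VALUES (?, ?, ?)",
    (1, "alice", 25.0),
)

# A record that does not fit the predefined schema is rejected.
try:
    conn.execute(
        "INSERT INTO orders (id, customer, clicks) VALUES (?, ?, ?)",
        (2, "bob", 7),
    )
    rejected = False
except sqlite3.OperationalError:
    rejected = True

# Data lake style: raw records are dumped as-is, whatever their shape.
lake = []  # hypothetical stand-in for a blob store


def dump(record):
    """Append one raw record to the lake without checking its structure."""
    lake.append(json.dumps(record))


dump({"id": 1, "customer": "alice", "amount": 25.0})
dump({"id": 2, "customer": "bob", "clicks": 7})
dump({"sensor": "t-17", "reading": [20.1, 20.4]})

# Structure is applied only when the data is read back for analysis.
amounts = [rec["amount"] for rec in map(json.loads, lake) if "amount" in rec]
```

The database refuses the second insert because the `clicks` column was never declared, while the lake accepts all three records and leaves the question of structure to whoever reads them later.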

Data Lake Vs. Hadoop

A data lake is a dump of data with no format. Several pre-processing steps are required before the data is sent for analytics. One of them is data security and encryption. These steps must be completed before you send your data to the Hadoop repository.

In practice, Hadoop data analytics needs a lot of other pre-processing before the data can be used further.
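As a rough illustration of a security-oriented pre-processing step, the sketch below masks sensitive fields in raw records before they would be handed to an analytics repository. Note the assumptions: it uses keyed hashing (HMAC-SHA256) from the Python standard library, which is pseudonymization rather than the encryption mentioned above, and the field names, key, and sample records are hypothetical. Loading into Hadoop itself is not shown.

```python
import hashlib
import hmac
import json

# Hypothetical values for illustration only; a real key belongs in a secret store.
SECRET_KEY = b"replace-with-a-managed-secret"
SENSITIVE_FIELDS = {"email", "phone"}


def protect(record):
    """Return a copy of the record with sensitive fields replaced by keyed hashes."""
    cleaned = {}
    for field, value in record.items():
        if field in SENSITIVE_FIELDS:
            digest = hmac.new(SECRET_KEY, str(value).encode("utf-8"), hashlib.sha256)
            cleaned[field] = digest.hexdigest()
        else:
            cleaned[field] = value
    return cleaned


# Raw, format-free lines as they might sit in the data lake.
raw_lines = [
    '{"id": 1, "email": "a@example.com", "amount": 25.0}',
    '{"id": 2, "phone": "555-0100"}',
]

# Pre-process every record before it leaves the lake for analysis.
prepared = [protect(json.loads(line)) for line in raw_lines]
```

Because the hash is keyed and deterministic, the same email always maps to the same token, so records can still be joined or counted during analysis without exposing the original value.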
