Featured post

4 Layers of AWS Architecture a Quick Answer

I have collected real interview questions on AWS key architecture components. Those are S3, EC2, SQS, and SimpleDB. AWS is one of the most popular skills in the area of Cloud computing. Many companies are recruiting software developers to work on cloud computing.

AWS Key Architecture Components AWS is the top cloud platform. The knowledge of this helpful to learn other cloud platforms. Below are the questions asked in interviews recently.
What are the components involved in AWS?Amazon S3.With this, one can retrieve the key information which is occupied in creating cloud structural design, and the amount of produced information also can be stored in this component that is the consequence of the key specified.Amazon EC2. Helpful to run a large distributed system on the Hadoop cluster. Automatic parallelization and job scheduling can be achieved by this component.Amazon SQS. This component acts as a mediator between different controllers. Also worn for cushioning requirements those are obt…

Kafka Stream Processing Quick Flow Chart for Dummies

The difference between Stream processing Vs Batch processing and role of Kafka I can tell you using simple example.  I have used Mainframe and Youtube for this.


In Mainframe, you remember the data processing majority of it is based on Batch processing. That means the whole collected data processed by some scheduled jobs. 



The streaming data on the other hand is different (Youtube Live). When the data comes into Queue the data will be processed. So, organizations no need to wait until the batch is completed. The organizations will get results Quickly.


Stream Data Flow Before You Move on to Kafka


Batch Vs Stream Data Processing Role of Kafka

*Here are the key points to remember: 
If you need to report on or take action immediately as data arrives, stream processing is a good approach.


*If you want to Carry detailed analysis
If you need to perform in-depth analysis or are compiling a large repository of data for later analysis, a stream-processing approach may not be a good fit.


The Real-time Uses of Stream Data


  • Credit card fraud—A credit card owner may not notice a card has been stolen, but by reviewing purchases as they happen against established patterns (location, general spending habits), you may be able to detect a stolen credit card and alert the owner.

  • Intrusion detectionAnalyzing application log files after a breach has occurred may be helpful to prevent future attacks or to improve security, but the ability to monitor aberrant behavior in real-time is critical.

  • A large race, such as the New York City Marathon—Almost all runners will have a chip on their shoe, and when runners pass sensors along the course, you can use that information to track the runners’ positions. By using the sensor data, you can determine the leaders, spot potential cheating, and detect whether a runner is potentially having problems.

  • The financial industryThe ability to track market prices and direction in real-time is essential for brokers and consumers to make effective decisions about when to sell or buy.


Kafka Architecture: Subscribe and Publishing Broker



Kafka Architecture

How Kafka Process Stream Data

  • Kafka is a publish/subscribe system, but it would be more precise to say that Kafka acts as a message broker. A broker is an intermediary that brings together two parties that don’t necessarily know each other for a mutually beneficial exchange or deal.

  • Kafka stores messages in topics and retrieves messages from topics. There’s no direct connection between the producers and the consumers of the messages. Additionally, Kafka doesn’t keep any state regarding the producers or consumers. It acts solely as a message clearinghouse.

  • The underlying technology of a Kafka topic is a log, which is a file that Kafka appends incoming records to. To help manage a load of messages coming into a topic, Kafka uses partitions. 

  • The use of partitions is to bring data located on different machines together on the same server. We’ll discuss partitions in detail shortly.

How Kafka Writes Data in the Logs

  1. Kafka Writes data in the logs
  2. Only it appends to the first entered record. It will not delete records 

Role of ZooKeeper in Kafka

  • ZooKeeper is a centralized service for maintaining configuration information, naming, providing distributed synchronization, and providing group services. 
  • All of these kinds of services are used in some form or another by distributed applications.

Comments

Popular posts from this blog

Hyperledger Fabric: 20 Real Interview Questions

7 Ideas to Improve Your Resume Before You Share

15 awesome Java Interview Questions to know