13 Cloud Computing Terminology Quick Read

Cloud computing is a vast field. As the number of services keeps growing, a developer needs to know the key terms involved.


1) AWS

Amazon Web Services, Amazon's cloud computing platform.

2) Content delivery network (CDN)

A distributed system of servers placed in multiple geographic locations. Content is delivered from a server near the user, so the user experiences it as if accessing a nearby server directly.

3) Cloud

A global network of remote servers that gives access to computing resources over the internet.

4) Cloud portability

The ability to move data from one cloud provider to another.

5) Cloudsourcing

Moving traditional IT operations to cloud computing services.

6) Cloud storage

A service that lets users store data over the internet or a private network.

7) Cloudware

Software that helps run user applications in the cloud.





8) Cluster

A group of computers connected so that they work as a single, larger system. The main benefits are high availability and load balancing.
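To make load balancing and high availability concrete, here is a minimal sketch of round-robin routing across cluster nodes. The `Cluster` class and the node names are hypothetical and only illustrate the idea; real clusters rely on dedicated load balancers and health checks.

```python
class Cluster:
    """Toy cluster: spreads requests across nodes and skips failed ones."""

    def __init__(self, nodes):
        self.nodes = list(nodes)   # hypothetical node names
        self.down = set()          # nodes currently marked as failed
        self._next = 0             # position of the next node to try

    def mark_down(self, node):
        """Record that a node has failed."""
        self.down.add(node)

    def route(self):
        """Return the next healthy node in round-robin order."""
        for _ in range(len(self.nodes)):
            node = self.nodes[self._next % len(self.nodes)]
            self._next += 1
            if node not in self.down:
                return node
        raise RuntimeError("no healthy nodes available")


cluster = Cluster(["node-a", "node-b", "node-c"])
first_three = [cluster.route() for _ in range(3)]  # load balancing
cluster.mark_down("node-b")                        # simulate a failure
after_failure = [cluster.route() for _ in range(2)]  # node-b is skipped
```

Requests rotate across all nodes while they are healthy, and the cluster keeps serving from the remaining nodes after one fails.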

9) Consumer cloud

A cloud whose provider offers services to individual users rather than to businesses.

10) Consumption-based pricing model

Cloud users pay the provider a fee based on how much of a service they consume, not on a fixed period of time.
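As a rough illustration of consumption-based pricing, the sketch below compares a usage-based bill with a flat time-based fee. The rate, the fee, and the function names are made-up assumptions for the example, not any provider's actual prices.

```python
def consumption_cost(units_used: float, rate_per_unit: float) -> float:
    """Bill only for what was actually consumed."""
    return round(units_used * rate_per_unit, 2)


def flat_cost(months: int, monthly_fee: float) -> float:
    """Time-based billing: the same fee regardless of usage."""
    return round(months * monthly_fee, 2)


# Hypothetical numbers: 120 GB stored at 0.02 per GB versus a flat fee of 10 per month.
usage_bill = consumption_cost(units_used=120, rate_per_unit=0.02)
flat_bill = flat_cost(months=1, monthly_fee=10.0)
print(f"consumption-based: {usage_bill}, time-based: {flat_bill}")
```

Under the consumption model the bill drops to zero when nothing is used, while the flat fee stays the same.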

11) Content Management Interoperability Services (CMIS)

An open standard for accessing and managing documents in content management systems using web protocols.

12) Customer self-service

A feature that lets users manage cloud computing services themselves through web services or APIs.

13) Disruptive technology

A technology that uses innovative methods to offer users new benefits, often displacing established approaches.
