Featured Post

8 Ways to Optimize AWS Glue Jobs in a Nutshell

Image
  Improving the performance of AWS Glue jobs involves several strategies that target different aspects of the ETL (Extract, Transform, Load) process. Here are some key practices. 1. Optimize Job Scripts Partitioning : Ensure your data is properly partitioned. Partitioning divides your data into manageable chunks, allowing parallel processing and reducing the amount of data scanned. Filtering : Apply pushdown predicates to filter data early in the ETL process, reducing the amount of data processed downstream. Compression : Use compressed file formats (e.g., Parquet, ORC) for your data sources and sinks. These formats not only reduce storage costs but also improve I/O performance. Optimize Transformations : Minimize the number of transformations and actions in your script. Combine transformations where possible and use DataFrame APIs which are optimized for performance. 2. Use Appropriate Data Formats Parquet and ORC : These columnar formats are efficient for storage and querying, signif

The Quick and Easy Way to Analyze Numpy Arrays

The quickest and easiest way to analyze NumPy arrays is by using the numpy.array() method. This method allows you to quickly and easily analyze the values contained in a numpy array. This method can also be used to find the sum, mean, standard deviation, max, min, and other useful analysis of the value contained within a numpy array.


NumPy Python

Sum

You can find the sum of Numpy arrays using the np.sum() function. 
For example: 

import numpy as np 
a = np.array([1,2,3,4,5]) 
b = np.array([6,7,8,9,10]) 
result = np.sum([a,b]) 
print(result) 
# Output will be 55


Mean


You can find the mean of a Numpy array using the np.mean() function. This function takes in an array as an argument and returns the mean of all the values in the array. 

For example, the mean of a Numpy array of [1,2,3,4,5] would be 
result = np.mean([1,2,3,4,5]) 
print(result) 

#Output: 3.0


Standard Deviation


To find the standard deviation of a Numpy array, you can use the NumPy std() function. This function takes in an array as a parameter and returns the standard deviation of that given array. 
For example: 
import numpy as np 

arr = np.array([1, 2, 3, 4, 5]) 
std_dev = np.std(arr) 
print(std_dev) 

# Output: 1.5811388300841898

Max


To find the max Numpy Array, you can use the max() function from the Numpy library. 
For example, to find the max value in an array of numbers: 

import numpy as np 
arr = np.array([1, 3, 4, 6, 10]) 
print(np.max(arr)) 
This would output 10, which is the max value of the array.


Min


The easiest way to find the minimum value of a Numpy array is with the np.min() function. This function takes in a Numpy array and returns the minimum value in the array. 

Example: 
import numpy as np 
a = np.array([1, 5, 10, 100, 200]) 
min_val = np.min(a)
 print(min_val) 
# Output: 1

Comments

Popular posts from this blog

How to Fix datetime Import Error in Python Quickly

How to Check Kafka Available Brokers

SQL Query: 3 Methods for Calculating Cumulative SUM