3.3 KiB
Statistical Operations on Arrays
Statistics involves collecting data, analyzing it, and drawing conclusions from the gathered information.
NumPy provides powerful statistical functions to perform efficient data analysis on arrays, including minimum
, maximum
, mean
, median
, variance
, standard deviation
, and more.
Minimum
In NumPy, the minimum value of an array is the smallest element present.
The smallest element of an array is calculated using the np.min()
function.
Code
import numpy as np
array = np.array([100,20,300,400])
#Calculating the minimum
result = np.min(array)
print("Minimum :", result)
Output
Minimum : 20
Maximum
In NumPy, the maximum value of an array is the largest element present.
The largest element of an array is calculated using the np.max()
function.
Code
import numpy as np
array = np.array([100,20,300,400])
#Calculating the maximum
result = np.max(array)
print("Maximum :", result)
Output
Maximum : 400
Mean
The mean value of a NumPy array is the average of all its elements.
It is calculated by summing all the elements and then dividing by the total number of elements.
The mean of an array is calculated using the np.mean()
function.
Code
import numpy as np
array = np.array([10,20,30,40])
#Calculating the mean
result = np.mean(array)
print("Mean :", result)
Output
Mean : 25.0
Median
The median value of a NumPy array is the middle value in a sorted array.
It separates the higher half of the data from the lower half.
The median of an array is calculated using the np.median()
function.
It is important to note that:
- If the number of elements is
odd
, the median is the middle element. - If the number of elements is
even
, the median is the average of the two middle elements.
Code
import numpy as np
#The number of elements is odd
array = np.array([5,6,7,8,9])
#Calculating the median
result = np.median(array)
print("Median :", result)
Output
Median : 7.0
Code
import numpy as np
#The number of elements is even
array = np.array([1,2,3,4,5,6])
#Calculating the median
result = np.median(array)
print("Median :", result)
Output
Median : 3.5
Variance
Variance in a NumPy array measures the spread or dispersion of data points.
Calculated as the average of the squared differences from the mean.
The variance of an array is calculated using the np.var()
function.
Code
import numpy as np
array = np.array([10,70,80,50,30])
#Calculating the variance
result = np.var(array)
print("Variance :", result)
Output
Variance : 656.0
Standard Deviation
The standard deviation of a NumPy array measures the amount of variation or dispersion of the elements in the array.
It is calculated as the square root of the average of the squared differences from the mean, providing insight into how spread out the values are around the mean.
The standard deviation of an array is calculated using the np.std()
function.
Code
import numpy as np
array = np.array([25,30,40,55,75,100])
#Calculating the standard deviation
result = np.std(array)
print("Standard Deviation :", result)
Output
Standard Deviation : 26.365486699260625