Posts

Showing posts with the label remove duplicates

Featured Post

How to Check Column Nulls and Replace: Pandas

Image
Here is a post that shows how to count Nulls and replace them with the value you want in the Pandas Dataframe. We have explained the process in two steps - Counting and Replacing the Null values. Count null values (column-wise) in Pandas ## count null values column-wise null_counts = df.isnull(). sum() print(null_counts) ``` Output: ``` Column1    1 Column2    1 Column3    5 dtype: int64 ``` In the above code, we first create a sample Pandas DataFrame `df` with some null values. Then, we use the `isnull()` function to create a DataFrame of the same shape as `df`, where each element is a boolean value indicating whether that element is null or not. Finally, we use the `sum()` function to count the number of null values in each column of the resulting DataFrame. The output shows the count of null values column-wise. to count null values column-wise: ``` df.isnull().sum() ``` ##Code snippet to count null values row-wise: ``` df.isnull().sum(axis=1) ``` In the above code, `df` is the Panda

These Tips Helpful to Remove Python List and Dictionary Duplicates

Image
In this post, I have shared top ideas to remove duplicates from the list. Those are with Append and Dictionary. 1. How to Remove Duplicates with Append # Here is a list with duplicates list_with_duplicates = [1,2,3,12,1,2,3,4,5,6,1,2,3,7,8,9] It is simple if you follow the first-approach - brute force approach: list_without_duplicates = [] for pd in list_with_duplicates:   if pd not in list_without_duplicates:       list_without_duplicates.append(pd) print(list_without_duplicates) Result: [1, 2, 3, 12, 4, 5, 6, 7, 8, 9] This method has performance issues when the list is bigger in size.  Real-time. 2. How to Remove Duplicates with Dictionary # Here is you can convert a list to a dictionary dict_without_duplicates = dict(zip(list_with_duplicates, list_with_duplicates)) print(dictionary_without_duplicates) Result: {1: 1, 2: 2, 3: 3, 12: 12, 4: 4, 5: 5, 6: 6, 7: 7, 8: 8, 9: 9} Real-time. Once again, this works and has the advantage of taking less space than duplicating the entire list.