Keen Learner and Exp...ย โขย 2m
Day 13 of learning data science as a beginner. Topic: data cleaning and preprocessing In most of the real world applications we rarely get almost perfect data most of the time we get a raw data dump which needs to be cleaned and preprocessed before being made use of (funfact: data scientist put 80% of their time in cleaning and preprocessing the data) Pandas not only allows us to analyse the data but also helps us to clean and process the data some of the most commonly used pandas data preprocessing functions are .isnull: checks whether there are any missing values in the data set or not .dropna: deletes all the rows containing any missing value .fillna: fills the missing value using Nan .ffill: fills the last know value from top in place of missing value .bfill: fills the last know value from bottom in place of missing value .drop_duplicates: drop the rows with duplicate values Then there are some functions for cleaning the data (particularly strings) .str.lower: converts all the character into lowercase .str.contains: checks wheter the string contains something specific .str.split: split the string based on either a white space or a special character .astype: changes the data type .apply: applies a function or method directly to a row or column .map: applies a transformation to each value .replace: replaces something with another And also here is my code and its result




Building Bridges, No...ย โขย 5m
๐ Data Science Reality Check! Ever tracked how much time you actually spend on: โณ 60% Data cleaning & preprocessing ๐ 20% Exploratory analysis ๐ค 15% Model building ๐ก 5% Delivering insights The hard truth: We spend 12x more time preparing data t
See More
Sudheer kumarย โขย 10m
๐ Introducing CleanXo โ The Future of Automated Data Cleaning! ๐ We all know the frustration of messy, inconsistent data slowing down analysis. What if you could clean and prepare your data in under 10 minutesโwithout manual effort? Thatโs exactly
See More
Sudheer kumarย โขย 10m
"I'm building an automated data cleaning platform, CleanXo, that simplifies the entire data preprocessing pipeline. It will automatically detect and handle outliers (Z-score, IQR, Isolation Forest), perform feature engineering, generate insights with
See Moreย โขย
Self Employedย โขย 1m
Iโve been strengthening my data analysis and visualization fundamentals by building multiple end-to-end analytical projects using Python, Pandas, Matplotlib, and Seaborn. Projects worked on: Video Game Market Analysis Football Performance Analysis (
See More



Turing Data into str...ย โขย 6m
๐ฆ Weather Prediction Using Machine Learning ๐ Iโm excited to share one of my recent projects โ a Weather Prediction Model built using machine learning! ๐ ๐ Project Overview: The model was trained to predict temperature based on historical weath
See More
Download the medial app to read full posts, comements and news.