Sanskar

Keen Learner and Exp... • 14h

Day 13 of learning data science as a beginner.

Topic: data cleaning and preprocessing

In real-world applications we rarely get clean data. Most of the time we get a raw data dump that needs to be cleaned and preprocessed before it can be used (fun fact: it is often said that data scientists spend about 80% of their time cleaning and preprocessing data).

Pandas not only lets us analyse data but also helps us clean and process it. Some of the most commonly used pandas preprocessing functions are:

.isnull: flags each value as True (missing) or False (present), so you can check for missing values in the data set
.dropna: by default, deletes every row that contains a missing value
.fillna: fills missing values with a value you specify (for example 0 or the column mean)
.ffill: fills a missing value with the last known value above it
.bfill: fills a missing value with the next known value below it
.drop_duplicates: drops rows that duplicate an earlier row

Then there are some functions for cleaning the data (particularly strings):

.str.lower: converts all characters to lowercase
.str.contains: checks whether each string contains a specific substring or pattern
.str.split: splits each string on whitespace or on a given separator
.astype: changes the data type
.apply: applies a function to each row or column of a DataFrame, or to each value of a Series
.map: applies a transformation to each value of a Series
.replace: replaces specified values with other values

And here is my code and its result.
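The author's own code is not reproduced in this text. As a stand-in, here is a minimal sketch of the functions listed above, run on a small made-up DataFrame. The column names (`name`, `age`, `score`) and all the values are invented for illustration and are not taken from the original post.

```python
import numpy as np
import pandas as pd

# Hypothetical toy data, invented for this example only.
raw = pd.DataFrame({
    "name": ["Alice Smith", "BOB JONES", None, "Alice Smith", "Cara Lee"],
    "age": ["25", "31", "40", "25", "19"],        # numbers stored as text
    "score": [88.0, np.nan, 75.0, 88.0, np.nan],
})

# --- Missing values ---
missing_per_column = raw.isnull().sum()   # number of missing values per column
complete_rows = raw.dropna()              # keep only rows with no missing value
score_zero = raw["score"].fillna(0)       # fill gaps with a chosen value (here 0)
score_ffill = raw["score"].ffill()        # copy the previous known value downward
score_bfill = raw["score"].bfill()        # copy the next known value upward;
                                          # a trailing gap stays missing

# --- Duplicates ---
deduped = raw.drop_duplicates()           # the second "Alice Smith" row is dropped

# --- String cleaning ---
names_lower = raw["name"].str.lower()
has_smith = raw["name"].str.contains("smith", case=False, na=False)
name_parts = raw["name"].str.split(" ")   # each name becomes a list of words

# --- Types and transformations ---
ages = raw["age"].astype(int)                                   # text -> integers
half_score = score_zero.apply(lambda s: s / 2)                  # function per value
age_group = ages.map(lambda a: "30+" if a >= 30 else "under 30")
names_fixed = raw["name"].replace("BOB JONES", "Bob Jones")     # swap one value

print(missing_per_column)
print(deduped)
```

None of these calls modify `raw` in place. Each one returns a new object, which is why every result is assigned to its own variable.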
