Sanskar

Keen Learner and Exp... • 15h

Day 1 of learning Data Science as a beginner.

Topic: the data science life cycle and reading a JSON file data dump.

What is the data science life cycle?

The data science life cycle is the structured process of extracting useful, actionable insights from raw data (which we refer to as a data dump). It has the following steps:

1. Problem Solving: understand the problem you want to solve.
2. Data Collection: gathering relevant data from multiple sources is a crucial step in data science. We can collect data using APIs, web scraping, or third-party datasets.
3. Data Cleaning (Data Preprocessing): here we prepare the raw data (the data dump) that we collected in step 2.
4. Data Exploration: here we examine and analyse the data to find patterns and relationships.
5. Model Building: here we create and train machine learning models and use algorithms to predict outcomes or classify data.
6. Model Evaluation: here we measure how well our model performs, including its accuracy.
7. Deployment: integrating our model into a production system.
8. Communicating and Reporting: now that we have deployed our model, it is important to communicate and report its analysis and results to the relevant people.
9. Maintenance & Iteration: keeping our model up to date and accurate is crucial for better results.

As part of my data science learning journey, I decided to start by reading a data dump (obviously a dummy one) from a .json file using pure Python. My goal is to understand why we need so many libraries to analyse and clean data. Why can't we do it in a pure Python script? The obvious answer is "to save time", but I feel I first need to experience the problem in order to understand its solution better.

So first I dumped my raw data into a data.json file, and then I used the json module's load method in a function to read my data dump from data.json. Then I used an f-string and a for loop to go through each record and print the data in a more readable format.
Here's my code and its result.
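
The code itself was shared as an image and is not part of this text, so here is only a minimal sketch of the approach described above: read data.json with json.load inside a function, then loop over the records and print each one with an f-string. The record fields (name, age, city), the sample values, and the function names write_dump, read_dump, and format_record are all assumptions made for illustration, not the original code.

```python
import json

# Hypothetical dummy records; the post does not show the actual data.
SAMPLE_RECORDS = [
    {"name": "Asha", "age": 29, "city": "Pune"},
    {"name": "Ravi", "age": 34, "city": "Delhi"},
]


def write_dump(path, records):
    """Write the dummy records to a JSON file (stands in for the raw data dump)."""
    with open(path, "w", encoding="utf-8") as f:
        json.dump(records, f, indent=2)


def read_dump(path):
    """Read the JSON data dump with json.load and return the parsed object."""
    with open(path, "r", encoding="utf-8") as f:
        return json.load(f)


def format_record(record):
    """Turn one record (a dict) into a readable one-line summary using an f-string."""
    return f"{record['name']} ({record['age']}) lives in {record['city']}"


if __name__ == "__main__":
    # Create the dummy dump, then read it back and print each record.
    write_dump("data.json", SAMPLE_RECORDS)
    for record in read_dump("data.json"):
        print(format_record(record))
```

Even in this tiny version, format_record fails with a KeyError as soon as a record is missing a field, which is the kind of problem that cleaning libraries are meant to handle.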
