Python for Data Science

A structured, notebook-first repository for learning Python and applying it to data science through hands-on practice, projects, and real datasets.

Overview
What You Will Learn
Repository Structure
Learning Sequence
Datasets
Quick Start
Tools and Technologies
Contributing
License

Overview

This repository is designed as a complete Python-for-data-science learning track. It begins with core Python concepts and gradually progresses to data analysis and visualization using NumPy, Pandas, Matplotlib, and Seaborn.

What You Will Learn

Python fundamentals and syntax
Intermediate and advanced Python concepts
Object-oriented programming (OOP)
Exception and file handling
NumPy fundamentals and advanced operations
Data analysis with Pandas
Data visualization with Matplotlib and Seaborn
Practice exercises and mini projects

Repository Structure

Folder	Focus Area
`Fundamentals of Python/`	Core Python basics, control flow, data structures, functions
`Advance Python/`	Decorators, scope, and advanced language concepts
`Exception Handling in Python /`	Error handling patterns and custom exceptions
`File Handling in Python/`	File I/O, binary files, and serialization
`OOPS in Python/`	Classes, inheritance, polymorphism, abstraction, encapsulation
`Python fundamental Questions/`	Practice questions for core and intermediate topics
`My projects/`	Mini projects for applied learning
`NumPy For Python /`	NumPy fundamentals and array operations
`NumPy For Practice/`	NumPy practice notebooks
`NumPy Advance/`	Broadcasting, advanced indexing, utilities
`Matplotlib in Python/`	Plotting and charting exercises
`Seaborn in Python/`	Statistical and relational visualizations
`Pandas in Python/`	Series, DataFrames, DateTime operations, and practice
`Database from Kaggle/`	Dataset collection used across notebooks
`Exploratory Data Analysis (EDA)/`	EDA-focused notebook work

Note: Some folder names include spaces, and a few intentionally include trailing spaces (for example, Exception Handling in Python / and NumPy For Python /) to preserve the repository's original notebook and dataset paths. Copy paths carefully when using terminal commands.

Learning Sequence

Recommended study order:

Fundamentals of Python
Advance Python
Exception Handling in Python
File Handling in Python
OOPS in Python
Python fundamental Questions
My projects
NumPy For Python
NumPy For Practice
NumPy Advance
Matplotlib in Python
Seaborn in Python
Pandas in Python
Exploratory Data Analysis (EDA)

Datasets

The Database from Kaggle/ directory contains datasets used throughout NumPy, Pandas, and visualization notebooks, including:

IPL match and delivery datasets
Movie and IMDb datasets
Student performance and expense datasets
Additional CSV/Excel files in Database from Kaggle/datasets / (note the intentional trailing space after datasets).

Quick Start

git clone https://github.com/aayushmanz/Python-For-Data-Science.git
cd Python-For-Data-Science
python -m venv .venv
source .venv/bin/activate
pip install jupyter numpy pandas matplotlib seaborn
jupyter notebook

For Windows PowerShell:

.venv\Scripts\Activate.ps1

Tools and Technologies

Python 3
Jupyter Notebook
NumPy
Pandas
Matplotlib
Seaborn
Git and GitHub

Contributing

Contributions are welcome.

Fork the repository
Create a feature branch
Make and commit your changes
Open a pull request with a clear summary

License

This project is available under the MIT License.

Maintained by Ayush Suthar.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Python for Data Science

Table of Contents

Overview

What You Will Learn

Repository Structure

Learning Sequence

Datasets

Quick Start

Tools and Technologies

Contributing

License

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

Python for Data Science

Table of Contents

Overview

What You Will Learn

Repository Structure

Learning Sequence

Datasets

Quick Start

Tools and Technologies

Contributing

License