This project performs comprehensive HR data analysis using Python on a dataset of 2 million employee records.
It focuses on workforce distribution, hiring trends, performance analytics, salary insights, attrition patterns, and more.
The analysis transforms raw HR data into actionable insights for HR managers, analysts, and business leaders.
- Python
- Pandas
- NumPy
- Matplotlib
- Seaborn
- Jupyter Notebook
The dataset contains 11 columns, including:
- Employee details
- Department & Job Title
- Hire Date & Country
- Salary
- Performance Rating
- Work Mode (Remote / On-site)
- Status (Active, Resigned, Retired, Terminated)
Breakdown of:
- Active
- Resigned
- Terminated
- Retired
Pie-chart visualization included.
Remote vs On-site workforce share.
Department-level analysis with bar charts.
Computed and visualized using Pandas groupby + Matplotlib.
Top salaries by job roles — insights into compensation structures.
Multi-level groupby analysis showing detailed salary comparisons.
Attrition breakdown with bar charts.
Understanding how salary grows with experience.
Department-wise average performance score.
Extracted from location → country parsing.
Correlation between:
- Salary & performance rating
- Numeric features heatmap
Shows annual hiring patterns over 16 years.
Identifying if remote workers are paid more.
Department-wise salary ranking using nlargest().
Resigned % per department (Resigned / Total * 100).
Sorted to find highest-risk departments.
- Pie charts
- Countplots
- Bar charts
- Heatmaps
- Yearly hiring trend graph
- Department vs Job salary chart
All visuals generated using Matplotlib and Seaborn.
📦 HR-Data-Analysis
┣ 📜 HR_Data_Analysis.ipynb
┣ 📜 HR_Data.csv
┣ 📜 README.md
┗ 📂 images/ (optional charts)
👤 Loganathan
loganathanvizasia@gmail.com