This project focuses on analyzing customer data from a retail store to segment customers based on their purchasing behavior. By leveraging the power of machine learning, specifically the K-Means clustering algorithm, this analysis aims to identify distinct customer segments. These insights enable the retail store to tailor its marketing strategies, optimize product offerings, and enhance overall customer satisfaction and revenue.
The dataset used for this project is the "Mall Customers" dataset, which contains demographic and behavioral information about customers, including:
Unique identifier for each customer.
Gender of the customer (Male/Female).
Age of the customer.
Annual income of the customer in thousands of dollars.
Spending score assigned by the mall based on customer behavior and spending nature.
Handling missing values and ensuring data consistency.
Understanding the distribution and relationships within the data.
Segmenting customers into distinct groups based on their purchasing behaviors.
Using Matplotlib and Power BI to create visual representations of the data and clustering results.
Pandas, NumPy, Matplotlib, Seaborn, Scikit-Learn
Power BI
By analyzing customer data and identifying distinct segments, this project helps retail stores implement more effective marketing strategies, improve operational efficiency, and achieve a competitive advantage in the market.
Mall Customers Dataset on Kaggle
Pandas, NumPy, Matplotlib, Seaborn, Scikit-Learn