A simple data analysis mini-project on the Airbnb Open Data (NYC listings). This was made as part of my data analysis coursework using Python, Pandas, Matplotlib and Seaborn.
To clean the Airbnb dataset and explore it visually to answer questions like:
- Which neighbourhood group has the most listings?
- What is the average price in each area?
- Which room types are most common?
- How are reviews and prices related?
Event_Data_Analysis_Project/
β
βββ Airbnb_Open_Data.xlsx # Original dataset
βββ airbnb_eda.py # Main analysis script
βββ airbnb_cleaned.csv # Cleaned dataset (created after running)
βββ requirements.txt # Python libraries needed
βββ index.html # Project website
βββ style.css # Website styling
βββ images/ # Charts created by the script
βββ README.md
-
Clone this repository
git clone https://github.com/ParvathyM155/Event_Data_Analysis_Project.git cd Event_Data_Analysis_Project -
Install the required libraries
pip install -r requirements.txt
-
Run the analysis script
python airbnb_eda.py
-
Open
images/to see the generated charts, or openindex.htmlin a browser to view the project website.
- Renamed columns to lowercase with underscores
- Fixed typos in
neighbourhood_group(brooklnβBrooklyn,manhatanβManhattan) - Dropped the almost-empty
licensecolumn - Removed duplicate rows
- Removed rows missing important fields (
price,room_type, etc.) - Filtered out unrealistic values (price β€ 0, minimum nights > 365, etc.)
| File | What it shows |
|---|---|
listings_by_group.png |
Number of listings per neighbourhood group |
room_type_distribution.png |
How common each room type is |
avg_price_by_group.png |
Average price in each neighbourhood group |
price_distribution.png |
Overall distribution of prices |
price_by_room_type.png |
Price comparison across room types |
top10_neighbourhoods.png |
Top 10 neighbourhoods by number of listings |
reviews_vs_price.png |
Relationship between reviews and price |
heatmap.png |
Correlation heatmap of numeric columns |
A simple HTML/CSS page (index.html) is included to present the project.
You can host it for free using GitHub Pages:
- Push the project to GitHub
- Go to Settings β Pages
- Under Source, choose
mainbranch and/root - Save β your site will be live at
https://<your-username>.github.io/Event_Data_Analysis_Project/
- Python 3
- Pandas
- Matplotlib
- Seaborn
- Jupyter / Google Colab (for early exploration)
Parvathy M
Data Analysis Mini Project