This project aims to analyze F1 drivers' performance across different circuits during the 2021-2024 seasons. Through exploratory data analysis and statistical techniques, we seek to answer two key questions:
- Does track qualities influence lap time across drivers?
- What are some circuits that each driver performs best in?
An in-depth overview of the data analysis pipeline and results is available here.
The workflow consists of several integrated components:
- Data Collection - FastF1
- FastF1 is an open-source resource providing: lap timing data, car telemetry and position, tire data, weather data.
- Data Preprocessing - Removing outliers
- Statistical Analysis - Simple statistical techniques (mean, std, var) and mixed-effects modeling
- Developing Visualizations - Scatter plots and box plots
To ensure accurate comparisons, the project focuses on a roster of recent drivers who have:
- Remained with the same team for the past 3-4 years
- Consistent performance across multiple seasons
fastf1.ipynb: Comprehensive Jupyter Notebook containing all code and analysisData.csvandNew_data.csv: Data retrieved using FastF1 APILap Performance Across*.png: Visualization for across drivers comparisonsLapTimePerKm*.csv: Statistical analysis summaries
- Clone the repository
- Install required dependencies: Copy
pip install fastf1 pandas matplotlib seaborn plotly statsmodels - Open
fastf1.ipynb - Run the notebook to reproduce the analysis and visualizations