Skip to content
View thyphan2025's full-sized avatar
πŸ’­
Learning
πŸ’­
Learning

Block or report thyphan2025

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
thyphan2025/README.md

About Me

  • πŸ‘‹ Hi, I’m @thyphan2025
  • πŸ‘€ I’m interested in AI & Machine Learning.
  • 🌱 I’m currently pursuing Master of Science in Data Analytics Engineering at George Mason University
  • πŸ˜„ Pronouns: she/her
  • ⚑ Fun fact: I love exploring different cultures, especially their amazing foods.
  • ⭐ Motivation quote : "I have no special talents. I am only passionately curious." - Albert Einstein

Currently Working On

  • Data Analytics Project (Capstone)
  • Building small passion projects to explore data workflows and new tools
  • Reading Designing Machine Learning Systems by Chip Huyen
  • Reading Machine Learning Systems by Prof. Vijay Janapa Reddi - Harvard University
  • Reading Fairness and Machine Learning by Solon Barocas, Moritz Hardt, Arvind Narayanan
  • Starting MLOps Zoomcamp course

⭐ Highlighted Projects

πŸ”Ή Bridge Material & Design Analysis β€” Feb 2026

Python, PySpark, Databricks

  • Cleaned and reshaped a multi-state bridge dataset to examine material and design patterns and applied association rule mining to identify recurring relationships.

β†’ Bridge-Material-and-Design-Analysis


πŸ”Ή Influenza Surveillance Dashboard β€” Oct 2025

Power BI

  • Explored multi-season influenza data to monitor trends, subtype distribution, and outbreak severity through an interactive dashboard.

β†’ Influenza Surveillance Dashboard Chicago


πŸ”Ή Air Quality Analysis β€” Jul 2025

R, Time-Series Analysis, Interactive Plot, Forecasting

  • Cleaned and analyzed multi-year air quality data to examine environmental risk patterns and forecast ozone trends using ARIMA model.
  • Published interactive HTML report with code, Plotly visualizations, and a few static plots.

β†’ New York Air Quality Analysis


πŸ”Ή Electric Vehicle Analysis β€” Jun 2025

Python, Data Analysis, Machine Learning

  • Analyzed electric vehicle adoption data to examine growth trends, geographic distribution, and vehicle characteristics across regions.
  • Trained a Decision Tree Model to classify between Battery Electric Vehicles (BEVs) and Plug-in Hybrid Electric Vehicles (PHEVs)
  • Utilized Synthetic Minority Oversampling Technique (SMOTE) to address class imbalance.

β†’ Electric-Vehicle-Analysis


πŸ”Ή Education in Danger Analysis β€” Dec 2024

Python, SQL, R, NLP

  • Cleaned and analyzed global incident data to identify geographic hotspots, severity patterns, and recurring risk signals affecting education infrastructure.
  • Applied natural language processing (NLP) to extract sentiment and patterns from incident descriptions.

β†’ Education-in-Danger-Incidents

πŸ“ Other Projects

Bridge Damage Prediction (Group Project) β€” PySpark ML workflow (notebook-based)

Python, PySpark, Spark MLib, Databricks

  • Contributed code to the PySpark modeling workflow in Databricks, including feature engineering and evaluation using Python, PySpark and Spark MLlib.

β†’ Bridge-Damage-Prediction

Pinned Loading

  1. Bridge-Material-and-Design-Analysis Bridge-Material-and-Design-Analysis Public

    Bridge Material and Design Analysis and Association Rules of Material and Design in Databricks

    Jupyter Notebook

  2. Education-in-Danger-Incidents Education-in-Danger-Incidents Public

    Education in Danger Incidents in War and Conflicted Areas (R, SQL, Python, NLP)

    Jupyter Notebook

  3. Electric-Vehicle-Analysis Electric-Vehicle-Analysis Public

    Electric Vehicle Analysis (Cleaning, Perform Exploratory Data Analysis and Apply Decision Tree Model in Python)

    Jupyter Notebook

  4. New-York-Air-Quality New-York-Air-Quality Public

    Air Quality Analysis of New York City in R

    HTML

  5. Power-BI---Influenza-Surveillance-Dashboard-Chicago- Power-BI---Influenza-Surveillance-Dashboard-Chicago- Public

    The Influenza Surveillance Weekly - Historical dataset - City of Chicago has been analyzed using Power BI