Skip to content
View ssilvacris's full-sized avatar

Block or report ssilvacris

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ssilvacris/README.md

Hi, I'm Cristiane da Silva

Data Scientist / Data Analyst

Passionate about transforming raw data into actionable business insights. Experienced in Machine Learning, Statistical Modeling, Spatial Analysis, and Interactive Visualizations using Python, R, and Tableau.

LinkedIn


Featured Projects

Barcelona Public Drinking Fountains Analysis

Web App Jupyter Notebook

An end-to-end Geospatial Data Science project analyzing urban resource accessibility using open data (2019–2024).

  • Business Insight: Identified spatial inequalities (Eixample Paradox) where high-population areas suffer from lower per-capita access, providing actionable data for urban planning.
  • Tech Stack: Python (Pandas, Geopandas, Scikit-learn), Folium, Leaflet.js, Chart.js.
Map Preview Temporal Analysis

NYT Headlines Analysis

Python NLTK Tableau

An exploratory data analysis (EDA) of New York Times headlines during the first 7 months of 2020, using Natural Language Processing (NLP) techniques to identify patterns and trends in news coverage during one of the most intense periods in recent history.

  • Coronavirus: Surge from 275 to 811 mentions between February and March (peak of WHO pandemic declaration)
  • George Floyd: "Police" and "Floyd" among the most cited words in June
  • US Elections: "Trump" gaining relevance in July as the presidential election approached

Tech Stack

  • Data Collection: NYT API
  • Processing: Python + NLTK (tokenization, stopwords, frequency analysis)
  • Visualization: Tableau (animated charts) + WordCloud
  • Analysis: Pandas, NumPy, Matplotlib, Seaborn

Demo

WordCloud Example Frequency Chart

Links


Commercial Location Optimization via Clustering (IBM Capstone)

Jupyter Notebook Article

Market research simulation to identify optimal locations for a new coffee shop in Montreal, mitigating investment risks through data.

  • Methodology: Web scraping and Foursquare API integration to map competitors, followed by K-Means Clustering for neighborhood segmentation.
  • Tech Stack: Python, Scikit-learn, Folium for geospatial visualization.


Financial & Statistical Modeling

Financial Time Series Forecasting (ARIMA)

RMarkdown

Statistical analysis of financial assets to model volatility and forecast future stock price movements.

  • Tech Stack: R language, ARIMA modeling, Statistical Hypothesis Testing.

Interactive Future Value Calculator

Jupyter Notebook

A dynamic, interactive tool built to simulate financial growth scenarios across shifting interest rates.

  • Tech Stack: Python, Ipywidgets for dynamic UI components.


Business Intelligence & Tableau Dashboards

A collection of interactive executive dashboards focused on operations, supply chain, and spatial insights.

Indian Coal Mine Production ACME Superstore Performance Dual Layer Mapping
Tableau Tableau Tableau

Pinned Loading

  1. Finance-Projects Finance-Projects Public

    This repository contains several exercises in Python and R, mainly in the area of finance, financial modeling, and statistics.

    HTML 54 11

  2. Coursera_Capstone Coursera_Capstone Public

    Jupyter Notebook 3 1

  3. barcelona-water-distribution barcelona-water-distribution Public

    Analysis of public water fountains in Barcelona (2019–2024), population and ML insights.

    HTML