This package is a two-session, beginner-to-intermediate introduction to AI for petroleum engineers, geoscientists, and other technical energy professionals. It moves from core machine learning concepts into practical generative AI applications that are relevant to upstream oil and gas, subsurface workflows, and carbon storage.
The material is practical and role-aware. The goal is to help you understand what modern AI can do, where it fits in technical work, how to use it responsibly, and how to start experimenting on your own.
Primary audience:
- Petroleum engineers with production, reservoir, and drilling backgrounds
- Geoscientists working in interpretation, well logs, and subsurface evaluation
- Technical professionals interested in carbon capture and sequestration workflows
Assumed starting point:
- Comfortable with engineering decision-making and domain terminology
- Limited or mixed exposure to ML and GenAI
- Some learners may code lightly in Python, but many will not
The material follows this progression:
- Session 1 introduces predictive ML and prescriptive thinking, then pivots into practical GenAI use cases.
- The hands-on notebooks and prompts let you try the ideas directly.
- Session 2 focuses on GenAI, LLMs, RAG, agents, AI coding copilots, and tool connectivity.
Each session now runs for 120 minutes.
Theme: ML foundations plus a strong GenAI bridge
Outcomes:
- Distinguish descriptive, predictive, and prescriptive analytics.
- Understand core ML workflow ideas such as targets, features, validation, and overfitting.
- Map ML techniques to reservoir, production, drilling, geoscience, and carbon storage problems.
- Understand when to use ML versus GenAI.
- See GenAI used for more than summarization, including analysis support and workflow building.
Theme: All GenAI for energy workflows
Outcomes:
- Explain what LLMs are in plain English.
- Understand RAG, chatbots, agents, tool use, and MCP-style connectivity.
- See how modern tools such as
Cursor,Claude Code, andCodexsupport technical work. - Understand how GenAI can support document-heavy and analysis-heavy workflows in energy.
- Learn practical guardrails around hallucination, governance, and human review.
The training will repeatedly anchor back to these examples:
- Production forecasting
- Underperforming well identification
- Artificial lift optimization support
- Water cut and surveillance review
- Nonproductive time screening
- ROP prediction support
- Stuck-pipe risk framing
- Lessons-learned extraction from drilling reports
- Facies or lithology classification
- Well-log interpretation support
- Cross-well note comparison
- Subsurface document search and Q&A
- Site screening
- Storage capacity estimation support
- Candidate ranking
- Documentation and screening summary assistants
The primary running example is a production forecasting copilot.
Why this works well:
- It is familiar to petroleum engineers.
- It now uses a compact official BSEE offshore completion subset with regulator-backed monthly production rather than a synthetic field example.
- It shows where
Arps hyperbolic decline, a multivariateRandomForestRegressor,Exponential Smoothing, andARIMAact as the numerical engines. - It also shows how GenAI can help frame the problem, write analysis code, inspect outputs, generate visuals, explain results, and package the workflow into something reusable.
This storyline appears in both sessions:
- Session 1: GenAI-assisted production forecasting workflow
- Session 2: how the same workflow can evolve into a chatbot, tool-using agent, or internal app
Because the sessions are now longer, the package also includes a simple optimization case study inspired by completions water management.
Why this matters:
- it gives petroleum engineers a concrete prescriptive-analytics example
- it shows how linear optimization differs from ML
- it provides a practical bridge into larger optimization products such as
WaterWise,SandWise,FracGenie, andSequestrix
training/README.mdtraining/sessions/session-1/training/sessions/session-2/training/demos/production-forecasting/training/demos/optimization-waterwise/training/setup/training/data/training/resources/reading-list.md
- Start with
training/README.mdfor the student-friendly course map. - Read the session slide files in
training/sessions/for the conceptual overview. - Use the demo folders in
training/demos/for the hands-on walkthroughs, notebooks, scripts, and results. - Use the setup guides in
training/setup/for local or Colab execution. - Review the datasets and data dictionaries in
training/data/before running the exercises. - Use the standalone
Cursorexercise repo if you want the AI-assisted coding workflow:https://github.com/davidpcg01/hands-on-ml-cursor - Use
training/resources/reading-list.mdfor optional follow-up study.
- No coding is required to read the materials.
- Optional access to
ColaborJupyteris helpful for the notebook exercises. - Optional access to the free version of
Cursoris helpful if you also want to complete the separatehands-on-ml-cursorexercise. - If you want to run everything locally, install the Python packages listed in
training/setup/requirements.txt.
After working through the package, you should be able to:
- name several high-value AI use cases in your discipline
- explain the difference between ML, optimization, and GenAI
- complete at least one small ML exercise and one GenAI exercise
- describe where chatbots or agents would fit into a technical energy workflow
- identify one workflow in your own job that is ready for experimentation