Multi-Armed Bandit Problem Simulator

This is a React-based simulator for the k-armed bandit problem, designed to test and visualize reinforcement learning strategies with an interactive UI styled using Tailwind CSS.

Built With

React - A JavaScript library for building user interfaces.
Tailwind CSS - A utility-first CSS framework for rapid UI development.

Quickstart

First, make sure Node.js is installed. It ships with npm, which manages the project's dependencies.

Clone the project and install dependencies:

git clone https://github.com/mweglowski/bandit_demonstration.git
cd bandit_demonstration
npm install

To run the application in development mode:

npm start

This will open the simulator in your default web browser. For production builds, you can use:

npm run build

Features

Multiple bandits with unique probabilistic reward distributions.
Interactive interface for 'pulling' bandit arms, built with React.
Responsive and modern UI using Tailwind CSS.
Visualization of action counts and estimated values.
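
As a rough illustration of what "unique probabilistic reward distributions" can mean, the sketch below gives each arm its own hidden mean and returns a noisy reward on every pull. It assumes Gaussian rewards in the style of the Sutton and Barto testbed; the README does not state which distributions the simulator actually uses, and the names sampleStandardNormal and createBandits are hypothetical.

```javascript
// Hypothetical helper (not from the repository): draw one sample from a
// standard normal distribution using the Box-Muller transform.
function sampleStandardNormal(random = Math.random) {
  const u1 = 1 - random(); // shift [0, 1) to (0, 1] so Math.log never sees 0
  const u2 = random();
  return Math.sqrt(-2 * Math.log(u1)) * Math.cos(2 * Math.PI * u2);
}

// Hypothetical factory: build k arms, each with its own hidden mean reward.
// Gaussian rewards are an assumption made for this sketch.
function createBandits(k, random = Math.random) {
  return Array.from({ length: k }, () => {
    const trueMean = sampleStandardNormal(random);
    return {
      trueMean,
      // Pulling the arm returns a noisy reward centred on trueMean.
      pull: () => trueMean + sampleStandardNormal(random),
    };
  });
}

const bandits = createBandits(10);
const reward = bandits[3].pull();
console.log(reward); // a different number on every run
```

Passing the random source as a parameter keeps the sketch easy to test with a fixed generator.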

Reinforcement Learning Strategies

The simulator focuses on the ε-greedy strategy, which balances exploration and exploitation: it selects the best-known action with probability 1−ε and explores a random action with probability ε.
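
The selection rule can be sketched in a few lines of JavaScript. This is a minimal illustration, not the simulator's actual code: the function name selectAction, its parameters, and the tie-breaking choice (first arm wins) are all assumptions.

```javascript
// Hypothetical helper (not from the repository): pick an arm index with the
// epsilon-greedy rule. `estimates` holds the current value estimate per arm.
function selectAction(estimates, epsilon, random = Math.random) {
  if (random() < epsilon) {
    // Explore: choose any arm uniformly at random.
    return Math.floor(random() * estimates.length);
  }
  // Exploit: choose the arm with the highest estimate (first one on ties).
  let best = 0;
  for (let i = 1; i < estimates.length; i++) {
    if (estimates[i] > estimates[best]) best = i;
  }
  return best;
}

// With epsilon = 0 the choice is always greedy.
console.log(selectAction([0.1, 0.9, 0.4], 0)); // 1
```

Because the exploratory draw is uniform over all arms, it can also land on the greedy arm, so in this sketch the greedy arm is picked with probability 1−ε + ε/k for k arms.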

Incremental Update Rule used in Reinforcement Learning

The simulator updates the estimated action value Q using the formula:

Q(n+1) = Q(n) + (1/n) * (Rn - Q(n))

Where:

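Q(n+1) is the new estimate after the nth reward,
Q(n) is the current estimate, based on the first n−1 rewards,
Rn is the reward received on the nth selection of the action,
n is the number of times the action has been chosen, including the current selection.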
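
A minimal sketch of the update rule above, assuming n counts the current pull so that the estimate is the running average of the rewards seen so far. The function name updateEstimate and the sample rewards are illustrative only.

```javascript
// Hypothetical helper (not from the repository): apply the incremental rule
// Q(n+1) = Q(n) + (1/n) * (Rn - Q(n)) for one new reward.
function updateEstimate(currentEstimate, reward, n) {
  return currentEstimate + (reward - currentEstimate) / n;
}

// Feed three rewards to one arm and track its estimate.
let q = 0; // initial estimate
let n = 0; // number of times the arm has been chosen
for (const r of [2, 4, 6]) {
  n += 1; // count the current pull before updating
  q = updateEstimate(q, r, n);
}
console.log(q); // 4, the average of 2, 4 and 6
```

The rule keeps only the current estimate and the count, so the full reward history never needs to be stored.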

Usage

After launching the simulator, interact with the UI by selecting a bandit to 'pull'. Observe the algorithm's performance and how estimated values update based on the reward distributions.

Image Previews

Desktop Preview

Mobile Preview

Website

Explore the simulator online at https://bandit-problem-simulator.vercel.app/.

Contributing

I welcome contributions! If you have suggestions or are interested in improving the k-armed bandit simulator, please feel free to fork the repository, make changes, and submit a pull request.

Inspired by the foundational reinforcement learning work of Sutton and Barto.
