Skip to content

Official code base for the paper "Randomized Confidence Bounds for Stochastic Partial Monitoring" accepted at ICML 2024.

Notifications You must be signed in to change notification settings

MaxHeuillet/partial-monitoring-algos

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Randomized Confidence Bounds for Partial Monitoring

This repository contains the implementation of algorithms described in the paper "Randomized Confidence Bounds for Partial Monitoring". This branch is a sandbox version, the developer's code is stored on the other branch of the project.

Prerequisites

Before you begin, ensure you have met the following requirements:

  • Python 3.8
  • pip

Installation

Follow these steps to set up your environment and run the experiments:

  1. Create a Virtual Environment:

    python -m venv env
    source env/bin/activate  
    
  2. Install Dependencies:

    pip install -r requirements.txt

Installation Troubleshooting:

  • Cyipopt: If you encounter issues installing cyipopt, ensure you have the latest versions of pip, setuptools, and wheel. You may also need additional system dependencies. For more details, see the Cyipopt Installation Guide.
  • Gurobi Alternative: If you prefer not to use Gurobi, you can use PULP as an alternative optimizer. To do this, install PULP using pip install pulp.

Running Experiments

  • Non-contextual Experiments: To run non-contextual experiments, use the Jupyter notebook experiment_noncontextual.ipynb.
  • Contextual Experiments: For contextual experiments, refer to the experiment_contextual.ipynb notebook.
  • Use case Experiments: For the use case, refer to the Use_case folder, approaches C-CBP, C-RandCBP and ExploreFully are in the utils.py file. Specifically, see ./use_case/benchmark_use_case2.ipynb script.

Acknowledgements

This work was funded through Mitacs with additional support from CIFAR (CCAI Chair). We thank Alliance Canada and Calcul Quebec for access to computational resources and staff expertise consultation. We would like to thank Junpei Komiyama, Taira Tsuchiya, Ian Lienert, Hastagiri P. Vanchinathan and James A. Grant for answering our technical questions and/or providing total/partial access to private code bases of their approaches. We also acknowledge the library pmlib of Tanguy Urvoy that was helpful to implement PM game environments.

About

Official code base for the paper "Randomized Confidence Bounds for Stochastic Partial Monitoring" accepted at ICML 2024.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published