Stopping Agents
Language agents for optimal stopping
Code
We provide reference implementations of stopping agents parameterized with various open-source large language models and trained using different approaches as Jupyter notebooks.
The list of available notebooks is below:
Policy Model | Training Method | Notebook |
---|---|---|
Llama 3.2 3B | Behavioral Cloning | Llama 3.2 3B + BC |
If you are interested in contributing a new stopping agent notebook, please reach out to emaadmanzoor@cornell.edu or open a Github issue.