Control of Rayleigh-Bénard Convection: Effectiveness of Reinforcement Learning in the Turbulent Regime
We make a compelling case for including domain knowledge to efficiently obtain robust control of chaotic flows. On the left is the typical performance of a domain-informed agent on a random initial condition, and on the right is the typical performance of an uninformed agent. Although both agents achieve a similar reduction of average convective heat transfer under initially chaotic flow conditions, note the large qualitative differences in the flows that are eventually obtained:
- The domain-informed agent always achieves steady flow, and keeps this state stable.
- The uninformed agent exhibits a significantly unsteady flow.
| Domain-Informed | Uninformed |
|---|---|
| ![]() | ![]() |
Data-driven flow control has significant potential for industry, energy systems, and climate science. In this work, we study the effectiveness of Reinforcement Learning (RL) for reducing convective fluid flows in the 2D Rayleigh-Bénard Convection (RBC) system under increasing turbulence. We investigate the generalizability of control across varying initial conditions and turbulence levels and introduce a reward-shaping technique to accelerate training. RL agents trained via single-agent Proximal Policy Optimization (PPO) are compared to linear proportional-derivative (PD) controllers from conventional control theory. The RL agents reduced convection, measured by the Nusselt number, by up to 33% in moderately turbulent systems and 10% in highly turbulent settings, clearly outperforming PD control in all settings. The agents showed strong generalization across different initial conditions and, to a significant extent, generalized to higher turbulence levels. The reward shaping improved sample efficiency and consistently stabilized the Nusselt number up to higher turbulence levels.
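For illustration, a reward-shaping scheme of the kind described above could look like the following minimal sketch. This is an assumed form, not necessarily the exact shaping used in this work; `shaped_reward`, the conduction baseline `nu_conduction`, and the coefficient `kappa` are hypothetical names and parameters.

```python
def shaped_reward(nusselt, prev_nusselt, nu_conduction=1.0, kappa=0.1):
    """Illustrative shaped reward for convection suppression (assumed form).

    Base reward penalizes convective heat transport above the pure-conduction
    baseline (Nu = 1); a small shaping bonus rewards step-to-step reductions
    of Nu, densifying the learning signal during training.
    """
    base = -(nusselt - nu_conduction)
    shaping = kappa * (prev_nusselt - nusselt)
    return base + shaping

# Example: Nu dropped from 2.5 to 2.0 in one step.
print(shaped_reward(2.0, 2.5))  # → -0.95
```

The shaping term vanishes once Nu stops changing, so it mainly accelerates early learning rather than altering the long-run objective.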
## Rayleigh-Bénard Convection

This work uses Fourier Neural Operators to model Rayleigh-Bénard Convection (RBC). RBC describes convection processes in a layer of fluid cooled from the top and heated from the bottom, governed by a set of coupled partial differential equations (the Boussinesq equations).
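In a standard nondimensional Boussinesq formulation (free-fall units; the exact nondimensionalization used in this work may differ), the governing equations read:

```latex
\nabla \cdot \mathbf{u} = 0
\qquad
\frac{\partial \mathbf{u}}{\partial t} + (\mathbf{u} \cdot \nabla)\mathbf{u}
  = -\nabla p + \sqrt{\tfrac{Pr}{Ra}}\,\nabla^2 \mathbf{u} + T\,\hat{\mathbf{e}}_y
\qquad
\frac{\partial T}{\partial t} + (\mathbf{u} \cdot \nabla) T
  = \frac{1}{\sqrt{Ra\,Pr}}\,\nabla^2 T
```

Here $\mathbf{u}$ is the velocity field, $p$ the pressure, $T$ the temperature, $Ra$ the Rayleigh number, and $Pr$ the Prandtl number; buoyancy acts along the vertical unit vector $\hat{\mathbf{e}}_y$.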
The surrogate models are trained on data generated by a Direct Numerical Simulation based on Shenfun with the following parameters:
| Parameter | Value | Parameter | Value |
|---|---|---|---|
| Domain | ((-1, 1), (0, 2π)) | Timestep (Δt) | 0.025 |
| Grid | 64 × 96 | Episode Length | 300 |
| Rayleigh Number | {1e5, 1e6, 2e6, 5e6} | Cook Time | 200 |
| Prandtl Number | 0.7 | | |
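The control objective throughout is the Nusselt number: the ratio of total vertical heat flux (convective plus conductive) to the purely conductive flux of the motionless base state. A minimal sketch of how it can be estimated from discretized fields is shown below; the function and variable names are illustrative, not the repository's API, and the nondimensionalization is assumed.

```python
import numpy as np

def compute_nusselt(T, uy, y):
    """Estimate the mean Nusselt number on a discretized 2D field.

    T, uy : arrays of shape (n_y, n_x) -- temperature and vertical velocity
    y     : array of shape (n_y,)      -- wall-normal coordinates

    Nu = <u_y * T - dT/dy> / (dT_base / H), i.e. total vertical heat flux
    normalized by the conductive flux of the base state.
    """
    dTdy = np.gradient(T, y, axis=0)     # vertical temperature gradient
    H = y[-1] - y[0]                     # layer height
    dT = 1.0                             # nondimensional temperature difference
    q_cond_base = dT / H                 # conductive flux with no motion
    q_total = uy * T - dTdy              # convective + conductive flux
    return q_total.mean() / q_cond_base

# Sanity check: a linear conduction profile with no flow gives Nu ≈ 1.
y = np.linspace(-1.0, 1.0, 96)
T = np.tile(((1.0 - y) / 2.0)[:, None], (1, 64))  # T=1 at bottom, T=0 at top
uy = np.zeros_like(T)
print(round(compute_nusselt(T, uy, y), 6))  # → 1.0
```

Any convective motion that transports heat upward pushes Nu above 1, which is what the agents are trained to suppress.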
We use uv to manage Python dependencies:

```shell
uv sync
```

Alternatively, you can use a virtual environment and install this package directly; dependencies are listed in pyproject.toml.
This script gives a live view of the trained agent's behavior:

```shell
uv run python scripts/sbsa_test.py
```
Adapt the configuration in config/sbsa.yaml and run the training script:

```shell
uv run python scripts/sbsa.py
```
If you find our work useful, please cite us via:

```bibtex
@article{todo,
  title={Control of Rayleigh-Bénard Convection: Effectiveness of Reinforcement Learning in the Turbulent Regime},
  author={Markmann, Thorben and Straat, Michiel and Peitz, Sebastian and Hammer, Barbara},
  journal={},
  year={}
}
```