Zermelo's problem: Optimal point-to-point navigation in 2D turbulent flows using Reinforcement Learning
July 17, 2019 Β· Declared Dead Β· π Chaos
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Luca Biferale, Fabio Bonaccorso, Michele Buzzicotti, Patricio Clark Di Leoni, Kristian Gustavsson
arXiv ID
1907.08591
Category
nlin.CD
Cross-listed
cs.AI,
cs.LG,
eess.SY,
physics.flu-dyn
Citations
105
Venue
Chaos
Last Checked
3 months ago
Abstract
To find the path that minimizes the time to navigate between two given points in a fluid flow is known as Zermelo's problem. Here, we investigate it by using a Reinforcement Learning (RL) approach for the case of a vessel which has a slip velocity with fixed intensity, Vs , but variable direction and navigating in a 2D turbulent sea. We show that an Actor-Critic RL algorithm is able to find quasi-optimal solutions for both time-independent and chaotically evolving flow configurations. For the frozen case, we also compared the results with strategies obtained analytically from continuous Optimal Navigation (ON) protocols. We show that for our application, ON solutions are unstable for the typical duration of the navigation process, and are therefore not useful in practice. On the other hand, RL solutions are much more robust with respect to small changes in the initial conditions and to external noise, even when V s is much smaller than the maximum flow velocity. Furthermore, we show how the RL approach is able to take advantage of the flow properties in order to reach the target, especially when the steering speed is small.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β nlin.CD
R.I.P.
π»
Ghosted
R.I.P.
π»
Ghosted
Dynamical Complexity Of Short and Noisy Time Series
R.I.P.
π»
Ghosted
Shannon Entropy Rate of Hidden Markov Processes
R.I.P.
π»
Ghosted
Theoretical design and circuit implementation of integer domain chaotic systems
R.I.P.
π»
Ghosted
Spectral Simplicity of Apparent Complexity, Part I: The Nondiagonalizable Metadynamics of Prediction
R.I.P.
π»
Ghosted
Chaotic, informational and synchronous behaviour of multiplex networks
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
π»
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
π»
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
π»
Ghosted