Paper 2022/1385
Deep Reinforcement Learning-based Rebalancing Policies for Profit Maximization of Relay Nodes in Payment Channel Networks
Abstract
Payment channel networks (PCNs) are a layer-2 blockchain scalability solution, with its main entity, the payment channel, enabling transactions between pairs of nodes "off-chain," thus reducing the burden on the layer-1 network. Nodes with multiple channels can serve as relays for multihop payments by providing their liquidity and withholding part of the payment amount as a fee. Relay nodes might after a while end up with one or more unbalanced channels, and thus need to trigger a rebalancing operation. In this paper, we study how a relay node can maximize its profits from fees by using the rebalancing method of submarine swaps. We introduce a stochastic model to capture the dynamics of a relay node observing random transaction arrivals and performing occasional rebalancing operations, and express the system evolution as a Markov Decision Process. We formulate the problem of the maximization of the node's fortune over time over all rebalancing policies, and approximate the optimal solution by designing a Deep Reinforcement Learning (DRL)-based rebalancing policy. We build a discrete event simulator of the system and use it to demonstrate the DRL policy's superior performance under most conditions by conducting a comparative study of different policies and parameterizations. Our work is the first to introduce DRL for liquidity management in the complex world of PCNs.
Metadata
- Available format(s)
- Category
- Applications
- Publication info
- Published elsewhere. Minor revision. Best Paper Award at the 4th International Conference on Mathematical Research for the Blockchain Economy (MARBLE 2023)
- Keywords
- Payment Channel NetworkLightning Networkrebalancingsubmarine swapDeep Reinforcement Learningoptimizationsimulation
- Contact author(s)
-
nikolaos papadis @ yale edu
leandros tassiulas @ yale edu - History
- 2023-10-08: revised
- 2022-10-13: received
- See all versions
- Short URL
- https://ia.cr/2022/1385
- License
-
CC BY
BibTeX
@misc{cryptoeprint:2022/1385, author = {Nikolaos Papadis and Leandros Tassiulas}, title = {Deep Reinforcement Learning-based Rebalancing Policies for Profit Maximization of Relay Nodes in Payment Channel Networks}, howpublished = {Cryptology {ePrint} Archive, Paper 2022/1385}, year = {2022}, url = {https://eprint.iacr.org/2022/1385} }