Deep Reinforcement Learning-based Rebalancing Policies for Profit Maximization of Relay Nodes in Payment Channel Networks

Nikolaos Papadis; Leandros Tassiulas

Paper 2022/1385

Deep Reinforcement Learning-based Rebalancing Policies for Profit Maximization of Relay Nodes in Payment Channel Networks

Nikolaos Papadis

, Yale University

Leandros Tassiulas

, Yale University

Abstract

Payment channel networks (PCNs) are a layer-2 blockchain scalability solution, with its main entity, the payment channel, enabling transactions between pairs of nodes "off-chain," thus reducing the burden on the layer-1 network. Nodes with multiple channels can serve as relays for multihop payments by providing their liquidity and withholding part of the payment amount as a fee. Relay nodes might after a while end up with one or more unbalanced channels, and thus need to trigger a rebalancing operation. In this paper, we study how a relay node can maximize its profits from fees by using the rebalancing method of submarine swaps. We introduce a stochastic model to capture the dynamics of a relay node observing random transaction arrivals and performing occasional rebalancing operations, and express the system evolution as a Markov Decision Process. We formulate the problem of the maximization of the node's fortune over time over all rebalancing policies, and approximate the optimal solution by designing a Deep Reinforcement Learning (DRL)-based rebalancing policy. We build a discrete event simulator of the system and use it to demonstrate the DRL policy's superior performance under most conditions by conducting a comparative study of different policies and parameterizations. Our work is the first to introduce DRL for liquidity management in the complex world of PCNs.

Metadata

Available format(s): PDF
Category: Applications
Publication info: Published elsewhere. Minor revision. Best Paper Award at the 4th International Conference on Mathematical Research for the Blockchain Economy (MARBLE 2023)
Keywords: Payment Channel Network Lightning Network rebalancing submarine swap Deep Reinforcement Learning optimization simulation
Contact author(s): nikolaos papadis @ yale edu
leandros tassiulas @ yale edu
History: 2023-10-08: revised; 2022-10-13: received; See all versions
Short URL: https://ia.cr/2022/1385
License: CC BY

BibTeX

@misc{cryptoeprint:2022/1385,
      author = {Nikolaos Papadis and Leandros Tassiulas},
      title = {Deep Reinforcement Learning-based Rebalancing Policies for Profit Maximization of Relay Nodes in Payment Channel Networks},
      howpublished = {Cryptology {ePrint} Archive, Paper 2022/1385},
      year = {2022},
      url = {https://eprint.iacr.org/2022/1385}
}