Table of Links
-
Preliminaries
-
Selfish Mining Attack
3.3 Formal Analysis
Goal of the analysis. We now show how to compute the optimal expected relative revenue together with an adversarial strategy achieving it in the MDP M = (π, π΄, π, π 0), up to an arbitrary precision parameter π > 0. Formally, for each strategy π in M, let
be the expected relative revenue under adversarial strategy π, i.e. the relative ratio of the number of blocks accepted on the main chain belonging to the adversary and to honest miners. Moreover, let
be the optimal expected relative revenue that an adversarial strategy can attain. Given a precision parameter π > 0, our goal is to compute
We do this by defining a class of reward functions in the MDP M and showing that, for any value of the precision parameter π > 0, we can compute the above by solving the mean-payoff MDP with respect to a reward function belonging to this class. Our analysis draws insight from that of [27], which considered selfish mining in PoW blockchains and also reduced reasoning about expected relative revenue to solving mean-payoff MDPs with respect to suitably defined reward functions. However, in contrast to [27], we consider selfish mining in efficient proof systems in which the adversary can mine on multiple blocks, meaning that our design of reward functions and the analysis require additional care.
Reward function definition. The key challenge in designing the reward function is that the main chain and the blocks on it may change whenever the adversary publishes a private fork. Hence, we design the reward function to incur positive (resp. negative) reward whenever a block owned by the adversary (resp. honest miners) is accepted at the depth strictly greater than π in the main chain. Since the adversary only mines and publishes private forks mined on blocks up to depth π in the main chain, this means that blocks beyond depth π are guaranteed to remain on the main chain.
Formally, for each π½ β [0, 1], we define ππ½ : π Γ π΄ Γ π β R to be a reward function in M which to each state-action-state triple (π , π, π β² ) assigns the reward:
β’ 1βπ½, for each block belonging to the adversary accepted at depth greater than π as a result of performing the action;
β’ βπ½, for each block belonging to honest miners accepted at depth greater than π as a result of performing the action.
Formal analysis. Our formal analysis is based on the following theorem. For clarity of exposition, we defer the proof of the theorem to Appendix C . For every π > 0, the theorem shows how to relate the optimal expected relative revenue in the MDP and π-optimal strategies to the optimal mean-payoff and π-optimal strategies under the reward function ππ½ for a suitably chosen value of π½.
Authors:
(1) Krishnendu Chatterjee, IST Austria, Austria ([email protected]);
(2) Amirali Ebrahimzadeh, Sharif University of Technology, Iran ([email protected]);
(3) Mehrdad Karrabi, IST Austria, Austria ([email protected]);
(4) Krzysztof Pietrzak, IST Austria, Austria ([email protected]);
(5) Michelle Yeo, National University of Singapore, Singapore ([email protected]);
(6) ΓorΔe Ε½ikeliΔ, Singapore Management University, Singapore ([email protected]).
This paper is