Adds temporary pallet to fellowship runtimes to fix Staking corrupted ledgers #2

gpestana · 2024-07-23T16:13:01Z

https://hackmd.io/m_h9DRutSZaUqCwM9tqZ3g?view

Temporary pallet that exposes a new extrinsic restore_ledger which:

"Forwards" the call to Staking.restore_ledger;
Does an extra check and potentially unstakes the ledger after the ledger is recovered (missing in Staking.restore_ledger);
Any signed origin can call the extrinsic.

Pre-requisites for stashes to be recovered by pallet-staking:

Are whitelisted (set in the runtime config)
Are associated with a corrupted ledger (checked at runtime by pallet-staking)
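The flow described above can be sketched as a plain-Rust model with no FRAME dependencies. Everything in it is hypothetical and only for illustration: `Chain`, `Ledger`, `FixerError`, and `Outcome` are not the pallet's types, the `corrupted` flag stands in for the check that pallet-staking performs at runtime, and "force unstake" is modeled as simply dropping the ledger.

```rust
use std::collections::HashMap;

type AccountId = u32;
type Balance = u128;

/// Hypothetical, simplified stand-in for a staking ledger.
#[derive(Debug, Clone, PartialEq)]
struct Ledger {
    total: Balance,
    /// Stand-in for the corruption check done by pallet-staking.
    corrupted: bool,
}

#[derive(Debug, PartialEq)]
enum FixerError {
    NotWhitelisted,
    CannotRestoreLedger,
}

#[derive(Debug, PartialEq)]
enum Outcome {
    Restored,
    RestoredAndUnstaked,
}

/// Hypothetical in-memory model of the state the extrinsic touches.
struct Chain {
    whitelist: Vec<AccountId>,
    ledgers: HashMap<AccountId, Ledger>,
    free_balance: HashMap<AccountId, Balance>,
}

impl Chain {
    /// Models the extrinsic: any signed caller is accepted, but only
    /// whitelisted stashes with a corrupted ledger are mutated.
    fn restore_ledger(&mut self, _caller: AccountId, stash: AccountId) -> Result<Outcome, FixerError> {
        if !self.whitelist.contains(&stash) {
            return Err(FixerError::NotWhitelisted);
        }
        // Stand-in for forwarding the call to Staking.restore_ledger.
        let ledger = self.ledgers.get_mut(&stash).ok_or(FixerError::CannotRestoreLedger)?;
        if !ledger.corrupted {
            return Err(FixerError::CannotRestoreLedger);
        }
        ledger.corrupted = false;
        let total = ledger.total;
        // Extra check: unstake if the free balance does not cover the ledger total.
        let free = self.free_balance.get(&stash).copied().unwrap_or(0);
        if total > free {
            self.ledgers.remove(&stash);
            Ok(Outcome::RestoredAndUnstaked)
        } else {
            Ok(Outcome::Restored)
        }
    }
}
```

In this model a second call for the same stash fails with `CannotRestoreLedger`, because the ledger is no longer flagged as corrupted after the first call.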

test results with chopsticks
refactor the pallet out of relay/polkadot to be exposed to Kusama runtime too
weights
docs, clean up, etc
open PR against https://github.com/polkadot-fellows/runtimes after initial feedback


relay/temp-pallets/pallet-staking-fixer/Cargo.toml

kianenigma · 2024-08-02T11:40:51Z

relay/temp-pallets/pallet-staking-fixer/src/lib.rs

+		/// restored, the ledger locks are higher or equal than the stash's free balance. If not, it
+		/// forces the unstake of the ledger.
+		///
+		/// Safety note: Only ledgers associated `stash` that are corrupted will be mutated. Thus it


So you are saying that the original extrinsic was needlessly restrictive, and it is generally okay to let anyone call these?

The fn restore_ledger will fail with CannotRestoreLedger if the ledger state does not fall into one of the corrupted ledger cases (basically, it is gated by fn inspect_bond_state). The idea of requiring the staking admin origin was to make explicit, through a referendum, which ledgers were being restored (and to add an extra layer of safety).

To add more safety (i.e. not relying solely on fn inspect_bond_state to ensure that the ledger should be mutated and recovered), we also add a whitelisting mechanism, so we can control in the configs which stashes need to be recovered based on the current on-chain state.
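The two layers of gating mentioned here (the bond-state inspection plus the whitelist) could look roughly like the following plain-Rust sketch. It is an assumption-laden model, not the pallet-staking code: `StashView` and `ensure_restorable` are hypothetical, and the integrity-state variant names are assumed here and should be checked against the pallet-staking source.

```rust
type AccountId = u32;
type Balance = u128;

/// Integrity states of a bonded stash (variant names are assumptions of this sketch).
#[derive(Debug, PartialEq, Clone, Copy)]
enum LedgerIntegrityState {
    Ok,
    Corrupted,
    CorruptedKilled,
    LockCorrupted,
}

#[derive(Debug, PartialEq)]
enum Error {
    NotWhitelisted,
    CannotRestoreLedger,
}

/// Hypothetical snapshot of the on-chain data relevant to one stash.
struct StashView {
    /// Stash recorded in the ledger found for this stash, if a ledger exists.
    ledger_stash: Option<AccountId>,
    ledger_total: Balance,
    /// Amount currently locked for staking on the stash account.
    lock: Balance,
}

/// Classifies the ledger state; only non-`Ok` states are candidates for recovery.
fn inspect_bond_state(stash: AccountId, view: &StashView) -> LedgerIntegrityState {
    match view.ledger_stash {
        None => LedgerIntegrityState::CorruptedKilled,
        Some(s) if s != stash => LedgerIntegrityState::Corrupted,
        Some(_) if view.lock != view.ledger_total => LedgerIntegrityState::LockCorrupted,
        Some(_) => LedgerIntegrityState::Ok,
    }
}

/// Applies both gates: the whitelist first, then the bond-state inspection.
fn ensure_restorable(
    stash: AccountId,
    view: &StashView,
    whitelist: &[AccountId],
) -> Result<LedgerIntegrityState, Error> {
    if !whitelist.contains(&stash) {
        return Err(Error::NotWhitelisted);
    }
    match inspect_bond_state(stash, view) {
        LedgerIntegrityState::Ok => Err(Error::CannotRestoreLedger),
        state => Ok(state),
    }
}
```

With both gates in place, a healthy ledger is rejected even when its stash is whitelisted, and a corrupted ledger is rejected when its stash is not.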

kianenigma · 2024-08-02T11:42:07Z

relay/temp-pallets/pallet-staking-fixer/src/lib.rs

+
+			// check if stash's free balance covers the current ledger's total amount. If not,
+			// force unstake the ledger.
+			let weight = if ledger.total > T::Currency::free_balance(&stash) {


How many staking ledgers fall into this category? If it is a handful that have to be unstaked, I would rather we hardcode their accounts than 'codify' their conditions, so as to not risk this code path being accidentally accessible to any future state.

The problem with hardcoding stashes for this check is that the condition may change with time from now until we call the extrinsic.

However, we do hardcode the stash accounts that need to be restored now. Through that, we can ensure that only the ledgers that 1. are corrupted and 2. are whitelisted can be recovered, which adds another layer of safety. The whitelisted accounts are added in the runtime configs.
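The timing concern can be made concrete with a small sketch. The comparison mirrors the `ledger.total > free_balance` condition from the diff above; the helper names `must_force_unstake` and `decision_drifted` are hypothetical and exist only for this illustration.

```rust
type Balance = u128;

/// Decides at call time whether a restored ledger must be force-unstaked:
/// only when the ledger total strictly exceeds the stash's free balance.
fn must_force_unstake(ledger_total: Balance, free_balance_now: Balance) -> bool {
    ledger_total > free_balance_now
}

/// Returns true when a decision frozen at build time (hardcoded) would differ
/// from the decision taken when the extrinsic is actually called.
fn decision_drifted(
    ledger_total: Balance,
    free_balance_at_build: Balance,
    free_balance_at_call: Balance,
) -> bool {
    must_force_unstake(ledger_total, free_balance_at_build)
        != must_force_unstake(ledger_total, free_balance_at_call)
}
```

For example, a stash with a ledger total of 100 and a free balance of 120 when the runtime is built would not be marked for unstaking, but if the free balance drops to 80 before the call, the call-time check says it must be unstaked.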

kianenigma · 2024-08-22T14:23:30Z

relay/polkadot/src/lib.rs

@@ -818,6 +818,22 @@ impl pallet_staking::Config for Runtime {
 	type WeightInfo = weights::pallet_staking::WeightInfo<Runtime>;
 }

+parameter_types! {
+	pub WhitelistedStashes: Vec<AccountId> = vec![


Suggested change

pub WhitelistedStashes: Vec<AccountId> = vec![

pub CorruptStashes: Vec<AccountId> = vec![

kianenigma

LGTM, let's move this to the fellowship runtime repo, and open it against main there. We recently missed the 1.3 release. Perhaps we can still backport this into it. Else, we should target 1.4.

…Kusama (#447)

Note: for more details on the corrupted ledgers issue and recovery steps check https://hackmd.io/m_h9DRutSZaUqCwM9tqZ3g?view.

This PR adds a migration in Polkadot and Kusama runtimes to recover the current corrupted ledgers in Polkadot and Kusama. A migration consists of:

1. Call into `pallet_staking::Pallet::<T>::restore_ledger` for each of the "whitelisted" stashes as `Root` origin.
2. Perform a check that ensures the restored ledger's stake does not exceed the stash's current free balance. If it does, force unstake the ledger. This check is currently missing in polkadot-sdk/pallet-staking ([PR with patch here](paritytech/polkadot-sdk#5066)).

The reason to restore the corrupted ledgers as migrations implemented in the fellowship runtimes is twofold:

1. The call to `pallet_staking::Pallet::<T>::restore_ledger` and check + `force_unstake` must be done atomically (thus a ledger can't be safely restored with a set of two distinct extrinsic calls, so it's not possible to use referenda to this effect).
2. To speed up the whole process and avoid having to wait for 1. merge and releases of paritytech/polkadot-sdk#5066 and 2. referenda to call into `Call::restore_ledger` for both Polkadot and Kusama.

Alternatively, we could add a new temporary pallet directly in the fellowship runtime which would expose an extrinsic to restore the ledgers and perform the extra missing check. See this [PR as an example](gpestana#2).

---

- [x] on-runtime-upgrade tests against Polkadot and Kusama
- [x] staking try-state checks passing after all migrations.
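The migration steps listed in this commit message can be sketched as a single pass over the whitelisted stashes. This is a plain-Rust model under stated assumptions, not the migration in polkadot-fellows/runtimes: `State`, `MigrationReport`, and `restore_corrupted_ledgers` are hypothetical, the `bool` in each ledger entry stands in for pallet-staking's corruption check, "force unstake" is modeled as removing the ledger, and skipping stashes that cannot be restored is an assumption of this sketch.

```rust
use std::collections::HashMap;

type AccountId = u32;
type Balance = u128;

/// Hypothetical summary of what one migration run did.
#[derive(Debug, Default, PartialEq)]
struct MigrationReport {
    restored: u32,
    force_unstaked: u32,
    skipped: u32,
}

/// Hypothetical in-memory stand-in for the staking state touched by the migration.
struct State {
    /// stash -> (ledger total, is the ledger corrupted?)
    ledgers: HashMap<AccountId, (Balance, bool)>,
    free_balance: HashMap<AccountId, Balance>,
}

/// Restores each listed stash and runs the balance check in the same step,
/// so no other call can observe a restored-but-unchecked ledger.
fn restore_corrupted_ledgers(state: &mut State, stashes: &[AccountId]) -> MigrationReport {
    let mut report = MigrationReport::default();
    for stash in stashes {
        // Step 1: restore the ledger (stand-in for restore_ledger as Root).
        let total = match state.ledgers.get_mut(stash) {
            Some(entry) if entry.1 => {
                entry.1 = false;
                entry.0
            }
            // Missing or healthy ledgers are left untouched.
            _ => {
                report.skipped += 1;
                continue;
            }
        };
        report.restored += 1;
        // Step 2: force unstake when the free balance cannot cover the stake.
        let free = state.free_balance.get(stash).copied().unwrap_or(0);
        if total > free {
            state.ledgers.remove(stash);
            report.force_unstaked += 1;
        }
    }
    report
}
```

Keeping both steps inside one function is what models the atomicity argument: with two separate extrinsics, the state between step 1 and step 2 would be visible on-chain.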

init temp corrupted ledger restore

33d29e4

gpestana marked this pull request as draft July 24, 2024 07:02

gpestana changed the title ~~Adds temporary pallet runtimes to fix Staking corrupted ledgers~~ Adds temporary pallet to fellowship runtimes to fix Staking corrupted ledgers Jul 24, 2024

gpestana added 3 commits July 24, 2024 17:26

moves temp pallet under /relay/temp-pallets; adds weights to pallet's…

c7a3cb1

… call

docs

30ed47a

adds temp pallet to kusama runtime

a18da89

gpestana mentioned this pull request Jul 30, 2024

Patches Call::Staking.restore_ledger to ensure a restored ledger has enough free balance to cover staking locks paritytech/polkadot-sdk#5066

Open

kianenigma reviewed Aug 2, 2024


relay/temp-pallets/pallet-staking-fixer/Cargo.toml

kianenigma reviewed Aug 2, 2024


gpestana added 2 commits August 21, 2024 20:35

whitelist

8fa7cc6

ident fix

2ecacb0

gpestana requested a review from kianenigma August 22, 2024 11:21

kianenigma reviewed Aug 22, 2024


kianenigma approved these changes Aug 22, 2024


update polkadot whitelist

207ddc2

gpestana mentioned this pull request Aug 26, 2024

Adds migrations to restore currupted staking ledgers in Polkadot and Kusama polkadot-fellows/runtimes#447

Merged

2 tasks
