Skip to content

[sql-32] accounts: add migration code from kvdb to SQL #1047

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged

Conversation

ViktorTigerstrom
Copy link
Contributor

This PR introduces the migration logic for transitioning the accounts store from kvdb to SQL.

Note that as of this PR, the migration is not yet triggered by any production code, i.e. only tests execute the migration logic.

Part of #917

@ViktorTigerstrom ViktorTigerstrom added the no-changelog This PR is does not require a release notes entry label Apr 22, 2025
Copy link
Member

@ellemouton ellemouton left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Soooooo excited to see this come through! I couldnt help but take a look even though review not requested - just leaving some drive by comments in the mean time :)

I'll wait till review requested before looking again :)

Comment on lines 329 to 335

t.Run("Postgres", func(t *testing.T) {
migrationTest(t, kvStore, testClock, false)
})

t.Run("SQLite", func(t *testing.T) {
migrationTest(t, kvStore, testClock, true)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

in Lit, we've gone with the build flag approach. So can just use the existing NewTestDB methods no? and then if bbolt backend, just skip the test ?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Updated as discussed offline :)

@@ -732,6 +733,79 @@ func (s *SQLStore) StoreLastIndexes(ctx context.Context, addIndex,
})
}

func makeInsertAccountParams(account *OffChainBalanceAccount) (
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

let's keep the migration code in the same file

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ok, yeah agree :)! Had it here previously to mimic lnd's migration implementation.

@ViktorTigerstrom ViktorTigerstrom force-pushed the 2025-04-migrate-accounts branch from 3072e48 to b4cffd8 Compare April 24, 2025 09:12
Comment on lines 327 to 336
{
"randomized accounts",
true,
randomizeAccounts,
},
{
"rapid randomized accounts",
true,
rapidRandomizeAccounts,
},
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I added 2 tests here currently, where one uses rapid for the randomisation, and one uses the normal "golang.org/x/exp/rand" testing semi-randomisation.

The drawback with the rapid test, is that we can't really generate too many invoices and and payments per account, as you can't really limit the amount of times the rapid.Check runs in golang (i.e. the number of accounts we populate the db with), without limiting the amount of runs the Check function would run for the entire litd project. I.e. if we limited it for this test, that would impact any other tests in the future that uses the rapid.Check function. There is an open issue for this in the rapid lib, see:
flyingmutant/rapid#38

When running the migration locally, the test migration execution time really goes up if you have a lot of accounts with a lot of payments and invoices, and therefore I opted to not add such a test to not make the make unit execution time take too long locally.

Therefore, I also added a normal test that doesn't use rapid for randomization, so that test adds up to 1000 invoices and payments.

In the end though , I can delete one of the tests if you prefer any test over the other :)!

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think I prefer randomizeAccounts since it's more concise and performant, great you tested out both approaches 👍

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Cool, thanks for the input! Would he interested to hear Elle's opinion here as well :)

Copy link
Contributor

@bitromortac bitromortac left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Awesome work 🚀! Will certainly need another pass, since I haven't reviewed many migration PRs

AccountID: sqlID,
Hash: hash[:],
Status: int16(entry.Status),
FullAmountMsat: int64(entry.FullAmount),
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

an overflow is very unrealistic here, right?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah I think we have to make a choice here, and for migrations in general:
We can either:

  1. Assume that since this is a migration, the data we are migrating must have already been sanity checked and rule checked during the initial insertion into the kvdb, and therefore errors like that shouldn't be possible to occur during the migration.
  2. We sanity check and check the rules again for all data during the migration.

I have opted for option 1 here and for upcoming migrations, as I do think that should be the case, and we'd only let the migrations use more resources during the migration if we chose option 2 instead.

But I'd very interested to hear from both of you if you think we should instead go with option 2 here.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, I'm unsure, maybe we should be extra careful with amounts, but we can leave it as is.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think we are already indirectly checking though: if we went with option 2, we'd check here and error out if it did overflow - but if we go with option 2, we will just fail later on when we do the reflect.DeapEqual check.

so i think for simplicity we dont need to add the redundant check

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I agree that there's no big difference between 1 & 2 in ths migration. But there will be a difference for future migrations like sessions.

So for example in sessions:
To go with option 2, we'd also check when migrating a session that has linked sessions, that all prior sessions has been revoked. However I don't think that really makes sense as that's the type of rule that should have been checked when the entry was added to the kvdb, and would only add extra execution load during the migration IMO.

Would be interested to hear what both of your opinions is in regards to if we should do such sanity checking or not @bitromortac & @ellemouton!

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think even if a MaxUint64 would be present here, it wouldn't be a problem, because the binary format of a unit64 and int64 doesn't change with a conversion during the marshal/unmarshal operations.

I'm not sure if sanity checks should be done, but maybe we can decide case by case?

Comment on lines 327 to 336
{
"randomized accounts",
true,
randomizeAccounts,
},
{
"rapid randomized accounts",
true,
rapidRandomizeAccounts,
},
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think I prefer randomizeAccounts since it's more concise and performant, great you tested out both approaches 👍

@ViktorTigerstrom ViktorTigerstrom force-pushed the 2025-04-migrate-accounts branch from 8cd1631 to 7ccdd7f Compare May 2, 2025 08:44
In preparation for upcoming migration tests from a kvdb to an SQL store,
this commit updates the NewTestDB function to return the Store interface
rather than a concrete store implementation.

This change ensures that migration tests can call NewTestDB under any
build tag while receiving a consistent return type.
@ViktorTigerstrom ViktorTigerstrom force-pushed the 2025-04-migrate-accounts branch from 7ccdd7f to 26b6809 Compare May 2, 2025 08:48
@ViktorTigerstrom
Copy link
Contributor Author

Thanks for the reviews! I've addressed feedback and left comments with the second last push. In the last push, I rebased on master to address a merge conflict.

Copy link
Contributor

@bitromortac bitromortac left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM 🔥🚀

},
},
{
"account with updated expiry",
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I guess the intention here is to test the UpdatedAt field, right? If so that would be a better name for the test

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah your right, updated the name :).

},
},
{
"account with updated balance",
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this test needed? it seems like it only tests CreditAccount/DebitAccount in addition. Also, maybe we can remove the HasExpired checks where applicable, as this adds a bit of noise?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah I wanted to show the other type of update that you can do to an account, and that this didn't cause any issues to the migration. But you're right that it didn't add any real value, so I went ahead and removed the test :).

Comment on lines 383 to 384
// randomize balance from 1,000 to 100,000,000
balance := lnwire.MilliSatoshi(
rand.Int63n(100000000-1000) + 1000,
)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is it important that we have a lower bound in these tests? Nano-nit: the upper bound is not included, just to be aware

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No not really, think I mainly wanted to just have some kind of balance, as I don't think the actual value should matter?

AccountID: sqlID,
Hash: hash[:],
Status: int16(entry.Status),
FullAmountMsat: int64(entry.FullAmount),
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, I'm unsure, maybe we should be extra careful with amounts, but we can leave it as is.

@ViktorTigerstrom
Copy link
Contributor Author

Thanks for the review @bitromortac! Will address feedback after next review round 🚀!

Copy link
Member

@ellemouton ellemouton left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

looking really great!

Note that as of this PR, the migration is not yet triggered by any production code, i.e. only tests execute the migration logic.

so reviewing this PR made me realise we should perhaps flip the order of things a bit: basically, i still want to be able to test this PR properly (ie, i want to run master branch - add some accounts, then switch to this PR and see that the accounts get migrated properly).

So this means that under the dev tag, we should first just add an option to point to an existing kvdb folder where we can migrate any existing accounts from (if we spin up in sql mode). Note: this probably means adding a set-up PR before we merge this one. But yeah - i think well worth it so that we can test these individually

populateDB func(t *testing.T, kvStore *BoltStore)
}{
{
"empty",
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

let's always use named fields 🙏

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

reason being: right now, it's hard to do a quick check of things like "are there tests that cover expectedLastIndex=false" for example. if they are named, then it is very easy to check

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ah good point, updated!

}

for _, test := range tests {
tc := test
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I thiiiink we dont need to do this anymore right?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hmmm the sources I found seems to indicate that this is necessary, so if you know that something has changed, I'd really be interested to hear more about that 😃?

I can see that in lnd, this pattern is preset as soon as we use t.Parallel(), however in litd it seems like it varies. So I'm not fully sure if this is needed or not :)?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yeah not needed

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks 🙏! Removed with the latest push!

require.NoError(t, kvStore.db.Close())
})

migrationTest(t, kvStore, testClock, tc.expectLastIndex)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

not loving the flow here cause some of the setup is done here and the rest is done in migrationTest.

Thinking something like this instead:

	// Create a new kvdb store to populate with test data.
			kvStore, err := NewBoltStore(
				t.TempDir(), DBFilename, clock,
			)
			require.NoError(t, err)
			t.Cleanup(func() {
				require.NoError(t, kvStore.db.Close())
			})

			// Populate the kv store.
			test.populateDB(t, kvStore)

			// Create the SQL store that we will migrate the data
			// to.
			sqlStore, txEx := makeSQLDB(t)

			// Perform the migration.
			var opts sqldb.MigrationTxOptions
			err = txEx.ExecTx(ctx, &opts, func(tx SQLQueries) error {
				return MigrateAccountStoreToSQL(
					ctx, kvStore, tx,
				)
			})
			require.NoError(t, err)
			
			// Assert migration results.
			assertMigrationResults(
				t, kvStore, sqlStore, test.expectLastIndex,
			)

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

notice how the order of operations mimics that of what would actually happen on a real node.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Agree, thanks 🙏! I updated this, and also took #1047 (comment) into consideration. So therefore I changed your draft code to also fetch all accounts and indices prior to the migration.

@lightninglabs-deploy
Copy link

@ViktorTigerstrom, remember to re-request review from reviewers when ready

In preparation for the migration from kvdb to SQL, we update the results
of the ListAllAccounts query. After the migration has been implemented,
we will test that the result off all accounts in the SQL database
is the same as the result of the fetching all accounts in the kvdb.

By ordering the SQL query's result by account id, we ensure that all
accounts are returned in the same order as they are returned by the
kvdb.
@ViktorTigerstrom
Copy link
Contributor Author

Re-requested a review from you @bitromortac as quite a bit of the testing code has changed, even though you previously ACKed. I hope that's ok

Copy link
Contributor

@bitromortac bitromortac left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, the test code looks nicer now 🎉. Tested the migration manually as well, I ran a few commands for accounts (account creation/invoices/payments).

}

for _, test := range tests {
tc := test
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

AccountID: sqlID,
Hash: hash[:],
Status: int16(entry.Status),
FullAmountMsat: int64(entry.FullAmount),
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think even if a MaxUint64 would be present here, it wouldn't be a problem, because the binary format of a unit64 and int64 doesn't change with a conversion during the marshal/unmarshal operations.

I'm not sure if sanity checks should be done, but maybe we can decide case by case?

Copy link
Member

@ellemouton ellemouton left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

tACK, LGTM!

Couple of comments that came up during testing (but not for this PR):

  1. currently we use BIGINT for account alias (8 byte) which becomes an int64 but then for session alias (4 bytes) we use a blob. should we make these the same type? iirc, we changed the account alias just for the space saving.
  2. when we do the actual migration should we delete the old kvdb files? or perhaps we should leave them for the time being. and then in a future release we can do a clean up once we are more sure things are ok

}

for _, test := range tests {
tc := test
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yeah not needed

This commit introduces the migration logic for transitioning the
accounts store from kvdb to SQL.

Note that as of this commit, the migration is not yet triggered by any
production code, i.e. only tests execute the migration logic.
@ViktorTigerstrom ViktorTigerstrom force-pushed the 2025-04-migrate-accounts branch from 81129d6 to 1e0e88b Compare May 19, 2025 12:29
@guggero guggero merged commit 0015ca2 into lightninglabs:master May 19, 2025
22 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
no-changelog This PR is does not require a release notes entry
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants