Fix downgrade stuck #1501

Merged 5 commits into dapr:release-1.15 from fix-downgrade-stuck on Mar 13, 2025


antontroshin (Contributor)

Description

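Timeout and completion channels in the tests weren't explicitly closed.
A deferred cancel inside a for loop was leaking: deferred calls run only when the enclosing function returns, so each iteration's cancel stayed pending until the function exited.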
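
To illustrate the deferred cancel leak described above, here is a minimal sketch in Go. It is not the code changed in this PR: `checkOnce`, `pollLeaky`, and `pollFixed` are hypothetical names, and the `pending` counter exists only to make the difference observable. The assumption is that each loop iteration creates its own context with `context.WithTimeout`.

```go
package main

import (
	"context"
	"fmt"
	"time"
)

// checkOnce is a hypothetical stand-in for one polling step in a test.
// It returns the context's error if the deadline passes first.
func checkOnce(ctx context.Context, attempt int) error {
	select {
	case <-ctx.Done():
		return ctx.Err()
	case <-time.After(time.Millisecond):
		return nil
	}
}

// pollLeaky defers cancel inside the loop body. Deferred calls run only when
// pollLeaky returns, so every iteration's context stays alive until then.
// It returns how many cancel funcs were still pending when the loop ended.
func pollLeaky(attempts int) int {
	pending := 0
	for i := 0; i < attempts; i++ {
		ctx, cancel := context.WithTimeout(context.Background(), time.Second)
		defer cancel() // runs at function exit, not at the end of the iteration
		pending++
		_ = checkOnce(ctx, i)
	}
	return pending
}

// pollFixed wraps each iteration in its own function so the deferred
// cancel runs as soon as that iteration finishes.
func pollFixed(attempts int) int {
	pending := 0
	for i := 0; i < attempts; i++ {
		func() {
			ctx, cancel := context.WithTimeout(context.Background(), time.Second)
			pending++
			defer func() {
				cancel()
				pending--
			}()
			_ = checkOnce(ctx, i)
		}()
	}
	return pending
}

func main() {
	fmt.Println("leaky pending cancels:", pollLeaky(3))
	fmt.Println("fixed pending cancels:", pollFixed(3))
}
```

Calling `cancel()` explicitly at the end of each iteration would work as well; the closure form keeps the `defer` idiom while limiting its scope to a single iteration.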

Issue reference

We strive to have all PRs opened based on an issue where the problem or feature has been discussed prior to implementation.

Please reference the issue this PR will close: #[issue number]

Checklist

Please make sure you've completed the relevant tasks for this PR, out of the following list:

  • Code compiles correctly
  • Created/updated tests
  • Extended the documentation

@antontroshin antontroshin changed the base branch from master to release-1.15 March 7, 2025 03:15
@antontroshin antontroshin marked this pull request as ready for review March 11, 2025 17:43
@antontroshin antontroshin requested review from a team as code owners March 11, 2025 17:43
@yaron2 yaron2 merged commit 16cc1d1 into dapr:release-1.15 Mar 13, 2025
27 checks passed
@antontroshin antontroshin deleted the fix-downgrade-stuck branch March 13, 2025 15:05
yaron2 added a commit that referenced this pull request Mar 14, 2025
* use non-deprecated flags in List operation (#1478)

Signed-off-by: yaron2 <[email protected]>

* Scheduler: set broadcast address to localhost:50006 in selfhosted (#1480)

* Scheduler: set broadcast address to localhost:50006 in selfhosted

Signed-off-by: joshvanl <[email protected]>

* Set scheduler override flag for edge and dev

Signed-off-by: joshvanl <[email protected]>

---------

Signed-off-by: joshvanl <[email protected]>

* Fix scheduler broadcast address for windows (#1481)

Signed-off-by: Anton Troshin <[email protected]>

* Remove deprecated flags (#1482)

* remove deprecated flags

Signed-off-by: yaron2 <[email protected]>

* update Dapr version in tests

Signed-off-by: yaron2 <[email protected]>

---------

Signed-off-by: yaron2 <[email protected]>

* Fix daprsystem configuration retrieval when renewing certificates (#1486)

The issue was found when similar resources that use the name "configurations" were installed in k8s.
In this case, knative's "configurations.serving.knative.dev/v1" was the last in the list, and the command returned the error
`Error from server (NotFound): configurations.serving.knative.dev "daprsystem" not found`

Signed-off-by: Anton Troshin <[email protected]>

* fix: arguments accept units (#1490)

* fix: arguments accept units
`max-body-size` and `read-buffer-size` now accept units as defined in the docs.

Fixes #1489

Signed-off-by: Mike Nguyen <[email protected]>

* chore: gofumpt

Signed-off-by: Mike Nguyen <[email protected]>

* refactor: modify logic to comply with vetting

Signed-off-by: Mike Nguyen <[email protected]>

* chore: gofumpt -w .

Signed-off-by: Mike Nguyen <[email protected]>

* refactor: set defaults
`max-body-size` is defaulted to 4Mi
`request-buffer-size` is defaulted to 4Ki

This is in line with the runtime.

Signed-off-by: Mike Nguyen <[email protected]>

* fix: set defaults in run and annotate

Signed-off-by: Mike Nguyen <[email protected]>

* chore: gofumpt

Signed-off-by: Mike Nguyen <[email protected]>

* refactor: exit with error rather than panic

Co-authored-by: Anton Troshin <[email protected]>
Signed-off-by: Mike Nguyen <[email protected]>

---------

Signed-off-by: Mike Nguyen <[email protected]>
Co-authored-by: Anton Troshin <[email protected]>

* Fix scheduler pod count for 1.15 version when testing master and latest (#1492)

Signed-off-by: Anton Troshin <[email protected]>

* Fix podman CI (#1493)

* Fix podman CI
Update to podman 5.4.0

Signed-off-by: Anton Troshin <[email protected]>

* fix --cpus flag

Signed-off-by: Anton Troshin <[email protected]>

---------

Signed-off-by: Anton Troshin <[email protected]>

* Fix dapr upgrade command incorrectly detecting HA mode for new version 1.15 (#1494)

* Fix dapr upgrade command detecting HA mode for new version 1.15
The scheduler uses 3 replicas by default, which caused a non-HA install to be incorrectly identified as HA.

Signed-off-by: Anton Troshin <[email protected]>

* Fix e2e

Signed-off-by: Anton Troshin <[email protected]>

---------

Signed-off-by: Anton Troshin <[email protected]>

* Fix scheduler address for dapr run with file on Windows (#1497)

Signed-off-by: Anton Troshin <[email protected]>

* release: test upgrade/downgrade for 1.13/1.14/1.15 + mariner (#1491)

* release: test upgrade/downgrade for 1.13/1.14/1.15 + mariner

Signed-off-by: Mike Nguyen <[email protected]>

* fix: version skews

Co-authored-by: Anton Troshin <[email protected]>
Signed-off-by: Mike Nguyen <[email protected]>

* Update tests/e2e/upgrade/upgrade_test.go

Accepted

Co-authored-by: Anton Troshin <[email protected]>
Signed-off-by: Yaron Schneider <[email protected]>

* Update tests/e2e/upgrade/upgrade_test.go

Co-authored-by: Anton Troshin <[email protected]>
Signed-off-by: Yaron Schneider <[email protected]>

* Fix downgrade issue from 1.15 by deleting previous version scheduler pods
Update 1.15 RC to latest RC.18

Signed-off-by: Anton Troshin <[email protected]>

* Fix downgrade 1.15 to 1.13 scenario with 0 scheduler pods

Signed-off-by: Anton Troshin <[email protected]>

* increase update test timeout to 60m and update latest version to 1.15

Signed-off-by: Anton Troshin <[email protected]>

* fix httpendpoint tests cleanup and checks

Signed-off-by: Anton Troshin <[email protected]>

* make sure matrix runs appropriate tests, every matrix ran the same tests

Signed-off-by: Anton Troshin <[email protected]>

* skip TestKubernetesRunFile on HA

Signed-off-by: Anton Troshin <[email protected]>

* fix skip TestKubernetesRunFile on HA

Signed-off-by: Anton Troshin <[email protected]>

* update to latest dapr 1.15.2

Signed-off-by: Anton Troshin <[email protected]>

* add logs when waiting for pod deletion

Signed-off-by: Anton Troshin <[email protected]>

---------

Signed-off-by: Mike Nguyen <[email protected]>
Signed-off-by: Yaron Schneider <[email protected]>
Signed-off-by: Anton Troshin <[email protected]>
Co-authored-by: Anton Troshin <[email protected]>
Co-authored-by: Yaron Schneider <[email protected]>
Co-authored-by: Anton Troshin <[email protected]>

* Fix dapr init test latest version retrieval (#1500)

Lint

Signed-off-by: Anton Troshin <[email protected]>

* Fix downgrade stuck (#1501)

* Fix goroutine channel leaks and ensure proper cleanup in tests

Signed-off-by: Anton Troshin <[email protected]>

* Add artificial delay before deleting scheduler pods during downgrade

Signed-off-by: Anton Troshin <[email protected]>

* Add timeout to helm upgrade tests, they sometimes get stuck for 5+ minutes

Signed-off-by: Anton Troshin <[email protected]>

* bump helm.sh/helm/v3 to v3.17.1

Signed-off-by: Anton Troshin <[email protected]>

---------

Signed-off-by: Anton Troshin <[email protected]>

---------

Signed-off-by: yaron2 <[email protected]>
Signed-off-by: joshvanl <[email protected]>
Signed-off-by: Anton Troshin <[email protected]>
Signed-off-by: Mike Nguyen <[email protected]>
Signed-off-by: Yaron Schneider <[email protected]>
Co-authored-by: Yaron Schneider <[email protected]>
Co-authored-by: Josh van Leeuwen <[email protected]>
Co-authored-by: Mike Nguyen <[email protected]>
2 participants