~maas-committers/maas/+git/temporal:release/v1.21.x

Last commit made on 2024-03-28
Get this branch:
git clone -b release/v1.21.x https://git.launchpad.net/~maas-committers/maas/+git/temporal

Branch information

Name:
release/v1.21.x
Repository:
lp:~maas-committers/maas/+git/temporal

Recent commits

1c57f2d... by Tim Deeb-Swihart <email address hidden>

Update server version

31b73c6... by Yimin Chen <email address hidden>

Validate normal TQ on sticky poll (#5488)

## What changed?
Validate normal TQ on sticky poll

## Why?
On a sticky poll, we fetch the normal TQ's UserData, so an invalid
normal TQ on a sticky queue would still trigger a load of that invalid
normal queue.
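
As a rough illustration of the idea (not the code from this PR), the check can be sketched as validating the normal queue name before anything is loaded for it. The `stickyPoll` type, `validateTaskQueueName`, `handleStickyPoll`, the length limit, and the "non-empty and bounded length" rule are all hypothetical names and assumptions made for this sketch.

```go
package main

import (
	"errors"
	"fmt"
)

// maxTaskQueueNameLen is an assumed limit, chosen only for illustration.
const maxTaskQueueNameLen = 1000

var errInvalidTaskQueue = errors.New("invalid task queue name")

// stickyPoll is a hypothetical stand-in for a poll request on a sticky queue.
type stickyPoll struct {
	StickyName string // the sticky queue being polled
	NormalName string // the normal queue whose user data would be fetched
}

// validateTaskQueueName applies an assumed rule: non-empty and bounded length.
func validateTaskQueueName(name string) error {
	if name == "" || len(name) > maxTaskQueueNameLen {
		return fmt.Errorf("%w: %q", errInvalidTaskQueue, name)
	}
	return nil
}

// handleStickyPoll validates both names first, so that an invalid normal
// queue name never reaches the load callback.
func handleStickyPoll(p stickyPoll, load func(name string)) error {
	if err := validateTaskQueueName(p.StickyName); err != nil {
		return err
	}
	if err := validateTaskQueueName(p.NormalName); err != nil {
		return err
	}
	load(p.NormalName)
	return nil
}

func main() {
	load := func(name string) { fmt.Println("loading user data for", name) }
	err := handleStickyPoll(stickyPoll{StickyName: "sticky-1", NormalName: ""}, load)
	fmt.Println("error:", err)
}
```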

## How did you test it?
Manual test with invalid TQ.


679e3dc... by Yimin Chen <email address hidden>

Add validation for a few string fields (#5487)

## What changed?
Add string validation for a few string fields.

## Why?
Since UTF-8 string validation is disabled at the proto level, we want to
enforce minimal validation for some key fields.

## How did you test it?
Unit tests

## Potential risks
No
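
A minimal sketch of this kind of check, assuming the validation in question is a UTF-8 validity test on selected fields. The `field` type, `validateKeyFields`, and the field names are hypothetical; only `utf8.ValidString` from the Go standard library is a real API.

```go
package main

import (
	"fmt"
	"unicode/utf8"
)

// field pairs a (hypothetical) field name with the value to check.
type field struct {
	Name  string
	Value string
}

// validateKeyFields returns an error naming the first field, in argument
// order, whose value is not valid UTF-8.
func validateKeyFields(fields ...field) error {
	for _, f := range fields {
		if !utf8.ValidString(f.Value) {
			return fmt.Errorf("field %s is not valid UTF-8", f.Name)
		}
	}
	return nil
}

func main() {
	err := validateKeyFields(
		field{Name: "WorkflowId", Value: "order-42"},
		field{Name: "TaskQueue", Value: "bad\xffname"}, // \xff is never valid UTF-8
	)
	fmt.Println("error:", err)
}
```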


1eaaaf8... by Haifeng He <email address hidden>

Increase RecordHeartbeat frequency in VerifyReplicationTask (#4771)


194a893... by Haifeng He <email address hidden>

Skip verifying workflows that have already passed their retention time (#4770)

<!-- Describe what has changed in this PR -->
**What changed?**

The previous logic skipped workflows whose retention time was within a
range of the current time. We actually only need to skip workflows that
have already passed their retention time.


8e56612... by Yichao Yang <email address hidden>

Turn ErrWorkflowClosing into a retryable error (#4765)

cadd933... by Yimin Chen <email address hidden>

Remove debug logging from Cassandra schema update tool (#4738)

<!-- Describe what has changed in this PR -->
**What changed?**
Remove debug logs from Cassandra schema upgrade tool.

<!-- Tell your future self why have you made these changes -->
**Why?**
The debug log contains the CQL config, which includes the password.

<!-- How have you verified this change? Tested locally? Added a unit
test? Checked in staging env? -->
**How did you test it?**
Local test.

<!-- Assuming the worst case, what can be broken when deploying this
change to production? -->
**Potential risks**
No

<!-- Is this PR a hotfix candidate or require that a notification be
sent to the broader community? (Yes/No) -->
**Is hotfix candidate?**
No

7e216c3... by rodrigozhou <email address hidden>

Bump server version to 1.21.5

524bda3... by Michael Snowden <email address hidden>

Fix bug causing duplicates when listing s3-archived workflows (#4712)

<!-- Describe what has changed in this PR -->
**What changed?**
We now only return one record per workflow execution when listing
archived workflows with the S3 storage backend.
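
The PR defers the details of the fix to comments in the code, so the following is only a generic sketch of the stated outcome (one record per workflow execution), not the actual change. The `record` type and `dedupeByExecution` are hypothetical, and identifying an execution by workflow ID plus run ID is an assumption.

```go
package main

import "fmt"

// record is a hypothetical stand-in for one listed archive entry.
type record struct {
	WorkflowID string
	RunID      string
}

// dedupeByExecution keeps the first record seen for each
// (WorkflowID, RunID) pair and preserves the input order.
func dedupeByExecution(records []record) []record {
	seen := make(map[[2]string]struct{}, len(records))
	out := make([]record, 0, len(records))
	for _, r := range records {
		key := [2]string{r.WorkflowID, r.RunID}
		if _, ok := seen[key]; ok {
			continue
		}
		seen[key] = struct{}{}
		out = append(out, r)
	}
	return out
}

func main() {
	in := []record{
		{WorkflowID: "wf-1", RunID: "run-a"},
		{WorkflowID: "wf-1", RunID: "run-a"}, // duplicate of the first entry
		{WorkflowID: "wf-2", RunID: "run-b"},
	}
	fmt.Println(len(dedupeByExecution(in)), "unique executions")
}
```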

<!-- Tell your future self why have you made these changes -->
**Why?**
To fix
https://github.com/temporalio/temporal/issues/4432#issuecomment-1661078704.
There are more details in the code comments about why this bug
happened and how it is fixed.

<!-- How have you verified this change? Tested locally? Added a unit
test? Checked in staging env? -->
**How did you test it?**
I modified our unit test to verify that we don't return any duplicates.
The previous unit test was actually too permissive and allowed
duplicates. I also manually verified this by restarting the server.

Before:

<img width="947" alt="image"
src="https://github.com/temporalio/temporal/assets/5942963/3a616521-f26b-45f6-b7b4-063f6c3b9321">

After:

<img width="966" alt="image"
src="https://github.com/temporalio/temporal/assets/5942963/bd650c30-dc56-4260-8439-237c5ac75ddb">

<!-- Assuming the worst case, what can be broken when deploying this
change to production? -->
**Potential risks**
The ability to list archived workflows when using S3 is very new (it
was completely unsupported before), and the changes here are isolated
to that query path, so there is little to lose even if this just
crashes.

<!-- Is this PR a hotfix candidate or require that a notification be
sent to the broader community? (Yes/No) -->
**Is hotfix candidate?**
Should update the issue when this lands.

5722c23... by Haifeng He <email address hidden>

Continue replication verification by skipping workflows that should be, or are soon to be, deleted (#4734)

<!-- Describe what has changed in this PR -->
**What changed?**
The source and target clusters handle workflow retention separately. A
workflow that is about to be deleted may already be deleted on the
target cluster but still exist on the source, which delays
verification.
We skip workflows whose retention time is close to the current time so
that verification can continue.
