Guard notifying outdated scheduler #1559

rorbech · 2023-11-02T08:36:37Z

TODOs:

Await that core fix is merged Change the C API around custom scheduler realm-core#7102

Closes #1543
Closes #1558
Closes #1561

clementetb

Looks great.

clementetb · 2023-11-02T13:46:34Z

packages/cinterop/src/nativeDarwin/kotlin/io/realm/kotlin/internal/interop/RealmInterop.kt

            scope.launch {
                try {
                    printlntid("on dispatcher")
-                    realm_wrapper.realm_scheduler_perform_work(scheduler)
+                    if (!cancelled.value) {


~~Doesn't core wait until the jobs have been executed to release the scheduler?~~

~~Also, this guard does not cover 100% of the cases, the pointer might have been released just after accessing cancelled.~~

Because closing and job execution happens on the same thread there is no way that the Realm gets closed while executing the Job.

clementetb · 2023-11-02T14:56:35Z

Instead of guarding the SingleThreadDispatcherScheduler, we could cancel the coroutine scope just after closing the live Realm here.

Closing the coroutine there would cancel any pending core jobs that were posted.

rorbech · 2023-11-03T08:56:44Z

Instead of guarding the SingleThreadDispatcherScheduler, we could cancel the coroutine scope just after closing the live Realm here.

Closing the coroutine there would cancel any pending core jobs that were posted.

If we have a user supplied scheduler then we cannot close it. There could also potentially be multiple scheduler in the same scope so feels better to use the exact signal from the user-data-free-callback to guard this.

clementetb · 2023-11-03T09:09:34Z

Yes, we shouldn't be closing the main CoroutineContext after releasing the dispatcher.

The if-guard works, but depends on AtomicBoolean and requires to implement it on each platform. Coroutines have some structures to handle job cancelation gracefully.

For example instead of working with the main CoroutineContext we could have a specific CoroutineScope associated to the scheduler to track all the jobs that have been posted via the scheduler. Once the scheduler is deleted, we can cancel the scope, and thus all pending jobs.

rorbech · 2023-11-03T09:25:14Z

But coroutine scopes are not preemptive so you are not guaranteed that any running coroutine will be immediately aborted. But you are right the current guard is actually not good enough ... though it looks like some of the schedulers in core uses a similar flag so might be that there are some other guarantees in effect here 🤔

nhachicha

Nice fix 👍 it looks like io.realm.kotlin.test.darwin.CoroutineTests.dispatchBetweenThreads is failing for macOS on CI

rorbech · 2023-11-03T14:12:47Z

Nice fix 👍 it looks like io.realm.kotlin.test.darwin.CoroutineTests.dispatchBetweenThreads is failing for macOS on CI

The failing test is caused by trying to clean up the liveRealmContext in f3e4ce3#diff-291d3886fa0a21b98dc0ebd5ff0ce3846d78dab19a8d3ccffe6386899f0879a4. I have verified that it also happens on main if I try it out there so it has nothing to do with this PR, hence I therefore removed it from this PR to avoid stalling it.

cmelchior

Awesome 💯

cmelchior · 2023-11-09T09:13:31Z

...ages/test-sync/src/commonTest/kotlin/io/realm/kotlin/test/mongodb/common/SyncedRealmTests.kt

@@ -753,9 +753,10 @@ class SyncedRealmTests {
                        .mutableRealmIntField
                        .increment(1)
                }
+                realm.syncSession.uploadAllLocalChanges(10.seconds)


Great catch 🙈, but any reason you are adding a timeout? The first upload doesn't have one

I added it because it would highlight what action is actually not executing as expected instead of just timeout out on the recipient side. I just added if for the uploads that I inserted, but just didn't walk over the rest of the code.

rorbech added 3 commits November 1, 2023 14:13

Fix crashes when posting to released scheduler

b5a126f

Rework scheduler life cycle

f3e4ce3

Clean up

f44046a

github-actions bot assigned rorbech Nov 2, 2023

rorbech added 2 commits November 2, 2023 09:37

Naming

6d55eb6

Fix test build errors

93be8fb

rorbech mentioned this pull request Nov 2, 2023

Workaround for client reset flaky tests #1552

Closed

Bump to latest core

fc7b91e

rorbech requested review from nhachicha and clementetb November 2, 2023 09:53

rorbech changed the title ~~Fix crashes when notifying outdated scheduler~~ Guard notifying outdated scheduler Nov 2, 2023

Merge branch 'cr/fix-scheduler-crash' into releases

91e2120

clementetb reviewed Nov 2, 2023

View reviewed changes

Bump to latest BAAS

0245904

nhachicha approved these changes Nov 3, 2023

View reviewed changes

rorbech added 2 commits November 3, 2023 11:57

Proper locking around posting to freed scheduler

1363ca7

Remove usage of runTest

2f470cd

clementetb approved these changes Nov 3, 2023

View reviewed changes

Merge branch 'releases' into cr/fix-scheduler-crash

5c7b52b

rorbech changed the base branch from main to releases November 3, 2023 11:10

Fix macos tests

92c25d0

rorbech mentioned this pull request Nov 3, 2023

Flaky test: io.realm.kotlin.test.mongodb.common.SyncedRealmTests.mutableRealmInt_convergesAcrossClients #1561

Closed

Add FIXME for bumping local BaaS SHA

9a4fc03

rorbech mentioned this pull request Nov 3, 2023

[macos] Crash when trying to clean up scheduler used by multiple realms #1563

Open

rorbech requested a review from nhachicha November 3, 2023 15:48

rorbech requested a review from clementetb November 3, 2023 15:48

rorbech added 12 commits November 7, 2023 16:31

Bump core and SHA1 for local BaaS builds

8782f9e

Change http timeout for debugging

8dbcf5d

Enable debugging info

867db65

Another round for linting

e243e36

Additional debug statements

bc2df5d

Changed timeout.

43fae5d

Another round for linting

be6842c

More debug output

367bbea

Fix timeout exceptions for debugging

5623989

Clean up

6b6c8a8

Revert http timeout

cff6470

Reinsert download guard

7746087

cmelchior approved these changes Nov 9, 2023

View reviewed changes

rorbech merged commit 0dcdbfb into releases Nov 9, 2023
2 checks passed

rorbech deleted the cr/fix-scheduler-crash branch November 9, 2023 09:20

rorbech mentioned this pull request Jan 2, 2024

Fix various dispatcher issues #1611

Merged

github-actions bot locked as resolved and limited conversation to collaborators Mar 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Guard notifying outdated scheduler #1559

Guard notifying outdated scheduler #1559

rorbech commented Nov 2, 2023 •

edited

Loading

clementetb left a comment •

edited

Loading

clementetb Nov 2, 2023 •

edited

Loading

clementetb commented Nov 2, 2023 •

edited

Loading

rorbech commented Nov 3, 2023

clementetb commented Nov 3, 2023

rorbech commented Nov 3, 2023 •

edited

Loading

nhachicha left a comment

rorbech commented Nov 3, 2023

cmelchior left a comment

cmelchior Nov 9, 2023

rorbech Nov 9, 2023

cmelchior Nov 9, 2023

Guard notifying outdated scheduler #1559

Guard notifying outdated scheduler #1559

Conversation

rorbech commented Nov 2, 2023 • edited Loading

clementetb left a comment • edited Loading

Choose a reason for hiding this comment

clementetb Nov 2, 2023 • edited Loading

Choose a reason for hiding this comment

clementetb commented Nov 2, 2023 • edited Loading

rorbech commented Nov 3, 2023

clementetb commented Nov 3, 2023

rorbech commented Nov 3, 2023 • edited Loading

nhachicha left a comment

Choose a reason for hiding this comment

rorbech commented Nov 3, 2023

cmelchior left a comment

Choose a reason for hiding this comment

cmelchior Nov 9, 2023

Choose a reason for hiding this comment

rorbech Nov 9, 2023

Choose a reason for hiding this comment

cmelchior Nov 9, 2023

Choose a reason for hiding this comment

rorbech commented Nov 2, 2023 •

edited

Loading

clementetb left a comment •

edited

Loading

clementetb Nov 2, 2023 •

edited

Loading

clementetb commented Nov 2, 2023 •

edited

Loading

rorbech commented Nov 3, 2023 •

edited

Loading