-
Notifications
You must be signed in to change notification settings - Fork 2.6k
sc_network::service::tests::notifications_back_pressure is flaky #6766
Comments
I don't know how to debug this, as I can't manage to make it fail locally. I suspect that it's related to this TODO: substrate/client/network/src/service.rs Lines 1463 to 1473 in 19c1d90
It's also not certain whether the test itself is correct, because of paritytech/polkadot-sdk#552 |
Could this not also be related to the fact discussed here that notifications may start out being sent over the legacy substream due to timing? That should also imply a potential loss of ordering for the receiver, as there is none across different substreams, each having their own buffers and being polled independently. An indicator for that may also be if the message numbers in the failed assertion are always low, since such reordering for the receiver should only occur at the beginning, as later on all notifications are sent over the same substream. |
That's indeed more likely, I didn't think of this. Then hopefully #5670 should fix that. |
#6826 fixes a problem we were discarding notifications rather than sending them on the legacy substream. This might fix the test as well. |
Is this still an issue? |
#6821 fixes this issue, but introduces another test failure which I haven't investigated yet. |
This test seems to fail on CI.
It seems that notifications are lost. My guess is that the CPU is busy and the test gets put on hold for more than 30 seconds, which causes the connection to get killed for inactivity.
The text was updated successfully, but these errors were encountered: