Engine support for `jr:count` and `jr:noAddRemove` #187

eyelidlessness · 2024-08-14T00:19:20Z

As the title says, this PR adds support for jr:count and jr:noAddRemove.

Client-facing changes

In terms of client impact, this change splits the RepeatRangeNode client interface into:

RepeatRangeControlledNode (nodeType: 'repeat-range:controlled')
RepeatRangeUncontrolledNode (nodeType: 'repeat-range:uncontrolled')

From a client perspective, the main distinction between these nodes aside from their names and nodeType is that only RepeatRangeUncontrolledNode provides write methods (addInstances, removeInstances).

The client interface still exposes a RepeatRangeNode type, which is now a union of these two interfaces.

An earlier pass considered the possibility of keeping nodeType: 'repeat-range' for both types, with a separate countType property to narrow their capabilities further. And as described in the earliest commits to this branch, the approach works (i.e. type narrowing behaves as expected), but the usage felt confusing and error prone.

I'm eager to bikeshed the names of these types if there's interest, but I'm pretty confident the general approach (distinct nodeTypes as the narrowing mechanism, distinct names for each node type, clear naming corresponding to their distinct capabilities) is the right way to go.

I left the earlier approach intact in the commit history for now (which we may choose to rebase before merge), mainly to address some concern in earlier discussions about encoding more information into the nodeType property. I do understand that from a data modeling perspective this is a bit of a smell! But I really think it's the right way to go. If there are still doubts, let's consider getting on a call to discuss the tradeoffs before digging in here in the comments.

Implementation: `RepeatCountControlExpression`

A new RepeatCountControlExpression which generalizes the logic of both jr:count and jr:addRemove. This is another DependentExpression, the underlying interface used by reactive createComputedExpression, as well as all of the same dependency analysis code paths used to compute calculate, relevant... any other expression which might update based on changes to dependencies.

This is generalized by treating jr:noAddRemove as if it were a jr:count expression which evaluates to the the number of form-defined repeat instances.

Judgment call: special case for `jr:noAddRemove` with only a `jr:template`

If only a jr:template is defined, jr:noAddRemove is treated as if one instance is defined. I'm happy to reconsider this, but it seemed like the first sensible thing I could think of. Maybe we should warn instead?

Implementation: `BaseRepeatRange`, `RepeatRangeControlled`, `RepeatRangeUncontrolled`

Nearly all functionality previously in RepeatRange is moved into an abstract BaseRepeatRange class, from which the controlled and uncontrolled concrete node implementations inherit.

Only the behavior distinct to those node types in those concrete classes. Those differences are:

RepeatRangeUncontrolled creates initial repeat instances almost the same way RepeatRange did before. (More on that below!)
RepeatRangeControlled creates instances with a reactive computation of RepeatCountControlExpression, both on initialization and on any subsequent update to referenced dependencies.
As with the client interface, only RepeatRangeUncontrolled exposes public write methods to add or remove repeat instances.

Judgment call: special case when `jr:count` produces `NaN`

Currently, when a jr:count expression is evaluated to NaN, the count of repeat instances is not updated until a subsequent update produces a valid number. I added some commentary on this in 052ffbd. It matched the behavior I expect when I was directly setting a count in an input. But I started to second guess this as I was writing tests around count being computed from multiple/indirect references.

I suspect it might be better if we were to let clients provide a special case for this, but that may require doing additional form analysis so the engine can inform clients when a particular field controls a jr:count expression. Should we maybe roll this special case back and consider refinement in the future?

Change in repeat instance creation logic

There is one important change to the code moved from RepeatRange into BaseRepeatRange. Under some circumstances, synchronous addition of multiple RepeatInstance children is now performed in two (still synchronous) steps:

All of the pending RepeatInstances are constructed in memory.
Once created, they are all added to the parent repeat range's childrenState in a single write call.

This introduces a brief period where reactive state lags the state in the backing XML DOM store used for XPath evaluation. There are a couple of motivations for this change:

It's much easier to reason about! The creation code is a simple array map, and there's a single write call. This was previously implemented by recursively calling addInstances, which especially made it confusing to understand how and when that method's default parameters were used.
It addresses one of the lingering cases where clients using Solid (including scenario) will encounter InconsistentChildrenStateErrors. And I now have a much better understanding of what causes this to occur! Ultimately, this is caused when certain writes, in certain reactive scopes, are issued multiple times in the same synchronous tick. I'm not 100% confident that we've seen the last of these cases, as it could still be produced under some circumstances by repeated calls to RepeatRangeUncontrolled.addInstances.

Tests

There were no existing scenario tests from JavaRosa which passed with these changes. This is only a little surprising, based on my understanding of the history of those tests. There are certainly some tests which will be affected by the introduction of jr:count functionality, but they exercise additional functionality as well.

As such, I've added a slew of new targeted tests for these specific features and various details thereof.

Bonus bug fix: reactive update on indirect reference to non-blank + newly relevant nodes

One of the new scenario tests happened to trigger a bug that was familiar to me, and for which I've had a known fix awaiting prioritization. Specifically, this is the nature of the bug as I understand it:

Given some field a with:
- A non-blank value
- A relevant expression evaluating to false
Given some field b with a computation referencing a
Given some field c with a computation referencing b

Making a relevant will sometimes cause updates to b to lag, and will generally (if not always) prevent c from updating a consequence.

The fix for this is simple: f16a4ac. I'll call out here (as I did in a comment there) that this kind of bug will be moot if we move away from XML DOM for XPath evaluation.

Clients

Apart from scenario, I've made minimal updates to the web-forms (Vue UI) and ui-solid clients to account for the changes in the client-facing API. I did not address any presentation aspects of the change in either UI client. I expect there will be some minor adjustments incoming (probably at least around spacing), but I've deferred to downstream work on that.

Open questions

There are a few open questions raised above (name bikeshedding, a couple of judgment calls). I also wasn't sure if/how to handle non-integer values produced by jr:count. So that isn't addressed at all yet. Do we want to address it here?

changeset-bot · 2024-08-14T00:19:23Z

🦋 Changeset detected

Latest commit: 46a3d14

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 6 packages

Name	Type
@getodk/xforms-engine	Minor
@getodk/web-forms	Minor
@getodk/scenario	Minor
@getodk/ui-solid	Minor
@getodk/common	Minor
@getodk/xpath	Patch

Not sure what this means? Click here to learn what changesets are.

Click here if you're a maintainer who wants to add another changeset to this PR

eyelidlessness · 2024-08-14T00:25:38Z

I also want to call out that very little in this change is really new! The vast majority of the implementation change is moving stuff into a base class shared by both repeat range types. I did briefly consider skipping that and implementing the count aspects in a constructor branch, but I think it’s helpful to see where they split more clearly. And to hew more closely to their slight differences in client API capabilities.

eyelidlessness · 2024-08-15T20:22:46Z

I just remembered to mention another bit of functionality that we might want, but isn't handled here: preserving and restoring previous added repeat instances. I believe this is something @seadowg mentioned when we spoke about it a while back. It is certainly possible, likely even trivial, but I didn't want to be too presumptuous about it without some opportunity for discussion.

seadowg · 2024-08-19T07:24:33Z

I just remembered to mention another bit of functionality that we might want, but isn't handled here: preserving and restoring previous added repeat instances. I believe this is something @seadowg mentioned when we spoke about it a while back.

I don't recall this sadly!

This is mostly a preparation for implementation of count/fixed repeats. It also provides sufficient type information to be able to validate client usage, and in particular to validate the ability to treat `AnyRepeatRangeNode` as a tagged/discriminated union (narrowed by checking its `countType` property).

…ared usage

Includes a check for whether repeat instances can be added, which was previously missed. The tagged union type does seem to pay off here!

Similarly shows union type narrowing working as intended!

Observation: both controlled/uncontrolled repeat ranges sharing the same `nodeType` feels error prone. - Interface names and nodeType no longer have 1:1 relationship - Losing that correlation makes it likely to reference `RepeatRangeNode` when `AnyRepeatRangeNode` is intended - This adds a level of cognitive overhead that we haven’t had up to this point - While types will often guide correcting the issue, they won’t always and several mistakes were only caught due to active focus on these specific adjustments A followup commit will explore restoring that 1:1 relationship.

…t ranges

…types

… types Observation: while a bit more verbose, this approach feels both… - more consistent with the existing node type narrowing - substantially less error prone

It’s super confusing when such runtime errors are swallowed without any feedback!

I believe these exercise the bulk of the pertinent logic. Not (yet?) tested: - Non-integer values produced by count expression - Behavior of blank count value when not directly entered by user - `jr:noAddRemove` Note that at least one of these tests will fail on this commit, without a bug fix (incoming in subsequent commit).

…vance change This affects at one of the newly added `jr:count` tests. I also know there is at least one other `scenario` test affected by this bug, which I had seen passing with this exact same change while working on some other WIP functionality. Unfortunately, I’m now having trouble recalling which test that was. But I can give it a more thorough look if there’s interest. I would prefer to have landed this specific change in a more targeted PR, with the **existing affected test** passing, but I think it’s good enough that there’s a prior commit with a test that fails without the fix.

@lognaturel

These are mostly interesting to evaluate performance **of itemsets**, but they depend on `jr:count` support to work. - bench9.xml is taken directly from the set of fixtures loaded by enketo-core-performance-monitor. I confirmed with @lognaturel that this fixture doesn’t contain sensitive information, so it’s safe to include directly. - bench9-alt.xml is modified from that original fixture, replacing absolute references into repeat instances with their `current()`-prefixed relative equivalents. This should be more consistent with what we’d expect pyxform to produce for an equivalent form (assumed, not confirmed; it’d be nice to be able to reliably convert bidirectionally!).

…ty check

sadiqkhoja

This looks great 🎉

Can we add couple of tests where:

jr:count is static number, existing test suite static values is setting static values to input /data/rep-count
it shows that "jr:count is intended to be the ultimate determiner of the number of repeat instances so users should not be able to add or remove instances."

sadiqkhoja · 2024-08-26T16:03:16Z

packages/xforms-engine/src/client/repeat/RepeatInstanceNode.ts

@@ -33,12 +33,12 @@ export interface RepeatInstanceNode extends BaseNode {
 readonly root: RootNode;

 /**
- * A repeat instance may only be a child of a {@link RepeatRangeNode}.
+ * A repeat instance may only be a child of a {@link RepeatRangeUncontrolledNode}.


It can be child of RepeatRangeControlledNode as well, right?

sadiqkhoja · 2024-08-26T20:26:03Z

packages/scenario/test/repeat.test.ts

+ // Subjective! Maybe a form design mistake? But it wouldn't ever make sense to
+ // create a fixed set of zero repeat instances.


I agree with this.

I think we can consider producing a warning about this (and things like it) in the future.

These were mistakenly auto-updated as references to their uncontrolled variants by editor tooling

eyelidlessness · 2024-08-27T20:34:07Z

jr:count is static number, existing test suite static values is setting static values to input /data/rep-count

Added in 8c6b5aa.

it shows that "jr:count is intended to be the ultimate determiner of the number of repeat instances so users should not be able to add or remove instances."

Does this test cover what you're looking for? If not, can you say more about what you'd like to see?

I also want to make sure to follow up on our side discussion about naming. Now that we've both had some time to let it marinate, which of these feels right to you?

RepeatRangeControlled[Node]/RepeatRangeUncontrolled[Node]
RepeatRangeManaged[Node]/RepeatRangeUnmanaged[Node]
Something else?

sadiqkhoja

LGTM 🎉

…control of `jr:count`

eyelidlessness marked this pull request as ready for review August 14, 2024 19:46

eyelidlessness added 19 commits August 20, 2024 13:22

scenario: remove redundant assertions, use those already moved for sh…

472c1f3

…ared usage

scenario: adjustments for controlled/uncontrolled repeat range types

3ed21e7

Includes a check for whether repeat instances can be added, which was previously missed. The tagged union type does seem to pay off here!

ui-solid: adjustments for controlled/uncontrolled repeat range types

f5adbca

Similarly shows union type narrowing working as intended!

engine: refine naming and node types of controlled/uncontrolled repea…

7815fca

…t ranges

scenario: re-adjust to 1:1 controlled/uncontrolled repeat range node …

a65e938

…types

ui-solid: re-adjust to 1:1 controlled/uncontrolled repeat range node …

5644c1b

…types

web-forms: re-adjust to 1:1 controlled/uncontrolled repeat range node…

0c34709

… types Observation: while a bit more verbose, this approach feels both… - more consistent with the existing node type narrowing - substantially less error prone

engine: implement jr:count, jr:noAddRemove for RepeatRangeControlledNode

e3652f1

web-forms: propagate loading errors to console

d6b564d

It’s super confusing when such runtime errors are swallowed without any feedback!

scenario: clean up a couple more things now that jr:count is supported

30de037

Add note about the intent (and potential drawbacks) of NaN count check

7f64d2c

fix (scenario): check for controlled repeat, rename proposed capabili…

8a63cbc

…ty check

scenario: test jr:count capability, add jr:noAddRemove tests

a28b083

Changeset

7b63159

eyelidlessness force-pushed the features/repeat-count-fixed branch from 4585960 to 7b63159 Compare August 20, 2024 20:26

eyelidlessness requested review from sadiqkhoja and removed request for sadiqkhoja August 20, 2024 20:28

sadiqkhoja reviewed Aug 26, 2024

View reviewed changes

Correct JSDoc links to RepeatRange[Node]

684c2ae

These were mistakenly auto-updated as references to their uncontrolled variants by editor tooling

Add tests for static counts directly in jr:count expression

8c6b5aa

sadiqkhoja self-requested a review August 27, 2024 21:44

sadiqkhoja approved these changes Aug 27, 2024

View reviewed changes

Add test clarifying jr:noAddRemove=“false()” doesn’t negate engine …

46a3d14

…control of `jr:count`

eyelidlessness merged commit 57accc3 into main Aug 28, 2024
86 checks passed

This was referenced Aug 28, 2024

Form design pattern "Repeating as long as a condition is met" pathological infinite loop #205

Open

Fix: evaluate jr:count in context of affected repeat; fix/update child vaccination smoketest, other Scenario tests #207

Merged

Implement jr:count #87

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Engine support for `jr:count` and `jr:noAddRemove` #187

Engine support for `jr:count` and `jr:noAddRemove` #187

eyelidlessness commented Aug 14, 2024 •

edited

Loading

changeset-bot bot commented Aug 14, 2024 •

edited

Loading

eyelidlessness commented Aug 14, 2024

eyelidlessness commented Aug 15, 2024

seadowg commented Aug 19, 2024

sadiqkhoja left a comment

sadiqkhoja Aug 26, 2024

sadiqkhoja Aug 26, 2024

eyelidlessness Aug 27, 2024

eyelidlessness commented Aug 27, 2024 •

edited

Loading

sadiqkhoja left a comment

		// Subjective! Maybe a form design mistake? But it wouldn't ever make sense to
		// create a fixed set of zero repeat instances.

Engine support for jr:count and jr:noAddRemove #187

Engine support for jr:count and jr:noAddRemove #187

Conversation

eyelidlessness commented Aug 14, 2024 • edited Loading

Client-facing changes

Implementation: RepeatCountControlExpression

Judgment call: special case for jr:noAddRemove with only a jr:template

Implementation: BaseRepeatRange, RepeatRangeControlled, RepeatRangeUncontrolled

Judgment call: special case when jr:count produces NaN

Change in repeat instance creation logic

Tests

Bonus bug fix: reactive update on indirect reference to non-blank + newly relevant nodes

Clients

Open questions

changeset-bot bot commented Aug 14, 2024 • edited Loading

🦋 Changeset detected

eyelidlessness commented Aug 14, 2024

eyelidlessness commented Aug 15, 2024

seadowg commented Aug 19, 2024

sadiqkhoja left a comment

Choose a reason for hiding this comment

sadiqkhoja Aug 26, 2024

Choose a reason for hiding this comment

sadiqkhoja Aug 26, 2024

Choose a reason for hiding this comment

eyelidlessness Aug 27, 2024

Choose a reason for hiding this comment

eyelidlessness commented Aug 27, 2024 • edited Loading

sadiqkhoja left a comment

Choose a reason for hiding this comment

Engine support for `jr:count` and `jr:noAddRemove` #187

Engine support for `jr:count` and `jr:noAddRemove` #187

eyelidlessness commented Aug 14, 2024 •

edited

Loading

Implementation: `RepeatCountControlExpression`

Judgment call: special case for `jr:noAddRemove` with only a `jr:template`

Implementation: `BaseRepeatRange`, `RepeatRangeControlled`, `RepeatRangeUncontrolled`

Judgment call: special case when `jr:count` produces `NaN`

changeset-bot bot commented Aug 14, 2024 •

edited

Loading

eyelidlessness commented Aug 27, 2024 •

edited

Loading