Seed for randomization derived from non-numeric-looking string is always 0 #800

brontolosone · 2024-10-14T12:09:22Z

Related: getodk/web-forms#240
Related: getodk/web-forms#49

Here is a form that uses choice randomization.
Yes, the referenced seed node is a text input! If you put non-numeric-looking text in it, then by my reading of OpenRosa it evaluates to a NaN double, which becomes 0 when converted to the Long that the Park-Miller PRNG takes as a seed. That means basically any text that doesn't happen to look like a number results in the same seed.
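
To make the collapse concrete, here is a minimal Java sketch of the conversion chain described above. It is not JavaRosa's actual code: the class and method names are hypothetical, and `Double.parseDouble` only stands in for the real string-to-number conversion (it accepts a few spellings the real one may not). The one thing it relies on is that Java's narrowing cast turns NaN into 0.

```java
// Hypothetical illustration, not JavaRosa's implementation.
final class NanSeedDemo {

    /** Mimics the described chain: text -> double (NaN if unparsable) -> long seed. */
    static long seedFromText(String text) {
        double value;
        try {
            value = Double.parseDouble(text.trim());
        } catch (NumberFormatException e) {
            // Assumption: stands in for the "can't be parsed, so NaN" fallthrough.
            value = Double.NaN;
        }
        // Java's double-to-long narrowing conversion maps NaN to 0.
        return (long) value;
    }
}
```

With this, `seedFromText("apple")` and `seedFromText("banana")` both give 0, while `seedFromText("42")` gives 42.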

It's not a huge disaster (which may be why it went unnoticed), but on the other hand there's no indication of this behaviour: the spec (XLSForm, not XForm) doesn't say "only use integer / numeric types or number-like text", ODK Validate doesn't say anything about it, and there's no runtime warning (not that we'd want one).
So to a user doing a superficial test of their survey, it does look like the choice list gets randomized when they put some text in the designated seed field. With more extensive testing they might find out that almost any string (any non-numeric-looking string) results in the same sort order, and only at that point would they start to think about how what was promised relates to these observations, and how those observations relate to what they want. That sort of ambiguity is probably not what we want.

I stumbled upon this in the context of getodk/web-forms#49.
Of the solution alternatives I could come up with ("do nothing but port the behaviour to webforms", "clarify current behaviour in the spec", and "fix things"), @lognaturel is leaning towards "fix things", which I'd approach as follows:

In the fallthrough case (the string can't be parsed into a Double), which currently results in NaN, we take the original string value instead, hash-digest it (UTF-8 encoded), and read the first 8 bytes of the resulting digest as a double. The goal is to achieve stable, reproducible, cross-platform seeded randomization regardless of the nature of the input, while giving the same results for inputs that are currently not squashed to 0 (and thus staying backwards-compatible).
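
A rough sketch of that fallthrough step, to show the shape of the idea. The digest algorithm is not named above, so SHA-256 is purely an assumption here, as are the class and method names and the explicit big-endian byte order. How the resulting double is then turned into the PRNG's Long seed is left out on purpose, since that detail isn't settled in this description.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

// Hypothetical illustration of the proposed fallthrough, not the actual patch.
final class HashedSeedDemo {

    /** Digests the UTF-8 bytes of the text and reads the first 8 digest bytes as a double. */
    static double digestPrefixAsDouble(String text) {
        try {
            // Assumption: SHA-256; the proposal only says "hash-digest".
            MessageDigest md = MessageDigest.getInstance("SHA-256");
            byte[] digest = md.digest(text.getBytes(StandardCharsets.UTF_8));
            // A fixed byte order keeps the result identical across platforms.
            return ByteBuffer.wrap(digest).order(ByteOrder.BIG_ENDIAN).getDouble();
        } catch (NoSuchAlgorithmException e) {
            // Every conforming Java platform is required to provide SHA-256.
            throw new IllegalStateException(e);
        }
    }
}
```

The same input always gives the same value, and different strings (for example "apple" and "banana") land on different values instead of all collapsing to one. Numeric-looking input would never reach this path, which is what keeps the existing behaviour intact for it.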

brontolosone mentioned this issue Oct 14, 2024

choice randomization: better approximation of JR behaviour, fixes #49 getodk/web-forms#241

Open

2 tasks

brontolosone added a commit to brontolosone/javarosa that referenced this issue Oct 14, 2024

hash un-numeric input when used as PRNG seed, fixes getodk#800

db109ec

brontolosone linked a pull request Oct 14, 2024 that will close this issue

hash un-numeric input when used as PRNG seed, fixes #800 #801

Open
