[BUG] org.opensearch.search.nested.SimpleNestedIT.testExplain is flaky #11413

reta · 2023-11-30T14:44:32Z

Describe the bug
The test case org.opensearch.search.nested.SimpleNestedIT.testExplain {p0={"search.concurrent_segment_search.enabled":"true"}} is flaky

java.lang.AssertionError: 
Expected: a string starting with "0.36464313 = Score based on 2 child docs in range from 0 to 1"
     but: was "0.36464313 = Score based on 2 child docs in range from 12 to 13, using score mode Total
  0.18232156 = weight(nested1.n_field1:n_value1 in 12) [PerFieldSimilarity], result of:
    0.18232156 = score(freq=1.0), computed as boost * idf * tf from:
      2.2 = boost
      0.18232156 = idf, computed as log(1 + (N - n + 0.5) / (n + 0.5)) from:
        2 = n, number of documents containing term
        2 = N, total number of documents with field
      0.45454544 = tf, computed as freq / (freq + k1 * (1 - b + b * dl / avgdl)) from:
        1.0 = freq, occurrences of term within document
        1.2 = k1, term saturation parameter
        0.75 = b, length normalization parameter
        1.0 = dl, length of field
        1.0 = avgdl, average length of field
"

	at __randomizedtesting.SeedInfo.seed([A16D8358F150F61C:1799CEB9DDFB0C53]:0)
	at org.hamcrest.MatcherAssert.assertThat(MatcherAssert.java:18)
	at org.junit.Assert.assertThat(Assert.java:964)
	at org.junit.Assert.assertThat(Assert.java:930)
	at org.opensearch.search.nested.SimpleNestedIT.testExplain(SimpleNestedIT.java:500)
	at java.base/jdk.internal.reflect.DirectMethodHandleAccessor.invoke(DirectMethodHandleAccessor.java:103)
	at java.base/java.lang.reflect.Method.invoke(Method.java:580)
	at com.carrotsearch.randomizedtesting.RandomizedRunner.invoke(RandomizedRunner.java:1750)
	at com.carrotsearch.randomizedtesting.RandomizedRunner$8.evaluate(RandomizedRunner.java:938)
	at com.carrotsearch.randomizedtesting.RandomizedRunner$9.evaluate(RandomizedRunner.java:974)
	at com.carrotsearch.randomizedtesting.RandomizedRunner$10.evaluate(RandomizedRunner.java:988)
	at com.carrotsearch.randomizedtesting.rules.StatementAdapter.evaluate(StatementAdapter.java:36)
	at org.junit.rules.RunRules.evaluate(RunRules.java:20)
	at org.apache.lucene.tests.util.TestRuleSetupTeardownChained$1.evaluate(TestRuleSetupTeardownChained.java:48)
	at org.apache.lucene.tests.util.AbstractBeforeAfterRule$1.evaluate(AbstractBeforeAfterRule.java:43)
	at org.apache.lucene.tests.util.TestRuleThreadAndTestName$1.evaluate(TestRuleThreadAndTestName.java:45)
	at org.apache.lucene.tests.util.TestRuleIgnoreAfterMaxFailures$1.evaluate(TestRuleIgnoreAfterMaxFailures.java:60)
	at org.apache.lucene.tests.util.TestRuleMarkFailure$1.evaluate(TestRuleMarkFailure.java:44)
	at org.junit.rules.RunRules.evaluate(RunRules.java:20)
	at com.carrotsearch.randomizedtesting.rules.StatementAdapter.evaluate(StatementAdapter.java:36)
	at com.carrotsearch.randomizedtesting.ThreadLeakControl$StatementRunner.run(ThreadLeakControl.java:368)
	at com.carrotsearch.randomizedtesting.ThreadLeakControl.forkTimeoutingTask(ThreadLeakControl.java:817)
	at com.carrotsearch.randomizedtesting.ThreadLeakControl$3.evaluate(ThreadLeakControl.java:468)
	at com.carrotsearch.randomizedtesting.RandomizedRunner.runSingleTest(RandomizedRunner.java:947)
	at com.carrotsearch.randomizedtesting.RandomizedRunner$5.evaluate(RandomizedRunner.java:832)
	at com.carrotsearch.randomizedtesting.RandomizedRunner$6.evaluate(RandomizedRunner.java:883)
	at com.carrotsearch.randomizedtesting.RandomizedRunner$7.evaluate(RandomizedRunner.java:894)
	at org.apache.lucene.tests.util.AbstractBeforeAfterRule$1.evaluate(AbstractBeforeAfterRule.java:43)
	at com.carrotsearch.randomizedtesting.rules.StatementAdapter.evaluate(StatementAdapter.java:36)
	at org.apache.lucene.tests.util.TestRuleStoreClassName$1.evaluate(TestRuleStoreClassName.java:38)
	at com.carrotsearch.randomizedtesting.rules.NoShadowingOrOverridesOnMethodsRule$1.evaluate(NoShadowingOrOverridesOnMethodsRule.java:40)
	at com.carrotsearch.randomizedtesting.rules.NoShadowingOrOverridesOnMethodsRule$1.evaluate(NoShadowingOrOverridesOnMethodsRule.java:40)
	at com.carrotsearch.randomizedtesting.rules.StatementAdapter.evaluate(StatementAdapter.java:36)
	at com.carrotsearch.randomizedtesting.rules.StatementAdapter.evaluate(StatementAdapter.java:36)
	at org.apache.lucene.tests.util.TestRuleAssertionsRequired$1.evaluate(TestRuleAssertionsRequired.java:53)
	at org.apache.lucene.tests.util.AbstractBeforeAfterRule$1.evaluate(AbstractBeforeAfterRule.java:43)
	at org.apache.lucene.tests.util.TestRuleMarkFailure$1.evaluate(TestRuleMarkFailure.java:44)
	at org.apache.lucene.tests.util.TestRuleIgnoreAfterMaxFailures$1.evaluate(TestRuleIgnoreAfterMaxFailures.java:60)
	at org.apache.lucene.tests.util.TestRuleIgnoreTestSuites$1.evaluate(TestRuleIgnoreTestSuites.java:47)
	at org.junit.rules.RunRules.evaluate(RunRules.java:20)
	at com.carrotsearch.randomizedtesting.rules.StatementAdapter.evaluate(StatementAdapter.java:36)
	at com.carrotsearch.randomizedtesting.ThreadLeakControl$StatementRunner.run(ThreadLeakControl.java:368)
	at java.base/java.lang.Thread.run(Thread.java:1583)
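
As a sanity check on the numbers in the explain output above, here is a minimal sketch that recomputes the per-child score from the printed formulas. The class and method names are hypothetical and not part of OpenSearch or Lucene, and it uses `double` arithmetic, so the last digits can differ slightly from the float values Lucene prints.

```java
public class Bm25ExplainCheck {
    // idf as printed in the explain output: log(1 + (N - n + 0.5) / (n + 0.5))
    static double idf(long n, long totalDocs) {
        return Math.log(1 + (totalDocs - n + 0.5) / (n + 0.5));
    }

    // tf as printed in the explain output: freq / (freq + k1 * (1 - b + b * dl / avgdl))
    static double tf(double freq, double k1, double b, double dl, double avgdl) {
        return freq / (freq + k1 * (1 - b + b * dl / avgdl));
    }

    public static void main(String[] args) {
        // Values copied from the explain output: boost=2.2, n=2, N=2, freq=1, k1=1.2, b=0.75, dl=avgdl=1
        double child = 2.2 * idf(2, 2) * tf(1.0, 1.2, 0.75, 1.0, 1.0);
        // Score mode Total sums the two matching child docs
        double parent = 2 * child;
        System.out.printf("child=%.8f parent=%.8f%n", child, parent);
    }
}
```

The recomputed values agree with 0.18232156 and 0.36464313 to within float rounding, which supports the observation that only the docID range in the first line differs, not the score.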

To Reproduce

./gradlew ':server:internalClusterTest' --tests "org.opensearch.search.nested.SimpleNestedIT" -Dtests.method="testExplain {p0={"search.concurrent_segment_search.enabled":"true"}}" -Dtests.seed=A16D8358F150F61C

Expected behavior
The test must always pass

Plugins
Standard


Host/Environment (please complete the following information):

CI

Additional context

https://build.ci.opensearch.org/job/gradle-check/30636/testReport/junit/org.opensearch.search.nested/SimpleNestedIT/testExplain__p0___search_concurrent_segment_search_enabled___true___/


peternied · 2023-11-30T17:36:31Z

Thanks for filing this issue

jed326 · 2023-11-30T17:59:02Z

Just at a glance this looks related to ingesting dummy docs for the concurrent search path. The expected score is still correct.

neetikasinghal · 2023-11-30T18:21:56Z

I am not able to reproduce this with the above seed. I also ran it 100 times and still could not reproduce it.

neetikasinghal · 2023-12-05T11:11:41Z

I am now able to reproduce this with seed=1D20738151FB4EBD

jed326 · 2023-12-11T21:07:06Z

The explain itself is coming from Lucene: https://github.com/apache/lucene/blob/a6f70ad2bb0b682eb65feb522358ee6d16cad766/lucene/join/src/java/org/apache/lucene/search/join/ToParentBlockJoinQuery.java#L432-L440

The difference in output seems to be coming from different start/end docs, which makes sense since we have deleted docs in our segments for the concurrent search path. It seems to me we can just fix the test itself to accommodate.
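
One possible way to make the assertion tolerate different start/end docs is to match the first explain line with a pattern instead of a fixed prefix. This is only a sketch: the class name is hypothetical, it is not necessarily what the eventual fix does, and the check that the range spans exactly two adjacent docs is an assumption based on the two outputs seen in this issue ("0 to 1" and "12 to 13").

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class ExplainRangeMatcher {
    // Same text the test expects, but with the docID range captured instead of hard-coded.
    private static final Pattern HEADER = Pattern.compile(
        "0\\.36464313 = Score based on 2 child docs in range from (\\d+) to (\\d+), using score mode Total");

    // Returns true if the explanation starts with the expected header for any docID range
    // that covers two adjacent docs (assumption: the two child docs are contiguous).
    static boolean matches(String explanation) {
        Matcher m = HEADER.matcher(explanation);
        if (!m.lookingAt()) {
            return false;
        }
        int from = Integer.parseInt(m.group(1));
        int to = Integer.parseInt(m.group(2));
        return to - from == 1;
    }

    public static void main(String[] args) {
        String failing = "0.36464313 = Score based on 2 child docs in range from 12 to 13, using score mode Total\n"
            + "  0.18232156 = weight(nested1.n_field1:n_value1 in 12) [PerFieldSimilarity], result of:";
        System.out.println(matches(failing));
    }
}
```

A check like this keeps the score and child-doc count strict while ignoring where the block happens to sit in the docID space.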

neetikasinghal · 2024-01-08T20:54:51Z

Here's some info on the docIds in lucene:

A DocId in Lucene is unique only within a Segment, not across the Index. Lucene does this mainly to optimize writing and compression. Since it is only unique within a Segment, how can a Doc be uniquely identified at the Index level? The solution is simple: the Segments are ordered. To take a simple example, suppose an Index has two Segments with 100 docs each. The DocIds within each Segment are 0-99, but when they are converted to the Index level, the DocIds of the second Segment map to 100-199.
DocIds are unique within a Segment and numbered progressively from zero, but this does not mean that they are contiguous. When a Doc is deleted, there is a gap.
The DocId corresponding to a document can change, usually when Segments are merged.
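
The segment-to-index conversion described above can be sketched in plain Java as a running offset per segment (Lucene exposes this offset as `LeafReaderContext.docBase`). The class and method names below are hypothetical, and the 12-doc segment in the second example is an illustrative assumption, not the actual segment layout from the failing run.

```java
public class DocBaseSketch {
    // Computes the index-level offset of each segment from the number of docs in each segment.
    static int[] docBases(int[] maxDocs) {
        int[] bases = new int[maxDocs.length];
        int next = 0;
        for (int i = 0; i < maxDocs.length; i++) {
            bases[i] = next;
            next += maxDocs[i]; // deleted docs still occupy IDs until the segment is merged
        }
        return bases;
    }

    // Converts a segment-local DocId to an index-level DocId.
    static int toIndexLevel(int[] bases, int segment, int localDocId) {
        return bases[segment] + localDocId;
    }

    public static void main(String[] args) {
        // Two segments of 100 docs each: the second segment covers 100-199 at the index level.
        int[] bases = docBases(new int[] {100, 100});
        System.out.println(toIndexLevel(bases, 1, 0) + "-" + toIndexLevel(bases, 1, 99));

        // Hypothetical layout: a 12-doc segment of bogus docs ahead of the nested block
        // shifts child docs at local IDs 0 and 1 to index-level IDs 12 and 13.
        int[] shifted = docBases(new int[] {12, 3});
        System.out.println(toIndexLevel(shifted, 1, 0) + " to " + toIndexLevel(shifted, 1, 1));
    }
}
```

This shows why the same two child docs can be reported under a different range whenever the docs ahead of them change.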

The test fails when the assertion validates the range of the docIds. The range changes because the indexRandomForConcurrentSearch function ingests and deletes several bogus documents, which can trigger background merges and change the range of the docIds matched by the search query.

sohami · 2024-01-09T08:55:51Z

@neetikasinghal It seems the docId in the explain output is the Lucene internal docId and not the docId set in the _id field by OpenSearch. In that case, I think it would be useful to have a test that ingests these bogus docs once and then validates that the output of the explain API is the same in the concurrent and non-concurrent cases on the same index. We can extract it into a separate test class if needed.

neetikasinghal · 2024-01-09T23:18:12Z

@sohami Makes sense. The range of the docs should not vary between the concurrent and non-concurrent cases. However, to do this we need to pull the test out into a separate class and turn off the parameterization, because the indexRandomForMultipleSlices function could produce a different range on each test run, so the result would not be consistent.
I have updated the PR based on this, please review: https://github.com/opensearch-project/OpenSearch/pull/11681/files

reta added the bug, untriaged, and flaky-test labels and removed the untriaged label Nov 30, 2023

github-actions bot added the untriaged label Nov 30, 2023

This was referenced Nov 30, 2023

[Backport] [2.x] Bump commons-io:commons-io from 2.14.0 to 2.15.0 in /plugins/ingest-attachment (#11001) #11412

Merged

[Concurrent Segment Search] SimpleNestedIT tests using nested sort are flaky for concurrent segment search #11187

Closed

jed326 self-assigned this Nov 30, 2023

peternied removed the untriaged label Nov 30, 2023

peternied added the Search:Aggregations, Search, Search:Resiliency, Search:Performance, Search:Query Capabilities, Search:Query Insights, Search:Remote Search, Search:Searchable Snapshots, and Search:Relevance labels Nov 30, 2023

andrross mentioned this issue Nov 30, 2023

[Backport 2.x] Delegating CachingWeightWrapper#count to internal weight object (#10543) #11389

Merged

8 tasks

jed326 assigned neetikasinghal and unassigned jed326 Nov 30, 2023

reta mentioned this issue Dec 4, 2023

[Backport 2.x] Add Java 11/17/21 matrix for precommit and assemble checks #11045

Merged

mch2 mentioned this issue Dec 12, 2023

Bump to Lucene99 #11421

Merged

8 tasks

neetikasinghal mentioned this issue Dec 27, 2023

Fix SimpleNestedIT.testExplain flaky test #11681

Merged

8 tasks

sohami closed this as completed in #11681 Jan 11, 2024
