Add tests for remote examples #800

Etesam913 · 2022-06-18T05:34:26Z

Summary

Adds Cypress tests for the remote_procedure/template, remote_procedure/mnist, and remote_procedure/toxicity_detection examples.

Three GitHub Actions jobs are added, one for each of the remote_procedure examples. Each job installs the necessary dependencies and runs the Cypress tests.

Video of Tests

Toxicity Detection

toxicity-detection-tests.mov

Template

template-tests.mov

MNIST (updated with drawing capability)

mnist-new-test.mov


codecov-commenter · 2022-06-18T05:37:31Z

Codecov Report

Merging #800 (93c6311) into main (20f87e1) will decrease coverage by 0.05%.
The diff coverage is n/a.

@@            Coverage Diff             @@
##             main     #800      +/-   ##
==========================================
- Coverage   64.69%   64.63%   -0.06%     
==========================================
  Files         107      107              
  Lines        9259     9259              
==========================================
- Hits         5990     5985       -5     
- Misses       3269     3274       +5

Impacted Files	Coverage Δ
...tractions/architects/channels/websocket_channel.py	`67.96% <0.00%> (-8.60%)`	⬇️
mephisto/data_model/unit.py	`78.14% <0.00%> (+0.54%)`	⬆️
mephisto/data_model/assignment.py	`61.71% <0.00%> (+3.90%)`	⬆️


Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 20f87e1...93c6311.


examples/remote_procedure/mnist/webapp/cypress/e2e/remote_procedure_mnist.cy.js

Etesam913 · 2022-07-14T16:10:24Z

...te_procedure/toxicity_detection/webapp/cypress/e2e/remote_procedure_toxicity_detection.cy.js

+ cy.get('[data-cy="toxicity-alert"]', { timeout: 25000 }).as(
+ "toxicityAlert"
+ );
+ cy.get("@toxicityAlert").contains(
+ 'The statement, "I hate bob!," has a toxicity of:'
+ );
+ });


A long timeout (25 seconds here) is required because the detoxify model takes a long time to run in the GitHub Actions job (the runners are slow).
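One way to keep that long wait in a single named place is sketched below. This is only an illustration: `expectToxicityAlert` and `TOXICITY_ALERT_TIMEOUT_MS` are hypothetical names that do not exist in this PR, and `cy` is passed in as an argument so the helper can also be called with a stub outside Cypress. The selector, the 25000 ms value, and the alert text come from the diff above.

```javascript
// Hypothetical helper, not part of this PR.
// Same timeout value as in the diff above, kept in one named constant.
const TOXICITY_ALERT_TIMEOUT_MS = 25000;

function expectToxicityAlert(cy, statement, timeoutMs = TOXICITY_ALERT_TIMEOUT_MS) {
  // Only the lookup of the alert element gets the extended timeout;
  // every other command keeps the Cypress default.
  return cy
    .get('[data-cy="toxicity-alert"]', { timeout: timeoutMs })
    .contains(`The statement, "${statement}," has a toxicity of:`);
}
```

Inside a spec this would be called as `expectToxicityAlert(cy, "I hate bob!");`. Scoping the timeout to this one lookup, rather than raising the global default, means unrelated failures would still surface quickly.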

♻️ Removed force:true from click()

Etesam913 · 2022-07-14T19:12:10Z

examples/remote_procedure/mnist/webapp/cypress/e2e/remote_procedure_mnist.cy.js

+ cy.get('[data-cy="canvas-mouse-down-container-1"]')
+ .trigger("mouseover")
+ .trigger("mousedown", 100, 20)
+ .trigger("mousedown", 120, 200)
+ .trigger("mouseup", 120, 200);


This draws a 1, but in the GitHub action the model sometimes classifies it as an 8, which makes the test fail about 1 out of every 4 runs. It might make more sense to draw a number that is clearer to the model.

It feels kinda weird though because given the exact same mouse down values I feel like the model should give me the exact same result, but I guess that is not the case. AI stuff I guess 🤷

Changed the 1 to a 3; hopefully this is more consistent.

Edit:
Seems to be consistent, haven't had any GitHub Actions runs fail.
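If the drawn digit ever needs a clearer shape, the stroke could be described as a list of points and replayed as events. The sketch below is only an illustration: `strokeToEvents` and `drawStroke` are hypothetical helpers, and it assumes the canvas reacts to `mousedown`, `mousemove`, and `mouseup`, which this PR does not confirm. Whether a denser stroke actually makes the model's prediction more stable is also untested.

```javascript
// Hypothetical helper: turns a list of [x, y] points into the
// event sequence for a single pen stroke.
function strokeToEvents(points) {
  if (points.length < 2) {
    throw new Error("a stroke needs at least two points");
  }
  const [first, ...rest] = points;
  const last = rest[rest.length - 1];
  return [
    ["mousedown", first[0], first[1]], // press at the start of the stroke
    ...rest.map(([x, y]) => ["mousemove", x, y]), // drag through the remaining points
    ["mouseup", last[0], last[1]], // release where the stroke ends
  ];
}

// Replays the events on a Cypress chainable (the result of cy.get(...)).
function drawStroke(chainable, points) {
  return strokeToEvents(points).reduce(
    (chain, [eventName, x, y]) => chain.trigger(eventName, x, y),
    chainable
  );
}
```

In a spec this might be used as `drawStroke(cy.get('[data-cy="canvas-mouse-down-container-1"]'), [[100, 20], [110, 110], [120, 200]]);`, where the middle point is made up for the example.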

Etesam913 · 2022-07-14T19:48:42Z

I don't really know if there is a way to alleviate this, but in some of the tests I click the submit button. This makes locally running the tests a second time fail. The task understandably isn't expected to work after it has been submitted. One workaround is to just comment out the line where you click the submit button in the test. This would allow you to run the test on the task many times without it failing because of submission.

JackUrb · 2022-07-15T16:41:44Z

> I don't really know if there is a way to alleviate this, but in some of the tests I click the submit button. This makes locally running the tests a second time fail. The task understandably isn't expected to work after it has been submitted. One workaround is to just comment out the line where you click the submit button in the test. This would allow you to run the test on the task many times without it failing because of submission.

Another possible approach is to launch multiple tasks, and have a "submit" be the last step of a test, then other tests can use a different worker_id in the URL.

JackUrb

Overall these tests make sense to me!

Etesam913 · 2022-07-15T17:38:17Z

> > I don't really know if there is a way to alleviate this, but in some of the tests I click the submit button. This makes locally running the tests a second time fail. The task understandably isn't expected to work after it has been submitted. One workaround is to just comment out the line where you click the submit button in the test. This would allow you to run the test on the task many times without it failing because of submission.
>
> Another possible approach is to launch multiple tasks, and have a "submit" be the last step of a test, then other tests can use a different worker_id in the URL.

It may not even be related to what I said to be honest, not exactly sure

Here's a video of the problem (occurs at the end).

Screen.Recording.2022-07-15.at.1.35.41.PM.mov

I think if you submit the task enough times it will fail or something like that.

JackUrb · 2022-07-15T18:04:57Z

Ah okay so the backend server ends up turning off as it thinks it's "finished" collecting tasks. I don't think we have a great way of dealing with this outside of just launching a "reasonable" amount to cover all of the testing, and then note that too many submissions will eventually lead to tests failing once the server shuts down.
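The "different worker_id in the URL" idea from earlier in this thread could look roughly like the sketch below. It is only an illustration: `taskUrlForTest` is a hypothetical helper, and the base URL and the `assignment_id` parameter are assumptions about the local setup. Only `worker_id` is taken from the discussion.

```javascript
// Hypothetical helper: builds one task URL per test so that a submit in
// one test does not affect the next one.
// The base URL and assignment_id are assumptions about the local setup.
function taskUrlForTest(testIndex, baseUrl = "http://localhost:3000/") {
  const url = new URL(baseUrl);
  url.searchParams.set("worker_id", `cypress_worker_${testIndex}`);
  url.searchParams.set("assignment_id", String(testIndex));
  return url.toString();
}
```

A spec would then open its own task with `cy.visit(taskUrlForTest(2));`. The number of distinct workers that can submit is still bounded by how many tasks were launched, so this does not remove the limit described above.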

JackUrb

Implementation looks good to me, excited to have these tests in!

Etesam913 added 4 commits June 17, 2022 15:31

🐛 Fixed bug with remoteFunction() not existing

2536c94

✅ Added tests for remote_procedure template

025db41

Merge branch 'add-cypress-to-static-task' of https:/faceb…

c41c801

…ookresearch/Mephisto into add-tests-for-remote-examples

✅ Added action for remote_procedure template

2607e70

facebook-github-bot added the CLA Signed label Jun 18, 2022

Etesam913 changed the title ~~Add tests for remote examples~~ [WIP] Add tests for remote examples Jun 18, 2022

Etesam913 and others added 14 commits June 18, 2022 15:20

✅ Added tests for toxicity_detection example

eae5377

✏️ Added github action for toxicity_detection

67e53c0

✏️ Fixed typo in name of workflow

6ece9f5

🔥 Removed wait() statements

d996e6a

🔥 Removed spinner assertion test

adb023d

⏳ Added timeout

9895ca5

✅ Added tests for remote_procedure/mnist

82c7f31

✏️ Added mnist job to github action

10ae5f8

➕ Added dependencies to github action

2ac5d62

🔥 Removed linking from github action

1d1c855

🔀 Merged with add-cypress-to-static-task

21a1c4f

Merge branch 'add-cypress-to-static-task' of https:/faceb…

5fa8caf

…ookresearch/Mephisto into add-tests-for-remote-examples

✨ Added link script to all remote_examples

2799908

⏱ Increased wait time for toxicity alert

7fbe70a

Etesam913 changed the title ~~[WIP] Add tests for remote examples~~ Add tests for remote examples Jun 24, 2022

Etesam913 added 2 commits June 26, 2022 14:24

🥅 Added exception handling for post_build script

cd7a9fa

✅ Reformat to pass code-style

169ef90

pringshia mentioned this pull request Jun 28, 2022

Review outstanding PRs for Cypress integration and the Feedback/Tips component #807

Closed

🔀 Merged with main

3c5ed24

Etesam913 mentioned this pull request Jul 12, 2022

[Ongoing] Add an e2e integration test #726

Closed

Etesam913 added 2 commits July 14, 2022 11:58

🔀 Merged with main

494b384

✏️ Updated post_build_script to post_install_script

b550ec3

Etesam913 commented Jul 14, 2022

examples/remote_procedure/mnist/webapp/cypress/e2e/remote_procedure_mnist.cy.js (outdated)

Etesam913 commented Jul 14, 2022


Etesam913 added 2 commits July 14, 2022 12:33

✅ Added an alert check for template

719f812

♻️ Removed force:true from click()

✨ Added drawing to mnist test

277003c

Etesam913 requested a review from JackUrb July 14, 2022 18:47

Etesam913 commented Jul 14, 2022


Etesam913 added 2 commits July 14, 2022 17:44

✅ Changed 1 to 3 in mnist test for more consistency

2f82350

💡 Uncommented out submit button click

93c6311

JackUrb reviewed Jul 15, 2022


JackUrb approved these changes Jul 15, 2022


Etesam913 merged commit 1c14cf3 into main Jul 15, 2022

JackUrb deleted the add-tests-for-remote-examples branch July 15, 2022 18:53
