Replication for MS MARCO passage on Colab #76

Merged
merged 7 commits into from
Sep 9, 2020


qguo96
Contributor

@qguo96 qguo96 commented Sep 8, 2020

Replication on Colab was successful.

Colab Environment:
OS: Ubuntu-18.04.3 LTS
Java: 11.0.8
Python: 3.6.9
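
For anyone reporting a replication of their own, the environment details above can be gathered from a notebook cell. The following is a minimal sketch and not part of this PR: the helper name `describe_environment` is hypothetical, and it assumes `java` is on the `PATH` if Java is installed.

```python
import platform
import shutil
import subprocess


def describe_environment():
    """Collect OS, Java, and Python versions for a replication report.

    Hypothetical helper for illustration; not part of pygaggle.
    """
    java_version = None
    if shutil.which("java"):
        # `java -version` writes its banner to stderr rather than stdout.
        result = subprocess.run(
            ["java", "-version"],
            stdout=subprocess.PIPE,
            stderr=subprocess.PIPE,
            universal_newlines=True,
        )
        banner = (result.stderr or result.stdout).strip()
        java_version = banner.splitlines()[0] if banner else None
    return {
        "os": platform.platform(),
        "java": java_version,  # None when no `java` executable is found
        "python": platform.python_version(),
    }


if __name__ == "__main__":
    for name, value in describe_environment().items():
        print(f"{name}: {value}")
```

The sketch avoids `capture_output=True` so that it also runs on Python 3.6, the version listed above.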

Results are shown below:
Re-Ranking with monoBERT:
Screenshot (1157)

Re-Ranking with monoT5:
Screenshot (1162)

q35guo and others added 2 commits September 8, 2020 18:45
replication on Colab demo (Notebook)
Add instructions to replicate on Colab
@qguo96
Contributor Author

qguo96 commented Sep 8, 2020

I added a Colab notebook under pygaggle/notebooks; otherwise, it would take up too much space. I'm not sure if this is a good idea.

@lintool
Member

lintool commented Sep 8, 2020

I was just thinking this - add a section of prose here: https://github.com/castorini/pygaggle/blob/master/docs/experiments-msmarco-passage.md

describing any tips/tricks for getting the replication working on Colab, as opposed to including the entire notebook... we want the URAs that come after to actually do some work... ;) and not just click "run all cells"!

thoughts @ronakice ?

@qguo96
Contributor Author

qguo96 commented Sep 9, 2020

Is the guide concise enough? I followed the guide and redid the replication. Newcomers may need a little time to understand it.

@ronakice
Member

ronakice commented Sep 9, 2020

Yep, I think that's why I was hesitant to share the notebooks (although I have some for MS MARCO Passage and CovidQA since that's how I initially ran the experiments).

@ronakice
Member

ronakice commented Sep 9, 2020

@lintool @qguo96 I also think we should move the instructions for converting Colab into a GPU instance with full SSH access elsewhere. While they are very useful, I think they need to be in some other place, since they are useful for all of the instructions?

@lintool
Member

lintool commented Sep 9, 2020

Created a new repo for holding this information: https://github.com/castorini/onboarding

@qguo96 sorry about changing our minds. Can you send a PR against this new repo with your instructions?

And I thought the replication log "Results replicated by ... (Tesla T4 on Colab)" should make it clear that this was on Colab?

@qguo96
Contributor Author

qguo96 commented Sep 9, 2020

Got it. I will open a new PR against the new repo.

@lintool lintool merged commit cc85405 into castorini:master Sep 9, 2020