Skip to content
This repository has been archived by the owner on Dec 16, 2022. It is now read-only.

added AdversarialBiasMitigator tests and model #281

Merged
merged 11 commits into from
Jun 18, 2021

Conversation

ArjunSubramonian
Copy link
Contributor

@ArjunSubramonian ArjunSubramonian commented Jun 16, 2021

  • Added AdversarialBiasMitigator tests.
  • Added adversarial-binary-gender-bias-mitigated-roberta-snli model.

Depends on: allenai/allennlp#5269.

@dirkgr
Copy link
Member

dirkgr commented Jun 17, 2021

I'm not reviewing this until allenai/allennlp#5269 is merged, right?

"display_name": "Adversarial Binary Gender Bias-Mitigated RoBERTa SNLI",
"task_id": "textual_entailment",
"model_details": {
"description": "This `Model` implements a basic text classifier and feedforward regression adversary with an adversarial bias mitigator wrapper. The text is embedded into a text field using a RoBERTa-large model. The resulting sequence is pooled using a cls_pooler `Seq2VecEncoder` and then passed to a linear classification layer, which projects into the label space. Subsequently, a `FeedForwardRegressionAdversary` attempts to recover the coefficient of the static text embedding in the binary gender bias subspace. While the adversary's gradients are computed normally, the classifier's gradients are computed such that updates to the classifier's parameters will not aid the adversary and will make it more difficult for the adversary to recover binary gender bias.",
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If this method really doesn't work that well, as you said in standup, then I think we should have a description of that somewhere.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The method actually ended up working quite well.

@ArjunSubramonian ArjunSubramonian changed the title added AdversarialBiasMitigator tests added AdversarialBiasMitigator tests and model Jun 18, 2021
"preprocessing": null
},
"quantitative_analyses": {
"unitary_results": null,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm curious; what numbers did we get?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Added!

@ArjunSubramonian ArjunSubramonian merged commit bdf82a1 into main Jun 18, 2021
@ArjunSubramonian ArjunSubramonian deleted the arjuns/adversarial-bias-mitigation branch June 18, 2021 17:26
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants