This repository has been archived by the owner on Dec 16, 2022. It is now read-only.

Adversarial bias mitigation #5269

Merged — ArjunSubramonian merged 12 commits into main from arjuns/adversarial-bias-mitigation on Jun 17, 2021

Conversation

ArjunSubramonian (Contributor)

Changes/additions proposed in this pull request:

  • Added AdversarialBiasMitigator, a Model wrapper to adversarially mitigate biases in predictions produced by a pretrained model for a downstream task. Tests and the model are in allenai/allennlp-models#281 (added AdversarialBiasMitigator tests and model).
  • Fixed IndexOutOfBoundsException in MultiOptimizer when checking if optimizer received any parameters.
  • Changed behavior of MultiOptimizer so that while a default optimizer is still required, an error is not thrown if the default optimizer receives no parameters.

Depends on allenai/allennlp-models#281.
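As a rough, framework-free sketch of what "adversarially mitigating bias" typically means here: one common formulation (Zhang et al., 2018, "Mitigating Unwanted Biases with Adversarial Learning") updates the predictor with its own gradient, minus the component that helps the adversary, minus a scaled copy of the adversary's gradient. The function below is an illustrative toy on plain Python lists, not the implementation in this PR:

```python
# Illustrative sketch only: one common adversarial-debiasing update rule
# (Zhang et al., 2018), not necessarily the exact rule used in this PR.

def debiased_gradient(pred_grad, adv_grad, alpha=1.0):
    """Combine predictor and adversary gradients for one parameter vector."""
    norm_sq = sum(a * a for a in adv_grad) or 1e-12
    coef = sum(p * a for p, a in zip(pred_grad, adv_grad)) / norm_sq
    # Project out the part of pred_grad that points along adv_grad (so the
    # predictor stops helping the adversary), then subtract alpha * adv_grad
    # to actively push against the adversary's objective.
    return [p - coef * a - alpha * a for p, a in zip(pred_grad, adv_grad)]

# With alpha=0 the result is exactly the projection: orthogonal to adv_grad.
g = debiased_gradient([1.0, 0.0], [1.0, 1.0], alpha=0.0)
```

With `alpha=0` the returned gradient carries no component along the adversary's gradient; the `alpha` term then trades predictor performance against hurting the adversary.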

@dirkgr (Member)

dirkgr commented Jun 16, 2021

I have misspelled "adversarial" all my life 🤦🏻 . Seems like autocorrect bailed me out a lot.

Comment on lines 249 to 251
!!! Note:
Intended to be used with `AdversarialBiasMitigator`.
trainer.model is expected to have `predictor` and `adversary` data members.
Member

You could, if you wanted to, put in a check for this condition and throw an exception if it isn't met.
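A minimal sketch of such a guard, using generic names and a plain `ValueError` (inside AllenNLP, its own `ConfigurationError` would be the natural exception type; `AdversaryPair` below is a hypothetical stand-in for `trainer.model`):

```python
class AdversaryPair:
    """Hypothetical stand-in for a trainer.model with the expected members."""
    predictor = object()
    adversary = object()

def check_adversarial_model(model):
    # Fail fast if the model doesn't expose the members the callback needs.
    for attr in ("predictor", "adversary"):
        if not hasattr(model, attr):
            raise ValueError(
                f"This callback expects trainer.model to have a "
                f"'{attr}' data member; did you wrap your model in "
                f"AdversarialBiasMitigator?"
            )

check_adversarial_model(AdversaryPair())  # passes silently
```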

}

trainer.model.predictor.zero_grad()
batch_outputs["loss"].backward()
Member

If "loss" and "adversary_loss" don't use exactly the same computation graph, does that mean that parts of the computation graph of "adversary_loss" could stick around when we don't want them to?

Contributor Author

That's a really good point about part of the computation graph not getting erased! Upon further reading, it looks like the computation graph will stay around until adversary_loss goes out of scope. So I added this in the callback to remove all references to the adversary_loss in the graph and instead keep a view of the loss that's not in the graph:

# remove adversary_loss from computation graph
batch_outputs["adversary_loss"] = batch_outputs["adversary_loss"].detach()
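A framework-free analogy of why the detach matters (assuming CPython reference counting; `Node` below is only a stand-in for an autograd graph node, not a real torch type): as long as an attached tensor is reachable, every graph node behind it stays alive.

```python
import weakref

class Node:
    """Stand-in for an autograd graph node that holds its parents alive."""
    def __init__(self, parents=()):
        self.parents = parents

leaf = Node()
intermediate = Node(parents=(leaf,))
loss = Node(parents=(intermediate,))
alive = weakref.ref(intermediate)
del intermediate

# Keeping `loss` around (like keeping an attached adversary_loss tensor in
# batch_outputs) keeps the whole graph behind it reachable.
assert alive() is not None

# "Detaching" keeps only the value and drops the reference into the graph,
# so once the last attached reference goes away, the graph can be freed.
detached_value = 0.0  # value-only view, no parents
del loss
```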

@@ -278,6 +278,11 @@ def __init__(
" Alternatively, you can remove this optimizer from the provided `optimizers`"
" if it is not relevant to a particular parameter group."
)
# default optimizer is required, but may not be used
Member

"must not be used" or "using it is optional"?

Member

I know what you mean from context, but it would be easier to read if worded unambiguously.

@ArjunSubramonian ArjunSubramonian enabled auto-merge (squash) June 17, 2021 17:50
@ArjunSubramonian ArjunSubramonian merged commit f1f51fc into main Jun 17, 2021
@ArjunSubramonian ArjunSubramonian deleted the arjuns/adversarial-bias-mitigation branch June 17, 2021 18:03
Comment on lines +16 to +17
of some protected variable Z. Informally, "knowing Y would not help
you predict Z any better than chance" (Zaldivar et al., 2018). This
Contributor

Should this be the other way round, i.e. "knowing Z would not help you predict Y"? Or is it stating that knowing the outcome shouldn't give you information about the protected variable?

Contributor Author

The latter, it's stating that knowing the outcome shouldn't give you information about the protected variable.
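A toy, pure-Python illustration of that reading (the data is made up for the example): when the predictions carry no information about Z, even the best adversary that reads only the predictions matches the majority-guess baseline.

```python
# Toy data, constructed so predictions are independent of Z.
preds = [0, 1, 0, 1, 0, 1, 0, 1]  # model outputs Y-hat
z     = [0, 0, 0, 0, 1, 1, 1, 1]  # protected variable Z

def best_adversary_accuracy(preds, z):
    """Best accuracy an adversary gets by guessing Z from each prediction value."""
    correct = 0
    for value in set(preds):
        # For each prediction value, guess the most common Z among matches.
        zs = [zi for p, zi in zip(preds, z) if p == value]
        correct += max(zs.count(0), zs.count(1))
    return correct / len(z)

# Baseline: always guess the majority value of Z, ignoring the predictions.
chance = max(z.count(0), z.count(1)) / len(z)
```

Here `best_adversary_accuracy(preds, z)` equals `chance`: knowing the outcome gives the adversary no edge in predicting the protected variable.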


vocab : `Vocabulary`
Vocabulary of predictor.
predictor : `Model`
Contributor

This is not strictly an issue. We use the term "predictor" differently elsewhere in the library; should we change the name here? If this is adhering to the paper's terminology, it's probably okay to keep it as is.

Contributor Author

Yes, I'm adhering to the paper's terminology.
