avoid recency bias in prompt construction #104

Open
AndreasKarasenko opened this issue Jun 18, 2024 · 3 comments

@AndreasKarasenko
Contributor

Context
According to this paper, ChatGPT (and likely other LLMs) suffers from a recency bias: whatever class comes last has a higher probability of being selected.
Issue
Currently scikit-llm constructs prompts based on the order of the training data.
Since restricting the size of the training data is recommended, I would usually do something like this:

df = df.groupby(label_col).apply(lambda x: x.sample(n_samples))  # n_samples examples per class
df = df.reset_index(drop=True)  # drop the MultiIndex created by groupby/apply

This returns a dataframe sorted by label_col. Even if sort=False is passed to groupby, the instances are still clustered by label.

Question/Solution
Should a method be implemented that randomizes the order of samples in the prompt / training data, or should users take care of that themselves?
The most straightforward way would be to simply add this to sampling:

df = df.sample(frac=1)

This leaves it up to chance whether the resulting order is reasonably balanced.
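A more controlled option (just a sketch, assuming a pandas DataFrame df with a label column label_col; interleave_by_label is a hypothetical helper, not an existing scikit-llm feature) would be to shuffle within each class and then interleave the classes round-robin, so that no label systematically ends up last in the prompt:

import pandas as pd

def interleave_by_label(df: pd.DataFrame, label_col: str, seed: int = 42) -> pd.DataFrame:
    # Shuffle within each class, then interleave the classes round-robin
    # so that no label is consistently placed at the end of the prompt.
    groups = [
        g.sample(frac=1, random_state=seed).reset_index(drop=True)
        for _, g in df.groupby(label_col, sort=False)
    ]
    longest = max(len(g) for g in groups)
    rows = [g.iloc[i] for i in range(longest) for g in groups if i < len(g)]
    return pd.DataFrame(rows).reset_index(drop=True)

df = interleave_by_label(df, label_col)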

@OKUA1
Collaborator

OKUA1 commented Jun 18, 2024

Hi @AndreasKarasenko,

Yes, the order of the samples introduces some bias. For the regular FewShot this can easily be solved by permuting the training data. It is not that straightforward in the DynamicFewShot and would require some refactoring.
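For the regular FewShot case, that permutation could look roughly like this (a minimal sketch, assuming the scikit-llm 1.x import path and an already down-sampled X / y; constructor and parameter names may differ between versions):

import numpy as np
# Import path follows the scikit-llm 1.x layout; older versions expose the class at the top level.
from skllm.models.gpt.classification.few_shot import FewShotGPTClassifier

rng = np.random.default_rng(42)
perm = rng.permutation(len(X))            # X: list of texts, y: list of labels
X_shuffled = [X[i] for i in perm]
y_shuffled = [y[i] for i in perm]

clf = FewShotGPTClassifier(model="gpt-3.5-turbo")
clf.fit(X_shuffled, y_shuffled)           # few-shot examples now appear in random order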

On the other hand, I am not sure whether it poses such a big problem. The study you provided is from 2021 and hence relatively outdated.

Also, from my personal observations, sometimes even in the ZeroShot setting the order of the candidate labels matters. Therefore, the prompt would probably always introduce some bias that can hardly be completely avoided.
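One way to probe this in the ZeroShot setting (again just a sketch; the import path and parameter name are assumptions and may differ across scikit-llm versions, and X_test is a placeholder list of texts) is to fit the classifier with differently ordered candidate labels and compare the predictions:

from skllm.models.gpt.classification.zero_shot import ZeroShotGPTClassifier

labels = ["positive", "negative", "neutral"]

clf_a = ZeroShotGPTClassifier(model="gpt-3.5-turbo")
clf_a.fit(None, labels)                   # candidate labels in the original order
clf_b = ZeroShotGPTClassifier(model="gpt-3.5-turbo")
clf_b.fit(None, labels[::-1])             # same labels, reversed order

preds_a = clf_a.predict(X_test)
preds_b = clf_b.predict(X_test)
agreement = sum(a == b for a, b in zip(preds_a, preds_b)) / len(preds_a)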

@AndreasKarasenko
Contributor Author

A forward search yields this paper from 2024, which supports your last point and also points to this paper from 2021/2022. You're probably right that accounting for all biases might be out of scope. Maybe a best-practices section would be appropriate then?

@OKUA1
Collaborator

OKUA1 commented Jun 19, 2024

Yes, I agree that it is a good idea to at least mention it somewhere and in the future think about refactoring the code a bit to minimize this bias.

I will keep the issue open for now.
