Force use_cache=True #496

borzunov · 2023-09-02T17:33:04Z

Petals supports only use_cache=True for inference.

However, we should not reject use_cache=False since it returns identical results (just forces the slower O(n^3) inference algorithm instead of the O(n^2) one).

I allow use_cache=False since some models use this setting for reasons unclear to me (see https://huggingface.co/garage-bAInd/Platypus2-70B-instruct/discussions/8), and this led to AssertionError before this PR.

borzunov · 2023-09-02T17:37:12Z

setup.cfg

@@ -40,7 +40,7 @@ install_requires =
    transformers>=4.32.0,<5.0.0  # if you change this, please also change version assert in petals/__init__.py
    speedtest-cli==2.1.3
    pydantic>=1.10,<2.0  # 2.0 is incompatible with hivemind yet
-    hivemind @ git+https://github.com/learning-at-home/hivemind
+    hivemind==1.1.10.post2


They are currently identical.

This reverts a part of #496 and instead overrides `use_cache` in `LlamaConfig`s only (so the correct value is visible by HF `.generate()` as well).

This reverts a part of bigscience-workshop#496 and instead overrides `use_cache` in `LlamaConfig`s only (so the correct value is visible by HF `.generate()` as well).

borzunov and others added 2 commits September 2, 2023 17:32

Force use_cache=True

13f66c4

Update setup.cfg

4d5a21c

borzunov commented Sep 2, 2023

View reviewed changes

borzunov merged commit abd5477 into main Sep 2, 2023
9 checks passed

borzunov deleted the force-use-cache-true branch September 2, 2023 18:57

borzunov mentioned this pull request Sep 2, 2023

Force use_cache=True in config only #497

Merged

borzunov added a commit that referenced this pull request Sep 2, 2023

Force use_cache=True in config only (#497)

b4d822a

This reverts a part of #496 and instead overrides `use_cache` in `LlamaConfig`s only (so the correct value is visible by HF `.generate()` as well).

d-popov pushed a commit to d-popov/petals-ai that referenced this pull request Sep 6, 2023

Force use_cache=True (bigscience-workshop#496)

1606bbe

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Force use_cache=True #496

Force use_cache=True #496

borzunov commented Sep 2, 2023 •

edited

Loading

borzunov Sep 2, 2023

Force use_cache=True #496

Force use_cache=True #496

Conversation

borzunov commented Sep 2, 2023 • edited Loading

borzunov Sep 2, 2023

Choose a reason for hiding this comment

borzunov commented Sep 2, 2023 •

edited

Loading