Bump transformers from 4.37.2 to 4.38.1 in /image_generation/stable_diffusion_1_5/cpp/scripts #250

dependabot · 2024-02-26T15:51:32Z

Bumps transformers from 4.37.2 to 4.38.1.

Release notes

v4.38.1

Fix eager attention in Gemma!

[Gemma] Fix eager attention #29187 by @sanchit-gandhi

TLDR:
-        attn_output = attn_output.reshape(bsz, q_len, self.hidden_size)
+        attn_output = attn_output.view(bsz, q_len, -1)
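The release notes do not explain why the fix swaps a fixed last dimension for -1. One plausible reading is that the merged attention width (num_heads * head_dim) need not equal hidden_size. The sketch below only does the shape arithmetic in plain Python; the helper name merged_head_shape and the sizes used are hypothetical and chosen for illustration, not taken from the transformers source.

```python
from math import prod


def merged_head_shape(attn_shape, hidden_size=None):
    """Compute the shape after merging the head axes of an attention output.

    attn_shape is (bsz, q_len, num_heads, head_dim).
    With hidden_size set, this mimics reshape(bsz, q_len, hidden_size).
    With hidden_size=None, this mimics view(bsz, q_len, -1), where the
    last dimension is inferred from the element count.
    """
    bsz, q_len, _num_heads, _head_dim = attn_shape
    total = prod(attn_shape)
    if hidden_size is None:
        # -1 means "whatever is left", i.e. num_heads * head_dim
        return (bsz, q_len, total // (bsz * q_len))
    if bsz * q_len * hidden_size != total:
        raise ValueError(
            f"cannot reshape {total} elements into {(bsz, q_len, hidden_size)}"
        )
    return (bsz, q_len, hidden_size)


# Illustrative sizes (assumption): 16 heads of width 256, hidden_size 3072.
shape = (1, 4, 16, 256)
print(merged_head_shape(shape))  # last dimension is inferred
try:
    merged_head_shape(shape, hidden_size=3072)
except ValueError as err:
    print("fixed hidden_size fails:", err)
```

When num_heads * head_dim happens to equal hidden_size, both forms give the same shape, which would explain why the old code worked for some configurations.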

v4.38: Gemma, Depth Anything, Stable LM; Static Cache, HF Quantizer, AQLM

New model additions

💎 Gemma 💎

Gemma is a new open-source language model series from Google AI that comes in 2B and 7B variants. The release includes pre-trained and instruction fine-tuned versions, which you can use via AutoModelForCausalLM, GemmaForCausalLM, or the pipeline interface.

Read more about it in the Gemma release blogpost: https://hf.co/blog/gemma
import torch  # needed for torch.float16 below
from transformers import AutoTokenizer, AutoModelForCausalLM

# Load the tokenizer and the 2B model in half precision
tokenizer = AutoTokenizer.from_pretrained("google/gemma-2b")
model = AutoModelForCausalLM.from_pretrained("google/gemma-2b", device_map="auto", torch_dtype=torch.float16)

# Tokenize the prompt, move it to the GPU, and generate
input_text = "Write me a poem about Machine Learning."
input_ids = tokenizer(input_text, return_tensors="pt").to("cuda")
outputs = model.generate(**input_ids)
You can use the model with Flash Attention, SDPA, static cache, and the quantization API for further optimizations!

Flash Attention 2
import torch  # needed for torch.float16 below
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("google/gemma-2b")
# Select the Flash Attention 2 kernels through attn_implementation
model = AutoModelForCausalLM.from_pretrained(
    "google/gemma-2b", device_map="auto", torch_dtype=torch.float16, attn_implementation="flash_attention_2"
)
input_text = "Write me a poem about Machine Learning."
input_ids = tokenizer(input_text, return_tensors="pt").to("cuda")
outputs = model.generate(**input_ids)

... (truncated)

Commits

a085774 Release: v4.38.1
2f54e0b [Gemma] Fix eager attention (#29187)
08ab54a [ gemma] Adds support for Gemma 💎 (#29167)
2de9314 [Maskformer] safely get backbone config (#29166)
476957b 🚨 Llama: update rope scaling to match static cache changes (#29143)
7a4bec6 Release: 4.38.0
ee3af60 Add support for fine-tuning CLIP-like models using contrastive-image-text exa...
0996a10 Revert low cpu mem tie weights (#29135)
15cfe38 [Core tokenization] add_dummy_prefix_space option to help with latest is...
efdd436 FIX [PEFT / Trainer ] Handle better peft + quantized compiled models (#29...
Additional commits viewable in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot will merge this PR once CI passes on it, as requested by @Wovchena.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

@dependabot rebase will rebase this PR
@dependabot recreate will recreate this PR, overwriting any edits that have been made to it
@dependabot merge will merge this PR after your CI passes on it
@dependabot squash and merge will squash and merge this PR after your CI passes on it
@dependabot cancel merge will cancel a previously requested merge and block automerging
@dependabot reopen will reopen this PR if it is closed
@dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
@dependabot show <dependency name> ignore conditions will show all of the ignore conditions of the specified dependency
@dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
@dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
@dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Bumps [transformers](https://github.com/huggingface/transformers) from 4.37.2 to 4.38.1.
- [Release notes](https://github.com/huggingface/transformers/releases)
- [Commits](huggingface/transformers@v4.37.2...v4.38.1)

---
updated-dependencies:
- dependency-name: transformers
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
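The commit metadata labels this bump as semver-minor. As a rough illustration of how that label follows from the two version strings, here is a minimal sketch; classify_update is a hypothetical helper, not Dependabot's implementation, and it assumes plain MAJOR.MINOR.PATCH versions with no pre-release suffixes.

```python
def classify_update(old: str, new: str) -> str:
    """Name the first semver component that differs between two versions."""
    old_parts = [int(part) for part in old.split(".")]
    new_parts = [int(part) for part in new.split(".")]
    labels = ("major", "minor", "patch")
    for label, old_value, new_value in zip(labels, old_parts, new_parts):
        if old_value != new_value:
            # The most significant changed component decides the update type
            return f"version-update:semver-{label}"
    return "no-change"


print(classify_update("4.37.2", "4.38.1"))
```

The patch number also changes here (2 to 1), but the minor component differs first, so the bump counts as a minor update.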

Wovchena

@dependabot squash and merge

dependabot bot added the "dependencies" label (Pull requests that update a dependency file) Feb 26, 2024

Wovchena approved these changes Feb 26, 2024


dependabot bot merged commit 8470250 into master Feb 26, 2024
2 checks passed

dependabot bot deleted the dependabot/pip/image_generation/stable_diffusion_1_5/cpp/scripts/transformers-4.38.1 branch February 26, 2024 16:16
