Add TPU service and task example #1416
Conversation
What about a README file? Do you have an idea of what could be written there? |
@peterschmidt85 Do you mean this README file? |
@Bihan I mean typically every example we have has a README.md like https://github.com/dstackai/dstack/blob/master/examples/fine-tuning/axolotl/README.md (a detailed one) or https://github.com/dstackai/dstack/blob/master/examples/fine-tuning/qlora/README.md (a short one; didn't have time to write a good one). |
@peterschmidt85 I've added a README.md here for the TPU/TGI example. I will add more details to this README file. |
Oh, I missed it when I reviewed it. If you want to add more details, what could we add that would be most useful? |
@peterschmidt85 We can add about the image huggingface/optimum-tpu:latest, point to the optimum-tpu repo (https://github.com/huggingface/optimum-tpu) for more details, and mention the models supported by it (Gemma 2B/7B, Llama 2 7B, and Llama 3 8B). Finally, we can mention that all TPU cores are utilized in inference. |
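The README points discussed above could be illustrated with a minimal dstack service configuration. This is only an illustrative sketch, not the actual file from this PR; the service name, model ID, launcher flags, and port are assumptions:

```yaml
# Illustrative sketch only -- not the actual example from this PR.
# Assumes dstack's service schema (type/image/env/commands/port) and that
# the optimum-tpu image exposes TGI's text-generation-launcher entrypoint.
type: service
name: tgi-optimum-tpu        # hypothetical name

image: huggingface/optimum-tpu:latest
env:
  # optimum-tpu supports Gemma (2B, 7B), Llama 2 (7B), and Llama 3 (8B)
  - MODEL_ID=google/gemma-2b
commands:
  - text-generation-launcher --model-id $MODEL_ID --port 8000
port: 8000
```

The README could also call out that all TPU cores are utilized during inference, as noted above.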
Sounds good to me.
Best regards,
Andrey |
… example temporarily; minor edits; WIP
… example temporarily; minor edits; WIP
Change optimum-tpu fork from bihan to dstack repo
[Optimum TPU :material-arrow-top-right-thin:{ .external }](https://github.com/huggingface/optimum-tpu){:target="_blank"}.
Llama 3.1 8B using
[Optimum TPU :material-arrow-top-right-thin:{ .external }](https://github.com/huggingface/optimum-tpu){:target="_blank"}
and [vLLM :material-arrow-top-right-thin:{ .external }](https://github.com/huggingface/optimum-tpu){:target="_blank"}.
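For the Llama 3.1 8B / vLLM entry referenced in the snippet above, a similarly hedged configuration sketch could accompany the docs. The service name, model ID, serve command, and port here are assumptions, not taken from the actual example:

```yaml
# Illustrative sketch only -- field values are assumptions.
type: service
name: vllm-tpu               # hypothetical name

env:
  - MODEL_ID=meta-llama/Meta-Llama-3.1-8B-Instruct
commands:
  # newer vLLM releases ship a `vllm serve` entrypoint; older ones use
  # `python -m vllm.entrypoints.openai.api_server --model ...`
  - vllm serve $MODEL_ID --port 8000
port: 8000
```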
Noticed vLLM link is wrong. I will update it.
Closing PR due to squashing complexity and for repo cleanliness: this PR is being closed to address issues with squashing commits and to maintain a clean commit history. A new PR has been opened with the changes consolidated into a single, clean commit. |