This repository has been archived by the owner on Feb 25, 2022. It is now read-only.

Tips & tricks to speed up inference #179

Closed
danielpatrickhug opened this issue Mar 29, 2021 · 3 comments

Comments

@danielpatrickhug

Hi Everyone,

Using the code from the example notebook for GPT-Neo 2.7B, I was wondering if anyone had any tips to speed up the model's inference. I was also wondering how I could change the code to stop the decoder after a set number of characters. Does anyone have any advice, or can you point me in the right direction? Thank you.

@sdtblck
Collaborator

sdtblck commented Mar 29, 2021

Hi, the main problem is that our library uses tf.estimator, which isn't really designed for inference: it has to reload the graph every time it's called.

If you want fast inference, I'd recommend using the HuggingFace port when it's ready. Another option would be to do something like https://github.com/marcsto/rl/blob/master/src/fast_predict2.py
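
For reference, the trick in that link is to keep a single `estimator.predict()` call alive by feeding it from a generator, so the graph is built only once. A minimal sketch of the pattern, not gpt-neo's actual input pipeline (the `FastPredict` name, the `input_fn_builder` helper, and the `tokens` feature are all illustrative):

```python
import tensorflow as tf

class FastPredict:
    """Wraps estimator.predict() so the graph is built once and reused;
    subsequent calls just push new features through the live pipeline."""

    def __init__(self, estimator, input_fn_builder):
        self.estimator = estimator
        self.input_fn_builder = input_fn_builder
        self.first_run = True
        self.closed = False

    def _generator(self):
        # Keep yielding the most recently submitted features until closed;
        # this stops predict() from finishing and tearing down the graph.
        while not self.closed:
            yield self.next_features

    def predict(self, features):
        self.next_features = features
        if self.first_run:
            self.predictions = self.estimator.predict(
                input_fn=self.input_fn_builder(self._generator))
            self.first_run = False
        return next(self.predictions)

    def close(self):
        self.closed = True
        try:
            next(self.predictions)  # let the generator exit cleanly
        except StopIteration:
            pass


def input_fn_builder(generator):
    # Illustrative: adapt output_types/output_shapes to the model's
    # real input features.
    def input_fn():
        ds = tf.data.Dataset.from_generator(
            generator,
            output_types={"tokens": tf.int64},
            output_shapes={"tokens": tf.TensorShape([None])})
        return ds.batch(1)
    return input_fn
```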

Re: your second point: I'm working on a PR to make sampling a bit more user-friendly. This will include things like stopping generation after a set number of characters, or at a specific token.

In the meantime, you can change the values that get passed into this function https://github.com/EleutherAI/gpt-neo/blob/master/model_fns.py#L99
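
Until that PR lands, the simplest stopgap is to over-generate and trim the decoded text yourself. A rough sketch of that post-processing (the stop token and character limit below are placeholders; use whatever your tokenizer actually emits):

```python
def trim_generation(text, max_chars=280, stop_token="<|endoftext|>"):
    """Post-hoc truncation: cut at the first stop token if present,
    then enforce a hard character budget."""
    stop_idx = text.find(stop_token)
    if stop_idx != -1:
        text = text[:stop_idx]
    return text[:max_chars]

# e.g. trimmed = trim_generation(decoded_output, max_chars=200)
```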

@danielpatrickhug
Author

Thank you for responding; I appreciate the help. Is there anything I can do to help with the port over to Hugging Face, or with this repository?

@sdtblck
Collaborator

sdtblck commented Mar 29, 2021

> Thank you for responding; I appreciate the help. Is there anything I can do to help with the port over to Hugging Face, or with this repository?

You'd have to ask HuggingFace about that, but it seems like they mostly have everything under control.

This repository is open to PRs. If you could figure out a way to monkey-patch tf.estimator, like in the link I posted above, so the graph doesn't have to be reloaded every time it's called, that's definitely something we'd be interested in using.
