inference time #9

ak9250 · 2021-01-12T00:56:51Z

are there ways to reduce inference time? Currently takes about 13 minutes on a k80 for the norway example at 432x288

rromb · 2021-02-12T09:54:36Z

Hi. Yes, one way is to store already calculated attention weights when creating a sequence. See for example https://huggingface.co/transformers/quickstart.html#using-the-past. Note that this is not currently implemented for our models as we wanted to stick to the very hackable minGPT implementation, but it would definitely be nice to look at.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

inference time #9

inference time #9

ak9250 commented Jan 12, 2021

rromb commented Feb 12, 2021

inference time #9

inference time #9

Comments

ak9250 commented Jan 12, 2021

rromb commented Feb 12, 2021