hellokan.ipynb returns NaN instead of formula. #179

Stealeristaken · 2024-05-13T12:08:50Z

Hi.

I was trying the hellokan.ipynb notebook. I scaled up the number of training steps from 50 to 150. In the end, train_loss started to return NaN instead of a value.

I thought it might be a kernel error, so I re-downloaded the original hellokan.ipynb and reran it without any edits. It returned NaN once again. I will attach a screenshot of the problem.

KindXiaoming · 2024-05-13T21:37:49Z

Hi, the problem was caused by the appearance of the log function in the formula, which is unexpected behavior. This means that the pruning step did not work well. Could you show the plot you have after pruning? Based on feedback from others, you may try model.prune(threshold=5e-2) instead of just model.prune().
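To illustrate why a log term can surface as NaN: the real logarithm is undefined for negative inputs, and tensor libraries such as PyTorch return NaN there instead of raising an error. The sketch below mimics that behavior in plain Python (where math.log itself raises ValueError on negative input). The helpers `log_or_nan` and `mean_squared_error` are hypothetical names used only for this illustration; they are not part of pykan.

```python
import math

def log_or_nan(x: float) -> float:
    """Mimic tensor-library log: NaN for negative input, -inf at zero."""
    if x < 0:
        return float("nan")
    if x == 0:
        return float("-inf")
    return math.log(x)

def mean_squared_error(preds, targets):
    """Plain mean of squared differences; any NaN term makes the mean NaN."""
    return sum((p - t) ** 2 for p, t in zip(preds, targets)) / len(preds)

# Hypothetical inputs: a single negative value is enough to poison the loss.
inputs = [0.5, 2.0, -1.0]
preds = [log_or_nan(x) for x in inputs]
targets = [0.0, 0.0, 0.0]

loss = mean_squared_error(preds, targets)
print(math.isnan(loss))
```

Because NaN propagates through every arithmetic operation, one out-of-domain sample is enough to turn the whole averaged loss into NaN.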

Stealeristaken · 2024-05-16T17:48:42Z

Sorry for the late response. I tried both prune options (with and without a specified threshold) and the result was the same. I am adding pictures.

[Image: plot with threshold specified]

[Image: plot with no threshold specified]

ShuleiCao · 2024-05-21T02:27:04Z

I encountered a similar issue, but I found that increasing the step size helped.

KindXiaoming · 2024-05-21T13:34:44Z

@Stealeristaken, in block [8], the threshold should be specified again: model = model.prune(threshold=5e-2)
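As a rough picture of what the threshold controls, the sketch below drops every edge whose importance score falls below a cutoff. It is only a conceptual illustration: the edge names, the scores, and the `prune_edges` helper are all hypothetical, and pykan's actual prune() may compute and apply its scores differently.

```python
def prune_edges(scores: dict, threshold: float) -> dict:
    """Keep only the edges whose importance score reaches the threshold."""
    return {edge: s for edge, s in scores.items() if s >= threshold}

# Hypothetical importance scores for four edges of a small network.
scores = {"x1->h1": 0.90, "x2->h1": 0.40, "x1->h2": 0.03, "x2->h2": 0.008}

loose = prune_edges(scores, threshold=1e-2)   # smaller cutoff keeps more edges
strict = prune_edges(scores, threshold=5e-2)  # cutoff suggested in this thread

print(sorted(loose))
print(sorted(strict))
```

With the smaller cutoff the weak edge x1->h2 survives, while the larger cutoff removes it. A weak edge that survives pruning is one plausible way an unexpected function could end up in the fitted formula.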

KindXiaoming · 2024-05-21T13:35:40Z

@ShuleiCao Thanks, yes, the pruning results can depend on quite a few factors. Training longer will usually produce a sparser network.

Stealeristaken mentioned this issue May 18, 2024

When running on Apple GPU (MPS), the loss is always nan. #199

Open

KindXiaoming closed this as completed Jul 14, 2024
