KAN.initialize_from_another_model() error:Expected input and other to have the same dtype, but got input's dtype Float and other's dtype Double #146

wwz1126 · 2024-05-10T03:07:57Z

input is tensor([[ 0.0000, -0.2120],
[ 0.0000, -0.0247],
[ 0.0000, 0.2150],
...,
[ 0.0000, 0.7221],
[ 0.0000, -0.6781],
[ 0.0000, 0.3832]], dtype=torch.float64);

model = KAN(width=[2,3,1], grid=3, k=3)
train loss: 5.90e-01 | test loss: 6.14e-01 | reg: 1.03e+01 : 100%|██| 20/20 [00:08<00:00, 2.37it/s]
model.train(dataset, opt="LBFGS", steps=20,)

initialize a more fine-grained KAN with G=10
model2 = KAN(width=[2,3,1], grid=10, k=3)

initialize model2 from model
model2.initialize_from_another_model(model, dataset['train_input']);

RuntimeError Traceback (most recent call last)
Cell In[7], line 4
2 model2 = KAN(width=[2,3,1], grid=10, k=3)
3 # initialize model2 from model
----> 4 model2.initialize_from_another_model(model, dataset['train_input'])

File E:\jupyter\KAN\pykan-master\pykan-master\kan\KAN.py:196, in KAN.initialize_from_another_model(self, another_model, x)
193 another_model(x.to(another_model.device)) # get activations
194 batch = x.shape[0]
--> 196 self.initialize_grid_from_another_model(another_model, x.to(another_model.device))
198 for l in range(self.depth):
199 spb = self.act_fun[l]

File E:\jupyter\KAN\pykan-master\pykan-master\kan\KAN.py:275, in KAN.initialize_grid_from_another_model(self, model, x)
273 model(x)
274 for l in range(self.depth):
--> 275 self.act_fun[l].initialize_grid_from_parent(model.act_fun[l], model.acts[l])

File E:\jupyter\KAN\pykan-master\pykan-master\kan\KANLayer.py:253, in KANLayer.initialize_grid_from_parent(self, parent, x)
251 x_pos = parent.grid
252 sp2 = KANLayer(in_dim=1, out_dim=self.size, k=1, num=x_pos.shape[1] - 1, scale_base=0., device=self.device)
--> 253 sp2.coef.data = curve2coef(sp2.grid, x_pos, sp2.grid, k=1, device=self.device)
254 y_eval = coef2curve(x_eval, parent.grid, parent.coef, parent.k, device=self.device)
255 percentile = torch.linspace(-1, 1, self.num + 1).to(self.device)

File E:\jupyter\KAN\pykan-master\pykan-master\kan\spline.py:137, in curve2coef(x_eval, y_eval, grid, k, device)
135 # x_eval: (size, batch); y_eval: (size, batch); grid: (size, grid); k: scalar
136 mat = B_batch(x_eval, grid, k, device=device).permute(0, 2, 1)
--> 137 coef = torch.linalg.lstsq(mat.to('cpu'), y_eval.unsqueeze(dim=2).to('cpu')).solution[:, :, 0] # sometimes 'cuda' version may diverge
138 return coef.to(device)

RuntimeError: torch.linalg.lstsq: Expected input and other to have the same dtype, but got input's dtype Float and other's dtype Double.

In example_1_function fitting, input is tensor([[-0.0075, 0.5547], [[-0.0075, 0.5547],

[ -0.8230, 0.1526], ...
... ,

[ 0.0036, -0.3966], ...
[-0.1923, -0.8376]]), which was successful.  I want to know what I should do about it.

The text was updated successfully, but these errors were encountered:

KindXiaoming · 2024-05-10T03:36:04Z

Hi it seems like in your first input tensor, the first inputs are zero for all samples. This can lead to a singular problem. You could remove the first column and create a KAN which takes in only the second dimension.

Jim137 · 2024-05-10T04:43:39Z

Hi,
It seems like you're facing a similar issue to #129. A quick workaround is to ensure that your input dtype is float32.
I'll create a PR to solve that right away.

wwz1126 · 2024-05-10T07:45:20Z

嗨，您似乎面临着与#129类似的问题。一个快速的解决方法是确保输入 dtype 为 float32。我将创建一个 PR 来立即解决这个问题。

I've made changes based on #129 similar issue and the problem persists. But I made sure that entering the dtype as float32 solved the problem. Thanks to

wwz1126 · 2024-05-10T07:59:16Z

嗨，似乎在您的第一个输入张量中，所有样本的第一个输入都为零。这可能会导致一个单一的问题。您可以删除第一列并创建一个仅包含第二个维度的 KAN。
I found that using one column of data to train didn't give as good a result as adding the first column of 2 columns, even though the newly added columns were all 0.

Jim137 · 2024-05-10T10:19:42Z

I've made changes based on #129 similar issue and the problem persists.

Please give #148 a try; it's designed to address the issue you're facing. Note that #129 only resolved the problem with coef2curve, but here the problem occurs in curve2coef.
Let me know if the problem persists after applying the fix from #148.

Jim137 mentioned this issue May 10, 2024

Fix dtype error in curve2coef #148

Merged

wkqian06 mentioned this issue May 12, 2024

Runtime Error in hellokan.ipynb #173

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

KAN.initialize_from_another_model() error:Expected input and other to have the same dtype, but got input's dtype Float and other's dtype Double #146

KAN.initialize_from_another_model() error:Expected input and other to have the same dtype, but got input's dtype Float and other's dtype Double #146

wwz1126 commented May 10, 2024 •

edited

Loading

KindXiaoming commented May 10, 2024 •

edited

Loading

Jim137 commented May 10, 2024

wwz1126 commented May 10, 2024

wwz1126 commented May 10, 2024

Jim137 commented May 10, 2024

KAN.initialize_from_another_model() error:Expected input and other to have the same dtype, but got input's dtype Float and other's dtype Double #146

KAN.initialize_from_another_model() error:Expected input and other to have the same dtype, but got input's dtype Float and other's dtype Double #146

Comments

wwz1126 commented May 10, 2024 • edited Loading

KindXiaoming commented May 10, 2024 • edited Loading

Jim137 commented May 10, 2024

wwz1126 commented May 10, 2024

wwz1126 commented May 10, 2024

Jim137 commented May 10, 2024

wwz1126 commented May 10, 2024 •

edited

Loading

KindXiaoming commented May 10, 2024 •

edited

Loading