code details #61

Cancerce1l · 2019-03-22T11:15:02Z

So as I read in paper the offset of input image is sum of grid and offset map. but the code below is confusing for me. Can you kindly explain it a bit, thanks.
offsets_x = torch.cat([grid_x, grid_y + offsets_grid], 3)

and also, Does it require a particular initial weigth for morn, as I think that it might be the best to initialize identity transform matrix.

The text was updated successfully, but these errors were encountered:

Cancerce1l · 2019-03-22T11:32:44Z

actually following the first question I asked. I noticed that the last convolution output map's activate func isn't tanh, but instead,
offsets_posi = nn.functional.relu(offsets, inplace=False)
offsets_nega = nn.functional.relu(-offsets, inplace=False)
offsets_pool = self.pool(offsets_posi) - self.pool(offsets_nega)
Is their any particular reason?
Thank u.

Canjie-Luo · 2019-03-23T16:09:58Z

Sorry for late reply.

In MORAN v1, we found that the x-offset map didn’t result to significant improvement. Thus, we disabled it and only used y-offset map in v2. As for the question about the usage of the sampling function, please ref to the PyTorch document.
We removed the Tanh() in v2 for more stable convergence. The pooling operation was also updated. The new operation extras the maximum absolute values on the offset map.

Hopefully this will help you.

Cancerce1l · 2019-03-25T10:04:18Z

it helps a lot, thank u. I've trained a model based on x and y offset map, and it's doing poorly on the eval set. I think it's because of the x offset that affects the ctc decoding. Utill I figure how to fix this, I'll change to y-offset only too.
Thanks again.

Canjie-Luo · 2019-03-27T07:13:31Z

You're welcome!

PkuDavidGuan · 2021-09-28T03:16:01Z

@Canjie-Luo Hi, I still could not understand the code in

MORAN_v2/models/morn.py

Line 64 in 2cd40c4

offsets_pool = self.pool(offsets_posi) - self.pool(offsets_nega)

. Why do you add the positive and negative offsets with maximum absolute values?

I guess that you assume the offsets within the 2*2 block are in a similar range (both positive or negative), but the assumption might not be proper in the very beginning of the training.

Canjie-Luo closed this as completed Apr 20, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

code details #61

code details #61

Cancerce1l commented Mar 22, 2019 •

edited

Loading

Cancerce1l commented Mar 22, 2019 •

edited

Loading

Canjie-Luo commented Mar 23, 2019

Cancerce1l commented Mar 25, 2019

Canjie-Luo commented Mar 27, 2019

PkuDavidGuan commented Sep 28, 2021

code details #61

code details #61

Comments

Cancerce1l commented Mar 22, 2019 • edited Loading

Cancerce1l commented Mar 22, 2019 • edited Loading

Canjie-Luo commented Mar 23, 2019

Cancerce1l commented Mar 25, 2019

Canjie-Luo commented Mar 27, 2019

PkuDavidGuan commented Sep 28, 2021

Cancerce1l commented Mar 22, 2019 •

edited

Loading

Cancerce1l commented Mar 22, 2019 •

edited

Loading