
curriculum training #5

Closed
Jiakui opened this issue Jan 14, 2019 · 3 comments

Comments


Jiakui commented Jan 14, 2019

The paper says the model should be trained in three stages, but I don't see where the code implements these three stages. If we run the code as-is, can we reproduce the accuracy reported in the paper?

Thanks!

@Canjie-Luo
Owner

Yes, the MORAN v1 in our paper was trained with curriculum learning. However, the review process took a long time, and in the meantime we developed and released MORAN v2 for more stable convergence and higher accuracy. The released version was trained end-to-end in a single stage, yet it outperforms MORAN v1; the results are reported in the README.md. Run the code and manually decrease the learning rate, and you can reproduce the v2 results. By the way, training MORAN v2 with curriculum learning may yield even better performance.


BlakeXiaochu commented Jan 19, 2019

Hi~ So we can reproduce the v2 results by running the code and manually decreasing the learning rate. What is the strategy for decreasing the learning rate? For example, dividing it by 10 every 100,000 iterations? Thx~

@Canjie-Luo
Owner

Please divide the learning rate by 10 after 3 epochs. 6 epochs in total should be enough for training. Enjoy yourself~
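The suggested schedule (drop the learning rate by a factor of 10 after 3 epochs, 6 epochs total) can be sketched as below. This is only a minimal illustration, not the repository's actual training script: `lr_for_epoch` is a hypothetical helper, and the base learning rate of 1.0 is a placeholder, not necessarily what the repo uses.

```python
def lr_for_epoch(epoch, base_lr=1.0, decay_epoch=3, gamma=0.1):
    """Step schedule: multiply base_lr by gamma once decay_epoch is reached."""
    return base_lr * gamma if epoch >= decay_epoch else base_lr

# 6 epochs total; the learning rate drops once, after 3 epochs.
for epoch in range(6):
    print(f"epoch {epoch}: lr = {lr_for_epoch(epoch)}")
```

If you are training with PyTorch, the equivalent built-in is `torch.optim.lr_scheduler.MultiStepLR(optimizer, milestones=[3], gamma=0.1)`, stepped once per epoch.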
