-
Notifications
You must be signed in to change notification settings - Fork 14
Open
Description
Hi, thanks for your great work!
I’m currently trying to replicate the EM Routing experiments. I noticed that in the original paper, the authors used shallower convolutional layers, so I switched from ResNet20 (which you implemented) to SmallNet.
However, I found something strange:
On RTX 4090, training with ResNet20 takes only about 30 seconds per epoch, but training with SmallNet takes about 2 minutes per epoch—which is much slower, even though SmallNet is supposed to have less computation.
Do you have any idea why this might be happening? Is there any optimization I should consider when using SmallNet, or is there something specific in the implementation that could cause this slowdown?
Thanks for your help!
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels