-
Notifications
You must be signed in to change notification settings - Fork 907
The drop connect rate (aka survival rate) is incorrect #200
Comments
I check the drop rate per block and it looks fine: block1a_ 1.0 |
@darcula1993 I'm confused. Shouldn't the block rate be at ~0.8 for the final block since the drop_connect_rate is 0.2 by default? |
So it turns out I pasted different values. However the problem remains as indicated. |
I check the code and seems that b can be greater than num of blocks,not sure why. |
I've observed the same thing as well. |
Qubvel's implementation does not calculate the total number of blocks correctly for configurations larger than B0. |
Practically it does perform better than official. -_- . |
I've only observed better performance in one case so I'm not sure it generalizes. In that case, the improved performance does indicate that extreme low survival rates (<0.3) might be a good regularization approach. |
Well, I'm not sure, maybe I need to look again properly. In fact, I spent almost a week assuming that there probably some problem with my data loader using the official efficient-net. But when I use non-official implementation, it was just fine. |
I originally posted this as an issue here: qubvel/efficientnet#135
However I noticed the two implementations were the same and the error exists here as well, so I decided to post it here.
I just verified with the reference tf.keras implementation, and here are the results. Below is the output for B5
This implementation's drop connect rate
(index, name, rate)
Tensorflow's drop connect rate
At index 18 it's off by a significant amount
The text was updated successfully, but these errors were encountered: