You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Firstly thanks for developing chronumental, it's been really helpful!
I've noticed that the loss and error metrics spike quite noticeably during runs:
I don't know enough about the optimizer that is being used to know whether this is expected behaviour, but it does seem a little odd.
I guess you could get unlucky if the run finished during a spike. In this run it looks like the final parameters are actually just at the end of a spike and not properly back on the plateau, which I assume would be preferable.
Anyway, I wondered whether this is something to be concerned about, whether its pointing to me doing something wrong, and/or whether it might point you to something that could be improved.
Happy to share more details and the data.
Many thanks
David
The text was updated successfully, but these errors were encountered:
In this run it looks like the final parameters are actually just at the end of a spike and not properly back on the plateau, which I assume would be preferable.
My belief is that this should not be an issue.
Unless this is set we should use the params that gave the lowest loss. (Unless there's a bug of which I'm not aware).
More generically though, I have a fair bit of paranoia about whether I may have broken some bits of Chronumental with some changes in the last couple of years and I have some overdue benchmarking to perform. It is certainly good to be cautious about its behaviour.
Hi @theosanderson
Firstly thanks for developing chronumental, it's been really helpful!
I've noticed that the loss and error metrics spike quite noticeably during runs:
I don't know enough about the optimizer that is being used to know whether this is expected behaviour, but it does seem a little odd.
I guess you could get unlucky if the run finished during a spike. In this run it looks like the final parameters are actually just at the end of a spike and not properly back on the plateau, which I assume would be preferable.
Anyway, I wondered whether this is something to be concerned about, whether its pointing to me doing something wrong, and/or whether it might point you to something that could be improved.
Happy to share more details and the data.
Many thanks
David
The text was updated successfully, but these errors were encountered: