I think this issue refers to using the EncT5 approach on Pile-T5 (except for the large size) as a baseline.
Actually, go ahead and eval Pile-T5 with the EncT5 treatment, as that's likely to be the best enc-dec around (except for the large size, where we'd use T5-1.1, since there seems to have been an issue with Pile-T5 for that size).
Hey @raphaelsty! Yes, @NohTow is correct: this is about setting up baselines using only the encoder plus the EncT5 recipe from the paper above (GitHub repo) for:
PileT5-base
T5-1.1-large (there's an unknown issue with PileT5-large where it performs surprisingly weakly)
PileT5-XL
These will serve as a very strong comparison point for our own base/large/XL variants.
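For reference, the size-to-checkpoint plan above could be sketched roughly as follows. This is only a sketch under assumptions: the Hub IDs, the `BASELINES` table, and the `baseline_checkpoint` / `load_encoder` helpers are hypothetical names not taken from this thread, and it assumes the checkpoints load through `T5EncoderModel` from `transformers`. The EncT5 pooling head itself is not shown.

```python
# Hypothetical baseline table. The Hub IDs are assumptions, not confirmed in this thread.
BASELINES = {
    "base": "EleutherAI/pile-t5-base",
    "large": "google/t5-v1_1-large",  # T5-1.1 stands in for Pile-T5 at this size
    "xl": "EleutherAI/pile-t5-xl",
}


def baseline_checkpoint(size: str) -> str:
    """Return the assumed checkpoint ID for a baseline size (case-insensitive)."""
    key = size.lower()
    if key not in BASELINES:
        raise ValueError(f"unknown size {size!r}; expected one of {sorted(BASELINES)}")
    return BASELINES[key]


def load_encoder(size: str):
    """Load only the encoder stack of the chosen baseline.

    The import is done lazily so the table above can be used without
    transformers installed. Calling this downloads the checkpoint weights.
    """
    from transformers import T5EncoderModel

    return T5EncoderModel.from_pretrained(baseline_checkpoint(size))
```

Something like `load_encoder("large")` would then give the encoder to which the EncT5-style head gets attached before fine-tuning.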