Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question: how to adapt this for CTC loss #9

Open
pfeatherstone opened this issue Aug 8, 2023 · 2 comments
Open

Question: how to adapt this for CTC loss #9

pfeatherstone opened this issue Aug 8, 2023 · 2 comments

Comments

@pfeatherstone
Copy link

@lucidrains Do you have any advice on how to adapt RecurrentMemoryTransformerWrapper such that it works with CTC ?

@pfeatherstone
Copy link
Author

In the memory replay backpropagation algorithm, the labels are partitioned in the same way as the logits. The loss is evaluated per block. For CTC that doesn't make sense since labels are not necessarily aligned.... So does memory replay in its current form even apply to CTC?? Any help is gratefully received.

@pfeatherstone
Copy link
Author

@lucidrains Or if we forget CTC, can you think of a way to make this work with unaligned targets ?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant