-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Sentences being joined that shouldn't be #3
Comments
Tagging @kirefu who isn't able to be assigned to do initial investigation. |
The problem lies with bleualign_cpp, although not sure where, as I don't speak C++. I'm trying to find the bug, but could be quicker if @lpla also took a look To run an example of the problem on valhalla: cd /fs/meili0/faheem/postprocess/sv-en |
In ec8f4f7 I've updated bleualign to put a space between consecutive sentences found by the gap filler. However, we still have a policy question: should consecutive sentences found by the gap filler go on to one line‽ |
Faheem reports this was in se-en: [Name of person]
Umu.se makes use of cookies to improve the user experience.By continuing to use the website you agree to the usage of cookies.
There's no space
experience.By
but a space appears in the source document, including the WARC we crawled from umu.seThe text was updated successfully, but these errors were encountered: