Steps to replicate the issue (include links if applicable):
- try to translate a page that has an american courtcase into dutch (e.g. critical race theory)
- notice that a link like Brown v. Board of Education splits into two parts. "Brown v." and "Board of education"
- correcting this error is possible, but a hassle
What happens?:
a single link is split into two links
What should have happened instead?:
a single link remains a single link and it links to it's equivelant in the target language.
Software version (skip for WMF-hosted wikis like Wikipedia):
Other information (browser name/version, screenshots, etc.):
Firefox 113.0.1 (64-bit)
"Hudson v.Leake County School Board -zaak" shows it too
DetailsSubject Repo Branch Lines +/- Use sentencex library for sentence segmentation Pginer-WMF moved this task from Quarter Backlog to Priority: Translation on the Language-Team (Language-2023-July-September) board.Jun 30 2023, 11:38 AMComment ActionsI can confirm the bug in sentence splitting for highlighting
And when translated the link for Brown v. Board of Education becomes two links, in two sentences.
This is a bug in cxserver's sentence splitting algorithm. We are working on a mode advanced sentence splitting algorithm for MinT here
Comment ActionsChange 961080 had a related patch set uploaded (by Santhosh; author: Santhosh):
[mediawiki/services/cxserver@master] Use sentencex library for sentence segmentation
Comment ActionsChange 961080 merged by jenkins-bot:
[mediawiki/services/cxserver@master] Use sentencex library for sentence segmentation
Comment ActionsChange 961979 had a related patch set uploaded (by KartikMistry; author: KartikMistry):
[operations/deployment-charts@master] Update cxserver to 2023-09-28-043003-production
Comment ActionsChange 961979 merged by jenkins-bot:
[operations/deployment-charts@master] Update cxserver to 2023-09-28-043003-production
quick link). The previous issues are no longer happening:
The whole sentence is highlighted:
The link to "Brown v. Board of Education" is kept as a single link (not split into two):
The above example is using Google Translate. With MinT, the link is not added to the translation, which is an issue that we may be part of a separate ticket: T348612: References moved to the end of the sentence and links disappear when translated with MinT
Comment Actions@BartTerpstra, the specific issues reported are fixed. I'm marking this as resolved, but feel free to reopen if new issues appear. Thanks for your feedback!
Subject | Repo | Branch | Lines +/- | |
---|---|---|---|---|
Use sentencex library for sentence segmentation | Pginer-WMF moved this task from Quarter Backlog to Priority: Translation on the Language-Team (Language-2023-July-September) board.Jun 30 2023, 11:38 AM Comment Actions I can confirm the bug in sentence splitting for highlighting And when translated the link for Brown v. Board of Education becomes two links, in two sentences. This is a bug in cxserver's sentence splitting algorithm. We are working on a mode advanced sentence splitting algorithm for MinT here Comment Actions Change 961080 had a related patch set uploaded (by Santhosh; author: Santhosh): [mediawiki/services/cxserver@master] Use sentencex library for sentence segmentation Comment Actions Change 961080 merged by jenkins-bot: [mediawiki/services/cxserver@master] Use sentencex library for sentence segmentation Comment Actions Change 961979 had a related patch set uploaded (by KartikMistry; author: KartikMistry): [operations/deployment-charts@master] Update cxserver to 2023-09-28-043003-production Comment Actions Change 961979 merged by jenkins-bot: [operations/deployment-charts@master] Update cxserver to 2023-09-28-043003-production quick link). The previous issues are no longer happening:
The whole sentence is highlighted: The link to "Brown v. Board of Education" is kept as a single link (not split into two): The above example is using Google Translate. With MinT, the link is not added to the translation, which is an issue that we may be part of a separate ticket: T348612: References moved to the end of the sentence and links disappear when translated with MinT Comment Actions@BartTerpstra, the specific issues reported are fixed. I'm marking this as resolved, but feel free to reopen if new issues appear. Thanks for your feedback! |