Page MenuHomeBartTerpstraJun 10 2023, 7:08 PM
Tags
Referenced Files
F38220139: Screenshot 2023-10-13 at 10.32.45 2.png
Oct 13 2023, 8:39 AM
File Not Attached
F38220133: Screenshot 2023-10-13 at 10.28.58 2.png
Oct 13 2023, 8:37 AM
F38220130: Screenshot 2023-10-13 at 10.26.37 2.png
Oct 13 2023, 8:37 AM
F37690115: image.png
Sep 11 2023, 11:30 AM
F37690110: image.png
Sep 11 2023, 11:30 AM
Subscribers

Description

Steps to replicate the issue (include links if applicable):

  • try to translate a page that has an american courtcase into dutch (e.g. critical race theory)
  • notice that a link like Brown v. Board of Education splits into two parts. "Brown v." and "Board of education"
  • correcting this error is possible, but a hassle

What happens?:
a single link is split into two links

What should have happened instead?:
a single link remains a single link and it links to it's equivelant in the target language.

Software version (skip for WMF-hosted wikis like Wikipedia):

Other information (browser name/version, screenshots, etc.):
Firefox 113.0.1 (64-bit)
"Hudson v.Leake County School Board -zaak" shows it too

Details

SubjectRepoBranchLines +/-

I can confirm the bug in sentence splitting for highlighting

And when translated the link for Brown v. Board of Education becomes two links, in two sentences.

This is a bug in cxserver's sentence splitting algorithm. We are working on a mode advanced sentence splitting algorithm for MinT here

Change 961080 had a related patch set uploaded (by Santhosh; author: Santhosh):

[mediawiki/services/cxserver@master] Use sentencex library for sentence segmentation

Change 961080 merged by jenkins-bot:

[mediawiki/services/cxserver@master] Use sentencex library for sentence segmentation

Change 961979 had a related patch set uploaded (by KartikMistry; author: KartikMistry):

[operations/deployment-charts@master] Update cxserver to 2023-09-28-043003-production

Change 961979 merged by jenkins-bot:

[operations/deployment-charts@master] Update cxserver to 2023-09-28-043003-production

quick link). The previous issues are no longer happening:

The whole sentence is highlighted:

The link to "Brown v. Board of Education" is kept as a single link (not split into two):

The above example is using Google Translate. With MinT, the link is not added to the translation, which is an issue that we may be part of a separate ticket: T348612: References moved to the end of the sentence and links disappear when translated with MinT

@BartTerpstra, the specific issues reported are fixed. I'm marking this as resolved, but feel free to reopen if new issues appear. Thanks for your feedback!