Modify

Ticket #518 (closed defect: fixed)

Opened 4 years ago

Last modified 3 years ago

[PATCH] Unicode normalization

Reported by: moyogo@… Owned by: framm
Priority: trivial Component: Core
Version: latest Keywords:
Cc:

Description

JOSM should normalize strings at input. They should also be normalized when searching.

For example inputing name="Rue de l'École" should end up the same as name="Rue de l'École". The first has "É" as U+0045 LATIN CAPITAL LETTER E + U+0301 COMBINING ACUTE ACCENT while the second has "É" U+00C9 LATIN CAPITAL LETTER E WITH ACUTE. Searching for one should match the other.

See  http://unicode.org/faq/normalization.html for more info.

java.text.Normalizer.normalize(string, java.text.Normalizer.Form.NFC) can be used when required. NFC is probably better because it's better supported than NFD due to legacy.

Attachments

josm-normalization.patch Download (2.1 KB) - added by moyogo@… 4 years ago.
normalizing strings before comparison in SearchCompiler, and value in PropertiesDialog

Change History

Changed 4 years ago by moyogo@…

normalizing strings before comparison in SearchCompiler, and value in PropertiesDialog

comment:1 Changed 3 years ago by stoecker

  • Summary changed from Unicode normalizatin to [PATCH] Unicode normalization

comment:2 Changed 3 years ago by stoecker

  • Status changed from new to closed
  • Resolution set to fixed

Fixed in r1155.

View

Add a comment

Modify Ticket

Change Properties
<Author field>
Action
as closed
as The resolution will be set. Next status will be 'closed'
The resolution will be deleted. Next status will be 'reopened'
Author


E-mail address and user name can be saved in the Preferences.

 
Note: See TracTickets for help on using tickets.