Opened 18 years ago

Last modified 12 years ago

#518 closed defect

Unicode normalizatin — at Initial Version

Reported by: moyogo@… Owned by: framm
Priority: minor Milestone: 14.01
Component: Core Version: latest
Keywords: Cc:

Description

JOSM should normalize strings at input. They should also be normalized when searching.

For example inputing name="Rue de l'École" should end up the same as name="Rue de l'École". The first has "É" as U+0045 LATIN CAPITAL LETTER E + U+0301 COMBINING ACUTE ACCENT while the second has "É" U+00C9 LATIN CAPITAL LETTER E WITH ACUTE.
Searching for one should match the other.

See http://unicode.org/faq/normalization.html for more info.

java.text.Normalizer.normalize(string, java.text.Normalizer.Form.NFC) can be used when required.
NFC is probably better because it's better supported than NFD due to legacy.

Change History (1)

by moyogo@…, 18 years ago

Attachment: josm-normalization.patch added

normalizing strings before comparison in SearchCompiler, and value in PropertiesDialog

Note: See TracTickets for help on using tickets.