Opened 18 years ago
Last modified 12 years ago
#518 closed defect
Unicode normalizatin — at Initial Version
| Reported by: | Owned by: | framm | |
|---|---|---|---|
| Priority: | minor | Milestone: | 14.01 |
| Component: | Core | Version: | latest |
| Keywords: | Cc: |
Description
JOSM should normalize strings at input. They should also be normalized when searching.
For example inputing name="Rue de l'École" should end up the same as name="Rue de l'École". The first has "É" as U+0045 LATIN CAPITAL LETTER E + U+0301 COMBINING ACUTE ACCENT while the second has "É" U+00C9 LATIN CAPITAL LETTER E WITH ACUTE.
Searching for one should match the other.
See http://unicode.org/faq/normalization.html for more info.
java.text.Normalizer.normalize(string, java.text.Normalizer.Form.NFC) can be used when required.
NFC is probably better because it's better supported than NFD due to legacy.



normalizing strings before comparison in SearchCompiler, and value in PropertiesDialog