Opened 18 years ago
Closed 12 years ago
#518 closed defect (fixed)
[PATCH] Unicode normalization
| Reported by: | Owned by: | framm | |
|---|---|---|---|
| Priority: | minor | Milestone: | 14.01 |
| Component: | Core | Version: | latest |
| Keywords: | Cc: |
Description (last modified by )
JOSM should normalize strings at input. They should also be normalized when searching.
For example inputing name="Rue de l'École" should end up the same as name="Rue de l'École". The first has "É" as U+0045 LATIN CAPITAL LETTER E + U+0301 COMBINING ACUTE ACCENT while the second has "É" U+00C9 LATIN CAPITAL LETTER E WITH ACUTE.
Searching for one should match the other.
See http://unicode.org/faq/normalization.html for more info.
java.text.Normalizer.normalize(string, java.text.Normalizer.Form.NFC) can be used when required.
NFC is probably better because it's better supported than NFD due to legacy.
Attachments (1)
Change History (5)
by , 18 years ago
| Attachment: | josm-normalization.patch added |
|---|
comment:1 by , 17 years ago
| Summary: | Unicode normalizatin → [PATCH] Unicode normalization |
|---|
comment:3 by , 12 years ago
| Description: | modified (diff) |
|---|---|
| Milestone: | → 14.01 |
| Priority: | trivial → minor |
| Resolution: | fixed |
| Status: | closed → reopened |



normalizing strings before comparison in SearchCompiler, and value in PropertiesDialog