Opened 12 years ago

Last modified 12 years ago

#10393 closed enhancement

LRM character on URLs (at Version 2)

Reported by: jgpacker
Owned by: team
Priority: normal
Milestone: 14.09
Component: Core validator
Version:
Keywords: unicode lrm character url email
Cc:

Description (last modified by jgpacker)

If possible, JOSM should automatically remove the LRM character (the left-to-right mark, U+200E, an invisible Unicode character used alongside right-to-left scripts) from the values of certain keys.

This character can be accidentally introduced by mappers who copy and paste information from other sources.

Usually this character is harmless, but it can cause problems in tags like website=*, contact:website=* and similar ones, because it can corrupt the address.
Examples of such tags can be seen here.
I recently removed this character from around 1200 URLs across the globe, yet the count is growing again (around 60 right now).
Fortunately, this character seems to appear only at the end (and maybe at the beginning) of a URL.
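
Since the character seems to show up only at the edges of a URL, the cleanup could be as simple as trimming it from both ends of the value. A minimal sketch in plain Java follows. `LrmTrimmer` and `trimLrm` are hypothetical names used only for illustration; they are not existing JOSM classes or methods.

```java
// Hypothetical helper, not part of JOSM.
final class LrmTrimmer {
    // LEFT-TO-RIGHT MARK, code point U+200E.
    private static final char LRM = '\u200E';

    private LrmTrimmer() {
    }

    // Returns the value without any LRM characters at its start or end.
    // LRM characters in the middle of the value are left untouched.
    static String trimLrm(String value) {
        int start = 0;
        int end = value.length();
        while (start < end && value.charAt(start) == LRM) {
            start++;
        }
        while (end > start && value.charAt(end - 1) == LRM) {
            end--;
        }
        return value.substring(start, end);
    }
}
```

Trimming only the edges matches what was observed in the data so far and avoids touching any LRM that might be intentional inside free-text values.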

I opened a similar issue for the iD editor: https://github.com/openstreetmap/iD/issues/2323

Alternatively, this job could be delegated to the JOSM Validator.
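
As a rough idea of what a validator-style check might look like, the sketch below reports the keys whose values contain the LRM character. It is plain Java and does not use the real JOSM validator API. The class name `LrmTagCheck`, the method `findAffectedKeys`, and the list of checked keys are all assumptions, the latter loosely based on this ticket's examples and keywords.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical check, not the JOSM validator API.
final class LrmTagCheck {
    // LEFT-TO-RIGHT MARK, code point U+200E.
    private static final char LRM = '\u200E';

    // Assumed set of address-like keys; the real list would need discussion.
    private static final Set<String> CHECKED_KEYS =
            Set.of("website", "contact:website", "url", "email", "contact:email");

    private LrmTagCheck() {
    }

    // Returns the checked keys whose value contains at least one LRM character.
    static List<String> findAffectedKeys(Map<String, String> tags) {
        List<String> affected = new ArrayList<>();
        for (Map.Entry<String, String> tag : tags.entrySet()) {
            if (CHECKED_KEYS.contains(tag.getKey())
                    && tag.getValue().indexOf(LRM) >= 0) {
                affected.add(tag.getKey());
            }
        }
        return affected;
    }
}
```

A check like this would only warn the mapper; whether the fix should then be applied automatically is a separate decision.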

Change History (2)

comment:2 by jgpacker, 12 years ago

Description: modified (diff)