Opened 16 years ago

Last modified 11 years ago

#3733 closed defect

SimilarNamedWays naïvely uses Levenshtein distance and marks a lot of false positives — at Initial Version

Reported by: avarab@…
Owned by: team
Priority: normal
Milestone: 14.12
Component: Core validator
Version: latest
Keywords: similar name
Cc: AM909, mdk

Description

The SimilarNamedWays test relies solely on Levenshtein distance to decide whether two ways have similar names. This produces a lot of false positives for the Iceland data (and presumably for other locations). In Iceland it's common for ways in the same suburb to share a prefix or suffix, so distinct, correctly spelled names can differ by only a couple of characters. For example, each consecutive pair below gets flagged:

  • Fagraholt
  • Hafraholt
  • Hlíðarberg
  • Hlíðartorg
  • Hjallabraut
  • Hjallahraun
  • Nóatún
  • Sóltún
  • Austurvegur
  • Vesturvegur
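To illustrate why these pairs trip a purely distance-based check, here is a minimal Python sketch that computes the plain Levenshtein distance for each pair above. It is not the validator's actual code: the `levenshtein` helper and the `PAIRS` list are written here only for illustration, and the threshold the SimilarNamedWays test applies is not stated in this ticket.

```python
def levenshtein(a: str, b: str) -> int:
    """Textbook dynamic-programming edit distance (insert, delete, substitute)."""
    prev = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


# Name pairs taken from the list in this ticket.
PAIRS = [
    ("Fagraholt", "Hafraholt"),
    ("Hlíðarberg", "Hlíðartorg"),
    ("Hjallabraut", "Hjallahraun"),
    ("Nóatún", "Sóltún"),
    ("Austurvegur", "Vesturvegur"),
]

distances = {pair: levenshtein(*pair) for pair in PAIRS}

for (first, second), dist in distances.items():
    print(f"{first} / {second}: {dist}")
```

Every pair comes out at distance 2, even though the names are unrelated streets rather than misspellings of each other, so any check that only looks at a small edit distance cannot tell them apart from typos.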
