Investigating the quality of reverse geocoding services using text similarity techniques and logistic regression analysis


Kılıç B. , Gülgen F.

Cartography and Geographic Information Science, cilt.47, ss.336-349, 2020 (SSCI İndekslerine Giren Dergi) identifier identifier

  • Cilt numarası: 47 Konu: 4
  • Basım Tarihi: 2020
  • Doi Numarası: 10.1080/15230406.2020.1746198
  • Dergi Adı: Cartography and Geographic Information Science
  • Sayfa Sayıları: ss.336-349

Özet

Location, usually defined by postal address information or geographic coordinate values, is one of the leading themes in geography. Famous global mapping services such as ArcGIS Online, Bing Maps, Google Maps, or Yandex Maps can provide users with address information of any geographic coordinates using reverse geocoding. The accuracy of retrieved addresses is quite essential for a service user. Several researchers have evaluated the accuracy of the process based on the positional errors between the retrieved and actual addresses. This article proposes a different assessment based on text similarity algorithms. In this study, the authors examine the outcomes of 15 different text similarity algorithms by comparing them with the reference data. They benefit from the binary logistic regression to evaluate the results. At the end of the case study, they conclude that the soft-term frequency/inverse document frequency algorithm is the most appropriate to measure the quality of postal addresses of all tested services. The Jaccard algorithm also produces successful results only for Google and Bing Maps services. Moreover, the study allows the reader to assess the results of reverse geocoding derived from the global map platforms that serve in the test region.