Geographic Information Retrieval Using Wikipedia Articles

Amir Krause, Sara Cohen

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Assigning semantically relevant, real-world locations to documents opens new possibilities to perform geographic information retrieval. We propose a novel approach to automatically determine the latitude-longitude coordinates of appropriate Wikipedia articles with high accuracy, leveraging both text and metadata in the corpus. By examining articles whose base-truth coordinates are known, we show that our method attains a substantial improvement over state of the art works. We subsequently demonstrate how our approach could yield two benefits: (1) detecting significant geolocation errors in Wikipedia; and (2) proposing approximated coordinates for hundreds of thousands of articles which are not traditionally considered to be locations (such as events, ideas or people), opening new possibilities for conceptual geographic retrievals over Wikipedia.

Original languageAmerican English
Title of host publicationACM Web Conference 2023 - Proceedings of the World Wide Web Conference, WWW 2023
PublisherAssociation for Computing Machinery, Inc
Pages3331-3341
Number of pages11
ISBN (Electronic)9781450394161
DOIs
StatePublished - 30 Apr 2023
Event2023 World Wide Web Conference, WWW 2023 - Austin, United States
Duration: 30 Apr 20234 May 2023

Publication series

NameACM Web Conference 2023 - Proceedings of the World Wide Web Conference, WWW 2023

Conference

Conference2023 World Wide Web Conference, WWW 2023
Country/TerritoryUnited States
CityAustin
Period30/04/234/05/23

Bibliographical note

Publisher Copyright:
© 2023 Owner/Author.

Keywords

  • Geographic Information Retrieval
  • Geolocation
  • Wikipedia

Fingerprint

Dive into the research topics of 'Geographic Information Retrieval Using Wikipedia Articles'. Together they form a unique fingerprint.

Cite this