The famous o3 "GeoGuessr" prompt did not work
A benchmark of 200 images found that the elaborate "GeoGuessr" prompt did not improve the geolocation accuracy of OpenAI's o3 over a basic prompt; it performed slightly worse. The author warns against overestimating prompt engineering on the strength of anecdotal successes, and notes that o3's geolocation skill has not carried over to newer GPT models.
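
To make a claim like "slightly worse" checkable rather than anecdotal, one option is to score both prompts on the same images and look at the paired difference. The sketch below is illustrative only: the summary does not say how the benchmark scored guesses, so the great-circle distance metric, the bootstrap interval, and the names `haversine_km` and `paired_bootstrap_ci` are all assumptions rather than the author's method.

```python
import math
import random


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two (lat, lon) points in degrees."""
    r = 6371.0  # mean Earth radius in km
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    return 2 * r * math.asin(math.sqrt(min(1.0, a)))


def paired_bootstrap_ci(errors_a, errors_b, n_resamples=10_000, seed=0):
    """95% bootstrap interval for the mean per-image difference (a minus b).

    errors_a and errors_b are hypothetical per-image errors in km for two
    prompts, listed in the same image order.
    """
    if len(errors_a) != len(errors_b) or not errors_a:
        raise ValueError("need two non-empty lists of equal length")
    rng = random.Random(seed)
    diffs = [a - b for a, b in zip(errors_a, errors_b)]
    n = len(diffs)
    means = []
    for _ in range(n_resamples):
        # Resample images with replacement, keeping each pair together.
        sample = [diffs[rng.randrange(n)] for _ in range(n)]
        means.append(sum(sample) / n)
    means.sort()
    return means[int(0.025 * n_resamples)], means[int(0.975 * n_resamples) - 1]
```

If the resulting interval straddles zero, the data do not distinguish the two prompts; an interval entirely above zero would suggest the first prompt's errors are larger.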