-
Notifications
You must be signed in to change notification settings - Fork 1
/
Copy pathargleton.html
226 lines (193 loc) · 9.82 KB
/
argleton.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
<!doctype html>
<html lang="en">
<head>
<meta charset="utf-8">
<title>Kevin Boone: The Argleton affair, and the Wikification of the World-Wide Web</title>
<link rel="shortcut icon" href="https://kevinboone.me/img/favicon.ico">
<meta name="msvalidate.01" content="894212EEB3A89CC8B4E92780079B68E9"/>
<meta name="google-site-verification" content="DXS4cMAJ8VKUgK84_-dl0J1hJK9HQdYU4HtimSr_zLE" />
<meta name="description" content="%%DESC%%">
<meta name="author" content="Kevin Boone">
<meta name="viewport" content="width=device-width; initial-scale=1; maximum-scale=1">
<link rel="stylesheet" href="css/main.css">
</head>
<body>
<div id="myname">
Kevin Boone
</div>
<div id="menu">
<a class="menu_entry" href="index.html">Home</a>
<a class="menu_entry" href="contact.html">Contact</a>
<a class="menu_entry" href="cv.html">CV</a>
<a class="menu_entry" href="software.html">Software</a>
<a class="menu_entry" href="articles.html">Articles</a>
<form id="search_form" method="get" action="https://duckduckgo.com/" target="_blank"><input type="text" name="q" placeholder="Search" size="5" id="search_input" /><button type="submit" id="search_submit">🔍</button><input type="hidden" name="sites" value="kevinboone.me" /><input type="hidden" name="kn" value="1" /></form>
</div>
<div id="content">
<h1>The Argleton affair, and the Wikification of the World-Wide Web</h1>
This article discusses the implications that the 'Argleton affair' --
the widespread dissemination of information about a wholly fictitious
English village --
might have for how we view the World-Web Web as a source of information.
<p></p>
Comments are welcome, as always.
<br clear="all"/>
<h2>The prime of Argleton</h2>
The image to the left shows how and where Argleton appeared
on Google Maps in its heyday, back in 2009.
<img src="img/argleton0.png" class="article-image"/>
The 'village' of Argleton enjoyed a brief notoriety, mostly on the Web,
although some people did make the trek to the outskirts of Ormskirk in
Lancashire, to see what was, and is, a field. Local businesses stepped
up to meet the new interest from tourists, offering 'Argleton Ale' and 'Argleton Pie', and so on. It bears repeating that the village Argleton
does not exist, and never has existed.
<p></p>
The origins of Argleton remain hazy. At the time, Google blamed the
map provider Tele-Atlas for the error. There were rumours that Argleton
appeared on in-car sat-nav systems that used data from Tele-Atlas.
If that were the case, then that would get Google off the hook, although
I've not seen any hard evidence that anything but Google Maps ever showed
Argleton. It's possible -- but unlikely -- that Argleton was introduced
for the purposes of detecting unlicensed copying of the Tele-Atlas data.
Mapping companies do engage in this sort of subterfuge from time to
time, but usually only at the level of streets, not whole settlements.
More likely, the map provider used scans of paper maps, and Argleton
was a mis-reading of the nearby town of Aughton. Its easy to see how
an optical character recognition system could make such a mistake.
<p></p>
Whatever its origins, Argleton soon disappeared from Google Maps, and
faded back into obscurity.
<p></p>
Or did it?
<br clear="all"/>
<h2>Argleton's continued existence</h2>
<img src="img/argleton1.png" class="article-image" width="400"/>
Nearly five years after it disappeared from
Google Maps, Argleton continued to exert a ghostly presence
on the World-Wide Web. You could, for example, look for accommodation in
Argleton.
Here's what Travel Republic had to offer: 198 hotels and apartments in
the area. Sadly, none is actually in Argleton itself.
<p></p>
<br clear="all"/>
<p></p>
<img src="img/argleton2.png" class="article-image" width="400"/>
If you were feeling energetic, you could even have gone for
a walk or a run around
Argleton. In fact, there is a public footpath that approaches within
a hundred yards or so of the putative location of the village, so
this might be the best way to see it.
<p></p>
<br clear="all"/>
<p></p>
<img src="img/argleton3.png" class="article-image" width="400"/>
The list of Argleton sightings goes on on; but what knocked me back slightly is that the British Ordnance Survey Web site
listed Argleton as a place for nearly ten years. It was not in
precisely the same place as Google
Maps had it, but it's probably close enough to be accounted for by the
conversion between map datums.
<p></p>
Just to be clear -- that's the Ordnance Survey, right?
Picture generations of
bedraggled individuals in long socks stomping, red-face and glassy-eyed,
across our green and pleasant land, with a sandwich and a Thermos, and a
map in a transparent plastic case. The organization that millions of
hardy treckers have trusted their lives, or at least their holidays,
to. <i>That</i> Ordnance Survey.
<br clear="all"/>
<h2>What's going on here?</h2>
To be fair, no Web site that I could find is still displaying Argleton
as a place name on its map in 2020, but Argleton was a distinct
presence on the web for at least ten years.
<p></p>
Given the scale of Google's vast data-mining operation, nobody is very
surprised when there are occasional mistakes in Google Maps.
For example, not very far from
my village Google displays the location of a swimming school. It is in the
middle of a suburban street; there might be room for a kids' paddling pool
in the back yard, but I doubt you could learn to swim there -- not even
after all the rain we've had this year. What's happened, I think, is that
Google has scraped the address, either from another Web site, or from
another business directory of some sort. Then it's located the address
on its map, along with the school name. The address is the correspondence
address of the school's administrator; the swimming school itself
is located, unsurprisingly, in a sports centre a couple of miles away.
<p></p>
This is an understandable mistake -- it's not Google's fault that the
swimming school's administrator gave a personal address for correspondence.
However, it does demonstrate how error-prone is the strategy of building a
map by large-scale data mining. There is no shortage of similar examples,
although it's been a while since we've seen one on the scale of Google Maps'
locating Gloucester as a suburb of Tewksbury back in 2010.
<p></p>
None of this explains the continued existence of Argleton. Whatever we
may think of Google's data gathering strategy, dubious mapping information
from the Ordnance Survey is a problem of a different order. In the UK, OS maps are
regarded with almost religious reverence. To be clear, it's only the
on-line OS site that showed Argleton; there's no evidence that the printed
maps ever did. Still, this dubious data is coming from <i>somewhere</i>.
Tele-Atlas and Google may have expunged Argleton, and yet its
geographic location clearly continued to pollute the place-name databases
that other Web sites use.
<p></p>
It's reasonably clear that Argleton does not exist in the OS's own place-names
database. Recently the OS made this data available to download
free-of-charge, and it goes down to street level. Among the 800,000-or
so entries in the OS place-name database, there are 90 for the
modest village I
live in, and none whatsoever for Argleton. So where does the OS get
the coordinates of a non-existent place from?
<br clear="all"/>
<h2>Why is this a problem?</h2>
The Argleton affair shows two things, I believe.
<p></p>
First, it shows how thoroughly authoritative data is now polluted by
non-authoritative
data, to the extent that it's difficult to know what information is reasonably
trustworthy. Of course, we always had to be careful about how much
trust we put in Web sites (including this one, naturally). However,
on-line information is now so diffuse that it's difficult
to trace the origins of any specific piece of information.
Consequently, it's difficult even for
legitimate, reliable site operators to offer any guarantee of the
authenticity of their information.
<p></p>
At
one time we could decide whether to trust on-line content on the same basis
that we decided whether to trust print content: the reputation of the
provider. However, the
OS Web site presents the location of Argleton with exactly the same
conviction, no more and no less, than it does of Newport Pagnell, say,
or Southampton. There's no warning that the information comes from a less
reliable source than the Survey's own information-gathering; but clearly
it does.
<p></p>
Second, we see just how difficult it is to
remove known-bad data from the World-Wide Web once it has become
entrenched. I don't know if Argleton still turns up in web searches
for hotels and walking routes, but it do so for at least ten years.
<h2>Closing remarks</h2>
The World-Wide Web is now almost certainly the World's most significant
research tool. Children learn at school -- not from their teachers, I'm
sure -- that Wikipedia is a good source of information for researching
school projects. Rather than resisting this trend -- which is probably
irresistible -- we should be teaching our children how to assess the
reliability of the various information sources they encounter, both
in print and on-line. To be able to do this is a crucial life skill.
<p></p>
And yet, and yet... What Argleton shows -- and the hundreds of similar
Argleton-like episodes show -- is that the whole Web is in danger of
becoming one vast Wikipedia. When this happens, every piece of information
we encounter will have to be assessed on its own merits, with no context.
<p></p>
Does any of us have the skill, or the time, to do this?
<p><span class="footer-clearance-para"/></p>
</div>
<div id="footer">
<a href="rss.html"><img src="img/rss.png" width="24px" height="24px"/></a>
Categories: <a href="general_computing-groupindex.html">general computing</a>
<span class="last-updated">Last update Jul 07 2020
</span>
</div>
</body>
</html>