SHMOSM056 - Annotation Process in OSM
SHMOSM056 - Annotation Process in OSM
Research Article
Abstract
In this article we describe the analysis of 25,000 objects from the OpenStreetMap
(OSM) databases of Ireland, United Kingdom, Germany, and Austria. The objects
are selected as exhibiting the characteristics of “heavily edited” objects. We consider
“heavily edited” objects as having 15 or more versions over the object’s lifetime. Our
results indicate that there are some serious issues arising from the way contributors
tag or annotate objects in OSM. Values assigned to the “name” and “highway”
attributes are often subject to frequent and unexpected change. However, this “tag
flip-flopping” is not found to be strongly correlated with increasing numbers of
contributors. We also show problems with usage of the OSM ontology/controlled
vocabularly. The majority of errors occurring were caused by contributors choosing
values from the ontology “by hand” and spelling these values incorrectly. These
issues could have a potentially detrimental effect on the quality of OSM data while
at the same time damaging the perception of OSM in the GIS community. The
current state of tagging and annotation in OSM is not perfect. We feel that the
problems identified are a combination of the flexibility of the tagging process in
OSM and the lack of a strict mechanism for checking adherence to the OSM
ontology for specific core attributes. More studies related to comparing the names of
features in OSM to recognized ground-truth datasets are required.
1 Introduction
Address for correspondence: Peter Mooney, Department of Computer Science, National University
of Ireland Maynooth, Maynooth,Co. Kildare, Ireland. E-mail: [email protected]
analysis of the history of the evolution of objects one can see how objects have changed
(in response to edits) over time. OSM is a very large (and growing) spatial dataset and
probably the most well known example of VGI. We analyse the annotation of 25,000
spatial objects or features in the OSM databases of Ireland (653), United Kingdom
(10,040), Germany (10,604), and Austria (3,367). Extracting the history of objects in
OSM is currently a difficult and time-consuming process (Mooney and Corcoran 2011a,
b). We specifically selected objects which are “heavily edited”, that is objects with 15 or
more versions of edits. We set V as the lower threshold of versions at V ⱖ 15. These
objects are more likely to exhibit collaborative editing where multiple OSM contributors
edit and annotate these objects. We feel these objects are particularly interesting and
could eventually assist us in understanding the nature of contributions in the OSM
collaborative project model. Antin (2011) remarks that for collaborative projects (in
their case Wikipedia) focus often quickly turns to the “practical challenges of informa-
tion quality, coordination, and contributor bias” related to these open models.
The remainder of this article is organized as follows. Section 2 provides a discussion
of the literature related to the topic of contributors to VGI projects and the quality of
their contributions. In Section 3 we discuss the experimental setup for this research. It is
necessary to describe how the 25,000 objects were selected from OSM and how their
histories were compiled. The core of the article is Section 4 which outlines the results of
our analysis on the selected 25,000 OSM objects. Section 5 summarizes the key outcomes
from the article and in this section we present some issues for future work and research
on this topic.
In this section we provide an overview of related work. To better organize this overview,
we have divided related work into literature-based related work (section 2.1) and then
web-based related work (section 2.2). Web-based related work is for research which may
not have peer-reviewed literature related to it.
and maintained, containing information on: types of changes and edits, methods of survey
and collection, and finally a fitness for purpose statement.
Many papers in the literature report very positive experiences and results for OSM.
Haklay (2010) describes a comparison of the road network in OSM for England with the
road network in the Ordnance Survey UK Meridian dataset. The conclusion of Haklay’s
study is that OSM is “as good if not better than the Meridian dataset in terms of
positional accuracy”. Haklay remarks that “completeness is very good for major urban
centres and draws the conclusion that if mapping applications want to use OSM data for
these locations, it is as good a choice as any other source of spatial data. A similar study
by Zielstra and Zipf (2010 a, b) of OSM and TeleAtlas for Germany shows that for larger
cities (Berlin, Frankfurt, Munich, etc.) the OSM spatial data “is so rich that OSM is now
replacing proprietary data for many projects’ ”. Ludwig et al. (2011) compare Navtec
and OSM street and road networks for Germany and conclude that between the two
datasets there “are considerable qualitative differences between regions, towns, and
street categories” but at a national level the “relative completeness of OSM objects is
high enough for maps and cartographic production.” Zielstra and Hochmair (2011)
compared OSM, TIGER, NAVTEQ NAVSTREETS and Tele Atlas Multinet street data
for the state of Florida, USA in a related study to Zielstra and Zipf (2010a, b) and found
“OSM strong heterogeneity of OpenStreetMap data for the US in terms of its complete-
ness’”. Ather (2009) comments that as OSM grows, most regions (the UK in their study)
will eventually fulfil the levels of map quality required for other GIS applications. He
goes on to comment that “it would be useful if long-term measures were in place to
provide continued assessments of OSM map quality and then communicate these results
back to users as they browse through the map.” In Over et al. (2010) the authors
comment that the quality control of OSM differs fundamentally from professionally
edited maps. The community-based approach allows anyone to upload and alter the
spatial data. But due to the huge number of editors, errors and conflicts are usually
quickly resolved. They state that “OSM has probably the most up-to-date map data.” In
urban areas, changes in the road network appear in the OSM dataset long before
appearing in other map data providers. Haklay et al. (2010) investigates the relationship
between the number of contributors to OSM and data accuracy against a ground-truth
dataset. Haklay et al. (2010) conclude that, beyond 15 contributors per square kilome-
tre, the positional accuracy of OSM becomes very good below 6 m. At the other end of
the scale, the first five contributors to an area seem to provide the biggest contribution in
terms of positional accuracy improvement. Girres and Touya (2010), in their quality
assessment of OSM dataset for France, show that the number of OSM objects in an area
clearly grows in relation to the number of contributors in the area but under a non-linear
relationship. As is clearly shown here, OSM is a multiple representation database
containing Points-of-Interest, land cover, transportation networks, buildings, waterways
and waterbodies. There is also some literature on the nature of contributors to VGI and
OSM. Coleman et al. (2009) show the participants in the production process of VGI are
both users and producers or “prosumers”. Assessing the credibility of contributors is
important for evaluating the overall reliability of their contribution. They find many
reasons why contributors take part in VGI, including: for social rewards, to take part in
an outlet for creative and independent self-expression, pride in one’s home place,
and intellectual stimulation. Budhathoki et al. (2008) argue that motivations of the
contributors from VGI are very strong and can assist in the “distribution of the produc-
tion of GI for Spatial Data Infrastructures (SDI) among organizations, individuals, and
groups of individuals”. Then a ”hybrid SDI model that draws on the synergy between
the conceptual foundation of SDI and an extensive produser base of VGI can be
developed.”
However, there are also several negative outcomes for OSM from some of these
research studies. Welser et al. (2011) remark that there is the perception amongst many
in the scientific domain that the quality of open collaborative projects, such as Wikipedia,
“will never been sufficient as long as it relies on non-expert volunteers of unknown
identity” and this appears to be an issue for some in the GIS community regarding OSM.
Qian et al. (2009) conclude that a serious negative aspect of the VGI model is that the
underlying data is acquired by non-professionals with non-professional equipment,
meaning that there cannot be any guarantee of quality about the VGI (or OSM) data
unless it can be compared to some other source. Ballatore and Bertolotto (2011) call
OSM “spatially-rich but semantically-poor”. Smart et al. (2011) show how freely avail-
able sources of georeferenced data can be used for automated enrichment of 3D city
models. OSM is included as a key data source. They concluded that “matching the
georeferenced point locations from sources, such as OSM, to the geometry of the
buildings in the registered 3D model” posed significant problems due to accuracy and
sparse attribute problems. While Haklay’s (2010) comparison of OSM and Ordnance
Survey UK data reflects very positively on OSM, the author concludes by warning that
“there are serious issues about completeness and coverage” in the UK. Coverage is also
commented upon in the work of Zielstra and Zipf (2010a,b) who state that “while
professional data is not without its faults, the coverage of OSM in rural areas is too small
to be seriously considered a sophisticated alternative for any applications.” When one
moves away from large urban centres, the major issue for quality becomes one of
coverage – in many rural areas there is little or no OSM coverage at all. While Ludwig
et al.’s (2010) comparison of Navtec and OSM street/roads is positive from the OSM
perspective the authors conclude that “other attributes of OSM, which are needed for
other advanced GIS problems, are still relatively incomplete.” Mooney and Corcoran
(2011c) investigate the potential role VGI can play in eEnvironment and various Spatial
Data Infrastructures (SDI) on a local, regional, and national level. Specifically for OSM,
the authors conclude that while currently problems such as inconsistency of metadata
and unpredictable changes to geometries are a barrier to inclusion in SDI, the quantity of
spatial data in OSM means it has a role to play in SDI. Mooney et al. (2010a) investigate
the spatial representation of natural features in OSM. They report that there are differ-
ences in polygon structure for natural and landuse polygons based on: sampling point
density, simplification and generalisation of imported data, and inconsistency in manual
tracing from aerial imagery. Overall this highlights inconsistencies in representation of
natural features in OSM databases for different countries, regions, and contributors.
Mooney et al. (2010b) apply shape-matching techniques from pattern recognition to
compare OSM polygons (lakes) with Ordnance Survey Ireland NMA data. Their results
reveal that the shapes of these polygons in OSM compare poorly with authoritative
NMA data. The authors conclude that the quality of OSM data is not necessarily solely
restricted to a geometric comparison to some other dataset but should include other
aspects such as metadata and tagging. Girres and Touya (2010) also suggest that tagging
and annotation of objects within OSM deserves immediate attention.
The production of cartographic output from the OSM database is the most popular
use of the raw spatial data with some authors (Kessler et al. 2009, Over et al. 2010)
remarking that OSM is not considered for “serious Geomatics applications”. GIS-based
research using OSM as input data for models and testing is beginning to appear in the
literature. Wallgrün et al. (2010) use OSM for matching a sketch map to a geo-referenced
dataset. Corcoran et al. (2011) use OSM data for testing map simplification models. The
same authors use OSM data for model testing in their work on progressive transmission
of vector data (Corcoran and Mooney 2011). Jacob et al. (2010) use OSM as the source
of spatial data for routing algorithms used in the development of haptic-feedback
applications for mobile pedestrian navigation systems.
3 Experimental Setup
In this section we provide an overview of the experimental setup for the analysis presented
in Section 4. In section 3.1 we discuss how the OSM data is obtained, processed, and
prepared for analysis. In section 3.2 we discuss the process of selecting OSM objects for our
analysis. Section 3.3 discusses the characteristics of the selected objects and regions from
which they are drawn. As the OSM global database contains several million objects we feel
that it is necessary to carefully select a subset of these objects for analysis.
quality in Wikipedia articles in order to estimate the fraction of time during which articles
retain high-quality status. They select only “featured articles”. As outlined by Anderka
et al. (2011), featured articles in Wikipedia are “made” after significant editing and
revision work by many contributors and moderators. Stein and Hess (2007) argue that
“instead of just looking at the formal characteristics of featured articles one should look
at what contributors do on these pages” in order to understand the effects of multiple
contributors over an extended period of time.
nodes and tags on the feature, and contributors returning after a number of edits have
been made by other contributors. As evident from Table 1 there are very few (relatively
speaking) “heavily edited objects” in the OSM database. Yet they offer, in our opinion,
the most interesting opportunities for analysis of the collaborative aspect of OSM
editing and contribution. The equivalent object in Wikipedia, the “featured articles” is
similar in terms of occurrence. In September 2011 there were almost four million
articles in the English language Wikipedia but only 3,377 featured articles – which
roughly translates to one featured article for every 1,000 articles. In our case study
databases there are approximately 16 million objects. Just over 12 million of these
(about 75%) have only one or two versions. Consequently we felt that the choice of
V ⱖ 15 was appropriate because of the small number of objects available.
In the Ireland dataset there are 776 features with V or more versions, in the UK dataset
there are 12,804, in the Germany dataset there are 76,355, and in the Austrian dataset
there are 7,950. The total number of features with versions V ⱖ 15 is 97,885. We randomly
selected 25,000 of these features and finally 653 features were selected from the Ireland
dataset, 10,040 from the United Kingdom dataset, 10,604 from the German dataset, and
3,367 from the Austrian dataset. Each dataset is a mix of landuse, highway, amenity,
waterways, and natural features. In total the OSM-XML processing and OSM data
download from the OSM-API took 920 hours. The scripts to automatically download the
OSM-XML using the OSM-API were carefully monitored, as connection breaks occurred
frequently. We were also mindful of bandwidth limiting on the OSM servers. The scripts
were usually only run during normal working hours. The datasets were downloaded and
processed during May and June 2011. The data was stored in a PostGIS database.
For each object P the history of edits is downloaded as an OSM-XML file from the
OSM API. Suppose that the object P has n versions (n ⱖ V) where i = 0 is the first version
and i = n-1 is the final or current version. Then each version Pi of P is stored as the tuple
represented in Equation 1.
Pi = (ui, vi, Ni, τ i, ci, NSRi, G(i), A(i), L(i), Di, T (i)) (1)
where the elements of the tuple Pi are as defined as follows: ui is the user ID of the OSM
contributor who edited version vi, vi is the version of the OSM object, Ni is the number
of nodes in object Pi, ti is the timestamp for the edit, ci is the changeset that the edit was
saved in, NSRi is the number of nodes which “survived” from the previous version Pi-1
of polygon Pi, G(i) is the geometry of Pi, A(i) is the area of G(i) in hectares (only
calculated for polygons), L(i) is the length or perimeter of G(i) in meters, Di is the mean
spacing in meters between the adjacent nodes of Pi, and T(i) is the set of tags (keys,values)
assigned to this version of Pi which are stored as a comma-separated list. Finally, if
specified for each version Pi of the object P a vector data file representation is written out
to disk. There are a number of possible output formats: Esri Shapefile, KML file, or GPX
format. This allows for quick visualization within most desktop GIS software and some
web-based GIS.
4 Experimental Analysis
In this section we outline results from our experimental analysis of the 25,000 OSM
features. The analysis focuses on three aspects of annotation of these features in OSM
namely: assignment of values to attributes (or in OSM terminology values to tags or tag
Table 2 The percentage of ways in the four OpenStreetMap datasets with the specified
number of unique tags over each object’s history
keys) (section 4.1), types of contribution by the OSM volunteers on these features
(section 4.2), and use of the OSM Map Features page as a controlled vocabulary (section
4.3).
Table 3 Number of unique values assigned to the “name” tag of objects in the four OSM
databases
Table 4 Example of changes to the value of the name attribute of the road polyline
24276789 (England)
Table 5 Example of changes to the value of the name attribute of the road polyline
9779683 (Germany)
sible for the assignment of three or more values to the “name” tag of any object. For the
purpose of illustration we provide Tables 4, 5, and 6, outlining the edit history of three
road polylines in OSM where there are changes to either the road “name” (Tables 4 and
6) or the highway designation attribute (Table 5). The aesterick symbol indicates the
Table 6 Example of changes to the value of the name attribute of the road polyline
4755815 (Scotland)
Table 7 Number of unique values assigned to the “highway” tag of objects in the four
OSM databases. The column ‘highway’ indicates the number of unique values assigned
current version of the objects in the OSM database. Each table shows the date of edit, the
version number, the tag value, and the ID of the contributor who made the edit. In all
three cases multiple contributors are involved. Some edits made on the same day
probably, in our opinion, correspond to self-corrections by the contributors who made
them.
Table 7 is similar to the results presented in Table 3. In this table we summarize the
number of unique values that the “highway” attribute is assigned for all objects in the four
databases with the “highway” tag. There are some interesting observations. A very small,
but not negligible, number of objects have a high number of changes of highway
designation. For example object 9779683 in Germany has four different values assigned to
its highway attribute over its 29 version history. These values are “primary”, “tertiary”,
“residential”, “secondary”. There are seven unique contributors to this object. A signifi-
cant percentage of objects in all four databases have three or more unique values assigned
to their highway attribute. We believe that it is unlikely that real-world physical highways
(motorways, roads, paths, etc.) could change their designation this frequently. For
example: UK (9.3%), Ireland (11.8%), Austria (16.7%), and Germany (15.1%). With
comparison to a ground-truth dataset it is difficult to understand precisely the reason for
the tag “flip-flopping” with the highway attribute. We believe this could demonstrate
uncertainty amongst contributors regarding the designation for a given highway object.
This could also reveal a deeper issue of semantics within the OSM Map Features. Different
Table 8 Overall usage of values from the OSM controlled vocabulary ‘Map Features’
from all unique values assigned to “landuse” and “highway” tags. The compliance
column indicates the number of unique values found with the number of these not in the
controlled vocabulary in brackets
577 objects which have the “landuse” attribute at some stage of their history. In total 39
values were assigned to “landuse” attribute tags (so 39–29 = 10 free text tags not defined
in OSM Map Features). From a visual inspection, spelling errors accounted for the
majority of these variants. In Table 9 a summary of the number of objects with landuse
or highway attributes (at first and current version) is provided. The number of objects
with these attributes is shown in the Objects column. The number of these objects where
the values for either landuse or highway attributes are drawn from the Map Features
controlled vocabulary is shown in the Compliance column. In all cases the number of
objects with these tags increases from the first to the current version. Compliance with
the Map Features controlled vocabulary is very good overall. Being compliant with the
map features controlled vocabulary does not in any way indicate that this attribute
assignment is currently correct and would need to be confirmed by comparison to
ground-truth datasets.
Table 9 Compliance of tagging with the map features controlled vocabulary: first and
current versions
In this article we have investigated how spatial objects are tagged in OSM databases. We
selected four OSM databases and from these 25,000 heavily edited objects were chosen
for analysis. The article begun with an introduction to “tagging” in OSM. This was
followed by a comprehensive overview of the literature on OSM. We then described the
process of choosing heavily edited objects and working with OSM-XML data. The
locations of Ireland, United Kingdom, Germany, and Austria were chosen because of the
home location of the authors and the activity of the OSM communities in the other three
regions. The analysis could be easily extended to other regions. Table 1 shows that over
90% of objects in the four OSM databases have ⱕ3 versions. This makes it difficult to
undertake analysis to investigate collaborative editing on these objects. Subsequently our
analysis chose to investigate “heavily edited” objects. These offer a similar concept to the
Wikipedia Featured Article. Heavily edited articles in Wikipedia are usually those that
gain the status of “featured article”. Featured articles are recognized as articles of high
quality, with a long history of collaborative editing, and have become relatively stable (no
major recent edits) (Anderka et al. 2011). Welser et al. (2011) explain that heavily edited
articles in Wikipedia usually gain the status of “featured article” and are subsequently
recognized as articles of high quality. Korfiatis et al. (2006) based their analysis of quality
of Wikipedia articles on successive edits and therefore focused on articles with a long edit
history. Hecht and Gergle (2010) focus on articles that have been edited frequently,
particularly those by the same contributor. Nemoto et al. (2011) indicate that quality
increases and stabilizes, the more contributors work on a given article.
The tagging and annotation of these heavily edited objects in OSM is restricted to the
use of a small number of tags. In all four datasets at least 50% of objects use six tags or
less over their history. We found the use of tags such as “source” and “description” (to
indicate how the data was captured, etc.) was sparse. Only 3.5% of the 25,000 objects
used one or both of these tags. Tag “flip-flopping” occurs where the values assigned to
tags such as “name” and “highway” change multiple times. Tables 3 and 7 show that a
small, but not negligible, percentage of objects have their “name” or “highway” tags
assigned different values over the object lifetime. The OSM Map Features page offers a
controlled vocabulary from which contributors can choose values for tags such as
“landuse” and “highway’ ”. Table 9 shows the number of objects which draw the values
for their “landuse’ ” and “highway” tags from the Map Features controlled vocabulary.
The rate of compliance is very high (>98% for the current versions of all objects).
However this compliance does not imply that the current values assigned to these
attributes are correct. Table 8 shows wide variations on the controlled vocabulary used
and Tables 7 and Table 3 show significant “flip-flopping” of values assigned to the
“highway” and “name” tags, respectively. Finally, no relationship was found to exist in
our four datasets between the number of contributors to an object and the number of
tags or tag “flip-flopping” on that object. Overall, this work shows that there are issues
in how contributors tag and annotate spatial features in OSM. These issues need to be
addressed before OSM can be considered for use in “serious geomatics applications”
(Mooney et al. 2010b, Over et al. 2010).
As described in section 3.2 our database of history for the selected 25,000 objects is
a very detailed record of contributor activity to OSM over a period of approximately
four years. This provides us with a very rich dataset from which future research work can
be developed. Haklay et al. (2010) and Girres and Touya (2010) show that increases in
the number of OSM contributors in an area are strongly related to an increase in
geometric data quality and spatial data volume. An immediate issue for future work
would be comparison of the tags of these 25,000 with ground-truth data to investigate
if a relationship exists between the number of contributors and accuracy of tagging.
There are no moderators for content in OSM. Contributors can take a ‘moderator’
responsibility for a particular OSM region or a set of objects. It would be interesting to
conduct a survey of OSM contributors to understand the causes of tag value changes for
example. This would allow us to formulate some indication of the methods of contri-
butions of different communities of OSM from different countries, etc. This work could
also include an analysis of the geometric and positional accuracy of heavily edited objects
over time measured against some ground truth dataset. As outlined in section 3.3. there
are just over 97,000 objects in the four databases with V ⱖ 15. From the complete set of
objects in all four databases this represents less than 1% of all objects. For future work
we will also consider reducing the threshold value V to investigate the effects it has on our
analysis and results. Finally, in the field of visualization we feel there is scope for work
on the visualization of the historical evolution of features in OSM. Gilbert and Kara-
halios (2009) remark that they “see vast potential for social visualization to make large
impacts on social production projects” because added value can be gained for both those
involved in the project and outside it from being able to “to observe the long-term
impacts on motivation and production in real, working social production communities.”
Several authors (Suh et al. 2008, Pirolli et al. 2009, Riche et al. 2010) claim that the
increased exposure to the editing/authoring histories, using visualization software
applications, for collaborative knowledge projects like Wikipedia, increases credibility
judgements and offers transparency. This can eventually lead to “improvements in the
interpretation, communication, and trustworthiness” (Suh et al. 2008) of collaboratively
generated knowledge. We feel that this could extend to include OpenStreetMap and other
VGI projects.
References
Anderka M, Stein B, and Lipka N 2011 Towards automatic quality assurance in Wikipedia. In
Proceedings of the Twentieth International Conference Companion on World Wide Web
(WWW’ 11), Hyderabad, India: 5–6
Antin J 2011 My kind of people?: Perceptions about Wikipedia contributors and their motivations.
In Proceedings of the 2011 Annual Conference on Human Factors in Computing Systems
(CHI ’11), Vancouver, British Columbia: 3411–20
Ather A 2009 A Quality Analysis of OpenStreetMap Data. Unpublished M.Eng., Department of
Civil, Environmental and Geomatic Engineering, University College London
Ballatore A and Bertolotto M 2011 Semantically enriching VGI in support of implicit feedback
analysis. In Tanaka K, Fröhlich P, and Kim K-S (eds) Web and Wireless Geographical
Information Systems. Berlin, Springer Lecture Notes in Computer Science Vol. 6574: 78–93
Brando C and Bucher B 2010 Quality in user generated spatial content: A matter of specifications.
In Proceedings of the Thirteenth AGILE International Conference on Geographic Information
Science, Guimarães, Portugal
Budhathoki N, Bruce B, and Nedovic-Budic Z 2008 Reconceptualizing the role of the user of spatial
data infrastructure. GeoJournal 72: 149–60
Bulterman D C A 2004 Is it time for a moratorium on metadata? IEEE MultiMedia 11: 10–17
Cantador I, Konstas I, and Jose J M 2011 Categorising social tags to improve folksonomy-based
recommendations. Web Semantics: Science, Services and Agents on the World Wide Web 9:
1–15
Ciepluch B, Mooney P, Jacob R, and Winstanley A C 2009 Using OpenStreetMap to deliver
location-based environmental information in Ireland. In Proceedings of the Seventeenth ACM
SIGSPATIAL International Conference on Advances in Geographic Information Systems,
Seattle, Washington: 17–22
Coleman D J, Georgiadou Y, and Labonte J 2009 Volunteered geographic information: The nature
and motivation of producers. International Journal of Spatial Data Infrastructures Research 4:
332–58
Corcoran P and Mooney P 2011 Topologically consistent selective progressive transmission. In
Geertman S, Reinhardt W, and Toppen F (eds) Advancing Geoinformation Science for a
Changing World. Berlin, Springer Lecture Notes in Geoinformation and Cartography: 519–38
Corcoran P, Mooney P, and Winstanley A C 2011 Planar and non-planar topologically consistent
vector map simplification. International Journal of Geographical Information Science 25: in
press
Flanagin A J and Metzger M J 2008 The credibility of volunteered geographic information.
GeoJournal 72: 137–48
Geofabrik 2011a Data Downloads for OpenStreetMap Data. WWW document, http://
www.geofabrik.de/data/download.html
Geofabrik 2011b Mapcompare: Visual Comparison of Google Maps and OpenStreetMap. WWW
document, http://tools.geofabrik.de/mc/
Gilbert E and Karahalios K 2009 Using social visualization to motivate social production. IEEE
Transactions on Multimedia 11: 413–21
Girres J-F and Touya G 2010 Quality assessment of the French OpenStreetMap dataset. Transac-
tions in GIS 14: 435–59
Goodchild M F 2007 Citizens as sensors: The world of volunteered geography. GeoJournal 69:
211–21
Goodchild M F 2008 Commentary: Whither VGI? GeoJournal 72: 239–44
Haklay M 2010 How good is volunteered geographical information? A comparative study of
OpenStreetMap and Ordnance Survey datasets. Environment and Planning B 37: 682–703
Haklay M, Basiouka S, Antoniou V, and Ather A 2010 How many volunteers does it take to map
an area well? The validity of linus’ law to volunteered geographic information. Cartographic
Journal 47: 315–22
Hecht B J and Gergle D 2010 On the “localness” of user-generated content. In Proceedings of the
2010 ACM Conference on Computer Supported Cooperative Work (CSCW ’10), Savannah,
Georgia: 229–32
Jacob R, Mooney P, Corcoran P, and Winstanley A C 2010 Haptic-GIS: Exploring the possibilities.
In Proceedings of the Eighteenth ACM SIGSPATIAL Conference on Advances in Geographic
Information Systems (Volume 2), San Jose, California: 13–18
Javanmardi S and Lopes C 2010 Statistical measure of quality in Wikipedia. In Proceedings of the
First Workshop on Social Media Analytics (SOMA ‘10), Washington, D.C.: 132–38
Kessler C, Janowicz K, and Bishr M 2009 An agenda for the next generation gazetteer: Geographic
information contribution and retrieval. In Proceedings of the Seventeenth ACM SIGSPATIAL
International Conference on Advances in Geographic Information Systems, Seattle, Washing-
ton: 91–100
Kessler C, Trame J, and Kauppinen T 2011 Tracking editing processes in volunteered geographic
information: The case of OpenStreetMap. In Proceedings of Workshop on Identifying Objects,
Processes and Events in Spatio-Temporally Distributed Data (IOPE), Belfast, Maine
Korfiatis N, Poulos M, and Bokos G 2006 Evaluating authoritative sources using social networks:
An insight from Wikipedia. Online Information Review 30: 252–62
Langläufer 2011 The OSM History Browser Web Application. WWW document, http://
osm.virtuelle-loipe.de/history/
Liu D, Wang M, Hua X-S, and Zhang H-J (2011 Semi-automatic tagging of photo albums via
exemplar selection and tag inference. IEEE Transactions on Multimedia 13: 82–91
Ludwig I, Voss A, and Krause-Traudes M 2011 A comparison of the street networks of Navteq and
OSM in Germany. In Geertman S, Reinhardt W, and Toppen F (eds) Advancing Geoinforma-
tion Science for a Changing World. Berlin, Springer Lecture Notes in Geoinformation and
Cartography: 65–84
Mooney P and Corcoran P 2011a Accessing the history of objects in OpenStreetMap. In Proceed-
ings of the Fourteenth AGILE International Conference on Geographic Information Science,
Utrecht, The Netherlands
Mooney P and Corcoran P 2011b Annotating spatial features in OpenStreetMap. In Proceedings
of the Nineteenth Annual GIS Research UK Conference (GISRUK 2011), Portsmouth,
England
Mooney P and Corcoran P 2011c Can volunteered geographic information be a participant in
eEnvironment and SDI? In Hrebícek J, Schimak G, and Denzer R (eds) Environmental
Software Systems: Frameworks of eEnvironment. Boston, MA, Springer Advances in Infor-
mation and Communication Technology Vol. 359: 115–22
Mooney P, Corcoran P, and Winstanley A C 2010a A study of data representation of natural
features in OpenStreetMap. In Proceedings of the Sixth International Conference on Geo-
graphic Information Science (GIScience 2010), Zurich, Switzerland
Mooney P, Corcoran P, and Winstanley A C 2010b Towards quality metrics for OpenStreetMap.
In Proceedings of the Eighteenth ACM SIGSPATIAL International Conference on Advances in
Geographic Information Systems, Seattle, Washington: 514–17
Nemoto K, Gloor P, and Laubacher R 2011 Social capital increases efficiency of collaboration
among Wikipedia editors. In Proceedings of the Twenty-second ACM Conference on Hyper-
text and Hypermedia (HT ‘11), Eindhoven, The Netherlands: 231–40
OpenStreetMap 2011a The Map-features Page on OpenStreetMap: A Guide to the Recommended
Tagging of OSM Features. WWW document, http://wiki.openstreetmap.org/wiki/
Map_Features
OpenStreetMap 2011b OpenStreetMap Automated Import and Editing Code of Conduct. WWW
document, http://wiki.openstreetmap.org/wiki/Automated_Edits/Code_of_Conduct
OSM API 2011 OpenStreetMap Editing API for Fetching and Saving Raw Geodata From/to the
Global OpenStreetMap Database. WWW document, http://wiki.openstreetmap.org/wiki/API
(Online Wiki)
OSM History Viewer 2011 The OSM Changeset Visualiser Web Application. WWW document,
http://osmhv.openstreetmap.de/index.jsp
OSM Map Features 2011 The Map Features Page Guide to Tagging Features on OpenStreetMap.
WWW document, http://wiki.openstreetmap.org/wiki/Map_Features
OSM-Stats 2011 The OpenStreetMap Statistics Page. WWW document, http://www.
openstreetmap.org/stats/data_stats.html
Over M, Schilling A, Neubauer S, and Zipf A 2010 Generating web-based 3D city models from
OpenStreetMap: The current situation in Germany. Computers Environment and Urban
Systems 34: 496–507
Pirolli P,Wollny E, and Suh B 2009 So you know you’re getting the best possible information: A tool
that increases Wikipedia credibility. In Proceedings of the Twenty-seventh International Con-
ference on Human Factors in Computing Systems (CHI ‘09), Boston, Massachusetts: 1505–08
Prasarnphanich P and Wagner C 2011 Explaining the sustainability of digital ecosystems based on
the Wiki model through critical-mass theory. IEEE Transactions on Industrial Electronics 58:
2065–72
Qian X, Di L, Li D, Li P, Shi L, and Cai L 2009 Data cleaning approaches in Web2.0 VGI
application. In Proceedings of the Seventeenth International Conference on Geomatics,
Fairfax, Virginia: 1–4
Ramm F 2011 Displaying historic OpenStreetMap coverage. WWW document, http://
labs.geofabrik.de/history/
Riche N H, Lee B, and Chevalier F 2010 iChase: Supporting exploration and awareness of editing
activities on Wikipedia. In Proceedings of the International Conference on Advanced Visual
Interfaces (AVI ‘10), Rome, Italy: 59–66
Roick O 2011 OSMatrix: Visualisation of contributions to OSM. WWW document, http://
koenigstuhl.geog.uni-heidelberg.de/osmatrix/
Smart P D, Quinn J A, and Jones C B 2011 City model enrichment. ISPRS Journal of Photogram-
metry and Remote Sensing 66: 223–34
Stein K and Hess C 2007 Does it matter who contributes: A study on featured articles in the
German Wikipedia. In Proceedings of the Eighteenth International ACM Conference on
Hypertext and Hypermedia (HT ‘07), Manchester, United Kingdom: 171–74
Suh B, Chi E H, Kittur A, and Pendleton B A 2008 Lifting the veil: Improving accountability and
social transparency in Wikipedia with Wikidashboard. In Proceeding of the Twenty-sixth
International Conference on Human Factors in Computing Systems (CHI ‘08), Florence, Italy:
1037–40
Trame J and Kessler C 2011 Exploring the lineage of volunteered geographic information with heat
maps. In Proceedings of the GeoViz 2011: Linking Geovisualization with Spatial Analysis and
Modeling Workshop, Hamburg, Germany
van Exel M, Dias E, and Fruijtier S 2010 The impact of crowdsourcing on spatial data quality
indicators. In Proceedings of the Sixth International Conference on Geographic Information
Science (GIScience 2010), Zurich, Switzerland
Wallgrün J O, Wolter D, and Richter K-F 2010 Qualitative matching of spatial information. In
Proceedings of the Eighteenth ACM SIGSPATIAL International Conference on Advances in
Geographic Information Systems, San Jose, California: 300–09
Welser H T, Cosley D, Kossinets G, Lin A, Dokshin F, Gay G, and Smith M 2011 Finding social
roles in Wikipedia. In Proceedings of the 2011 iConference, Seattle, Washington: 122–29
Zielstra D and Hochmair H H 2011 Digital street data: Free versus proprietary. GIM International
25(7): 29–33
Zielstra D and Zipf A 2010a A comparative study of proprietary geodata and volunteered
geographic information for Germany. In Proceedings of the Thirteenth AGILE International
Conference on Geographic Information Science, Guimarães, Portugal
Zielstra D and Zipf A 2010b Quantiative studies on the data quality of OpenStreetMap in
Germany. In Fabrikant S I, Reichenbacher T, van Kreveld M, and Schlieder C (eds) Geo-
graphic Information Science: Proceedings of GIScience 2010. Berlin, Springer Lecture Notes in
Computer Science Vol. 6292: 20–26