GeoKnow addresses a bold challenge in the area of intelligent information management: the exploitation of the Web as a platform for geospatial knowledge integration as well as for exploration of geographic information. This group will bring together scientists, GIS users, linked Data users, data consumers and providers, interested in the exploitation of linked geospatial data.
This group will not produce specifications.
Note: Community Groups are proposed and run by the community. Although W3C hosts these conversations, the groups do not necessarily represent the views of the W3C Membership or staff.
GeoKnow has recently introduced OSMRec, a JOSM plugin for automatic annotation of spatial features (entities) into OpenStreetMap. OSMRec trains on existing OSM data and is able to recommend to users OSM categories, in order to annotate newly inserted spatial entities. This is important for two reasons. First, users may not be familiar with the OSM categories; thus searching and browsing the OSM category hierarchy to find appropriate categories for the entity they wish to insert may often be a time consuming and frustrating process, to the point of users neglecting to add this information. Second, if an already existing category that matches the new entity cannot be found quickly and easily (although it exists), the user may resort instead to using his/her own term, resulting in synonyms that later need to be identified and dealt with.
The category recommendation process takes into account the similarity of the new spatial entities to already existing (and annotated with categories) ones in several levels: spatial similarity, e.g. the number of nodes of the feature’s geometry, textual similarity, e.g. common important keywords in the names of the features and semantic similarity (similarities on the categories that characterize already annotated entities). So, for each level (spatial, textual, semantic) we define and implement a series of training features that represent spatial entities into a multidimensional space. This way, by training the aforementioned models, we are able to correlate the values of the training features with the categories of the spatial entities, and consequently, recommend categories for new features. To this end, we apply multiclass SVM classification, using LIBLINEAR.
The following figure represents a screen of OSMRec within JOSM. The user can select an entity or draw a new entity on the map and ask for recommendations by clicking the “Add Recommendation” button. The recommendation panel opens and the plugin automatically loads the appropriate recommendation model that has previously been trained offline.
The recommendation panel provides a list with the top-10 recommended categories and the user can select from this list and click “Add and continue”. As a result the selected category is added to the OSM tags. By the time the user adds a new tag at the selected object, a new vector is computed for that OSM instance in order to recalculate the predictions and display an updated list of recommendations (taking into account the previously selected categories/tags, as extra training information). Further, OSMRec provides functionality for allowing the user to combine several recommendation models, based on (a) a selected geographic area, (b) user’s past editing history on OSM and (c) combination of (a) and (b). This way, personalized category recommendations can be provided that take into account the user’s editing history and/or the specific characteristics of a geographic area of OSM.
OSMRec plugin can be downloaded and installed in JOSM following the standard procedure. Detailed implementation information can be found in the following documents:
A typical tourist scenario is hard to picture without a map. Yet, such a scenario implies you are not familiar with your surroundings and, therefore, often not sure how to find the things that are of interest to you. Typical geospatial browsers will provide you with common exploration tools that will most often include a slippy map combined with keyword search, categorized points of interest (POIs) and a fixed set of filters. But, all of these imply either that you know what it is you’re looking for, or that the preset collection of POIs and criteria will be enough to satisfy your needs. In real life, however, those needs will often be affected by the given context, which is, in turn, dependent on multiple, dynamic factors, such as the place you’re visiting, your mood, interests, background etc. Imagine using your favorite geospatial browser to answer the following question:
“Where are the nearest buildings designed by Frank Lloyd Wright, typical of the Prairie School movement?”
GEM (Geospatial-semantic Exploration on the Move) is the very first geospatial exploration tool that offers a rich mobile experience and overcomes the abovementioned limitations of conventional solutions by exploiting all strengths of the Linked Open Data paradigm, such as built-in semantics in open, crowd-sourced knowledge found in publicly available sources, such as DBpedia, loaded and filtered on-demand, according to user’s needs, in order to prevent maps from overpopulating.
The Linked Map team is willing receive feedback on the results of the project. This project is a short term project (less than 1 year) and it is part of the larger FP7 project PlanetData. Linked Map address the development of a standard WMS that, at the same time, is a LD node offering read/write access to geographic knowledge. This vision is applied in a challenging scenario: the use of crowdsourcing techniques to improve the quality of the automatic integration of a National Map with existing VGI data.
Up to date we have developed:
A transparent semantic proxy for WMS 1.3.0 (deliverable D17.1)
Transformation into RDF of a National Map (BCN/BTN25 provided by IGN.es) and a VGI dataset (a portion of OSM) (deliverable D16.3)
Large scale alignment of both datasets using only name, type and location properties (deliverable D16.3)
Annotation of the resulting RDF datasets at feature level with W3C PROV ontology (deliverable D16.1 and implementation D16.3)
The GeoKnow project has been running over one year and is proud to show the first results of the research project during the W3C Switzerland event on May 22, 2014. Ontos, a W3C member and a partner of the GeoKnow project will demonstrate the GeoKnow Generator during the talk “Linked Open Data”. The event takes place in 1700 Fribourg, Switzerland and is free of charge. More information about the event and registration is available at the following link:
The GeoKnow team has created the first tutorials showing how to work with the GeoKnow Generator and the tools. The team plans to extend the list of tutorials during 2014 helping everybody to get a better understanding on how to work with geospatial data using the GeoKnow LD stack.
The GeoKnow consortium also welcomes everybody to work with the prototype available at http://generator.geoknow.eu:8080/generator/#/home. Just keep in mind it is the demo server and as with many software projects some minor problems can occur.
Any feedback is welcome via our Twitter channel https://twitter.com/geoknow.
One of the most important event in Europe about Open Data is the European data forum. During the event that will take place in March in Athens several workshop about tools and applications are organised. The GeoKnow consortium will present the LinkedDataEurope workshop the current version of the research project. During the talk we will show the GeoKnow research objectives related to geospatial linked open data, the GeoKnow Generator and various tools that help to fulfil the Linked Data Lifecycle with geospatial data.
Following is the link to the slides that will be presented at the workshop:
In the past month (April 2013), we invited geospatial data consumers and providers, GIS experts and Semantic Web specialists to participate in our Geospatial Data Users Survey. The goal of this survey was to collect general use cases and user requirements from people outside the GeoKnow consortium. We publicised the survey using mailing lists and social networks, and it was available for 25 days. During this period we received 122 responses, of these we had 51 full responses and 71 incomplete ones. Since we were interested in having good quality surveys, so we performed a manual control, which resulted in 39 useful responses – not too bad. In this blog post, we aim to show some interesting results from our survey. If you are interested to learn more about the results of this survey, you can check the public derivable available here.
One of the goals of this survey was to learn more use cases different from those we already consider in the project. Thus, we asked participants how they use geospatial data in their work. To analyse this question, we grouped answers in different types which is shown in the graph at the right. Most of the scenarios were about visualisation and analysis, followed by geospatial data creation scenarios.
We asked users for the most popular tools they use at their work. Responses to this question were OSM and Google Maps/Earth, as well as other GSI. After we asked about the features they like the most about these tools, participants reflected preference by easy to use and free tools for their work, referring to their popular choices of Google Maps or OSM. Also having an API to interact with the application is important. The fact that applications provided data that can be integrated was also appreciated. GIS applications were considered as difficult. Integration and interoperability were mentioned as goals. Besides the previous question, we were also interested in knowing the missing functionalities that may improve their work. A list of these functionalities grouped by the related work package within GeoKnow is presented in the image below.
This survey allow us to learn from different use cases, main features used, and desired functionalities, that are to be considered in the creation of the GeoKnow Generator. Some important high level findings from the survey were the emphasis in interoperability and reusability through open APIs and approachable visualisation components, support for common geospatial data formats and geodbs, and the necessity of simple tools to support data integration/reuse from geospatial LOD sources. We also found that some of the ideas of the GeoKnow project are further supported by user requirements like the integration of private and public data and the importance of using the web as an integration platform.
Many different applications we deal with on a daily basis have some kind of geographic dimension. This geospatial information is normally required for decision making at different levels. However, this information is dispersed among a multiplicity of sources. At GeoKnow we aim to make information seeking easier by allowing exploration, editing and interlinking of heterogeneous information sources with a spatial dimension.
Now we are interested in getting to know the people that face these kinds of issues in their everyday work. We have created a survey to help us to understand and to hear more about their experience with geospatial data. This survey targets geospatial data consumers and providers, and GIS users interested in having an integrated web of geospatial data.
If you use geospatial data in your work, your contribution in this survey will be highly appreciated. The outcome of this survey will impact the use cases and requirements for the GeoKnow project, which aims to create a versatile software framework to rapidly generate spatial semantic web applications.
We are offering a 20 euro Amazon voucher to the first 50 completed surveys. Willing to participate? Please go right away to:
We would like to announce the Descartes Core 2013 geospatial semantics specialist meeting that will take place 03/20-22/2013 at Santa Barbara, CA, USA. The meeting aims at taking the current series of GeoVoCamps, workshops, and specialists meetings to the next level by establishing a common core of (geo-)ontology design patterns, vocabularies, best practice guides, examples, software, and services, that aim to foster semantic interoperability between different (Linked Data) sources without restricting semantic heterogeneity at the same time.
The meeting will be organized as a so-called Geo-Vocabulary Camp (GeoVoCamp). This is a series of free and informal but highly productive meetings in which domain experts and ontology engineers work together to discuss problems and projects and to develop ontologies and tools. Previous events have resulted in a number of geo-ontologies, new collaborations, and research papers. If you are interested in geospatial semantics, linked spatiotemporal data, big geo-data, semantic interoperability and heterogeneity, semantic annotations and metadata, or geographic information retrieval, we hope that you will be able to join us this March. We especially welcome participants from different domains such as the digital humanities or chemistry that are interested in space and place.
Please register as soon as possible to help us in organizing the event. We will announce the schedule, activities, speakers, and potential topics for breakout groups during the next weeks using the wiki at http://descartes-core.org/. The page also contains links to some previous events.
Mark Gahegan, Pascal Hitzler, Werner Kuhn, and Krzysztof Janowicz