During the May 2010 local elections, Socitm joined with the Local Government Association to support the Open Election Data Project set up by Chris Taggart, developer of OpenlyLocal.com and a member of the Department of Communities’ Local Public Data Panel, on which Socitm also sits.
Local authorities were encouraged to publish election results on their websites as ‘Linked Open Data’ – data that is published under an open licence that allows unrestricted reuse, and that is marked up to identify the structure and meaning, making possible its automated collection for re-publishing and mashing up with other data.
The purpose of the project was threefold:
- to start building an open database of local election results (none currently exists);
- to help local authorities develop skills and knowledge in publishing open data
- to identify issues, pain-points and blockers to publishing open data in general on a relatively contained body of information.
Given the Coalition Government’s commitment to publishing open data, and stated aim of having Local Authorities publish their information as machine-readable open data, the project has been useful as an unofficial pilot for many other data sets that authorities may wish to publish in the future.This document details the key lessons learned from the project, most of which are applicable to other local government datasets, and indeed other projects undertaken by a wide range of bodies interested in publishing linked open data to the same standard. We believe this is probably the first attempt to have a group of public bodies publish linked data to the same consistent standard, and as such there are important implications for similar future exercises.
The project and its outcomes
As indicated above, no freely available database of local election results exists. This is because the data required to compile it is located in different sections of hundreds of different council websites. The information is presented in many different formats and many different ways, and the only way the information can be compiled for re-use is manually – a laborious affair involving finding, cutting and pasting lots of individual pieces of information published by 433 different councils.
The Open Election Data project asked that instead of publishing their results using arbitrary and often inaccessible formats, councils should publish the results as HTML (the language used to write web pages) enriched with ‘RDFa‘. Though invisible to normal users, publishing the information in this way gives it structure and meaning, and means it’s possible for machines to read the information as data.
….
- There is a lack of ‘corporate’ awareness/understanding of open data issues, and this will inhibit take up of open, linked data publishing unless it is addressed
- There is a lack of even basic web skills at some councils
Worryingly there were councils where no-one had even basic web-authoring skills (i.e. a good understanding of HTML), being merely relegated to fill in forms in an (outsourced) content management system. In a world where the web is becoming the main method of communication with citizens and between bodies, this is not unlike having a finance department with no-one who understands the core rules of accountancy. Without those basic skills as a foundation there is no way a body can hope to produce linked data.- Many councils lack web publishing resources, never mind the resources to implement open, linked data publishing
Many council webteams are significantly under-resourced, e.g. consisting of one person, possibly part-time, sometimes with £0 annual budget, meaning any expenditure has to be bid for and a formal business case made. This stifles innovation (the driving force of the web), leads to websites that become increasingly out-of-date, and makes participation in such a project problematic (although there was no direct cost involved in participation, it did require an investment in time).
The open, linked data issue may not, therefore, come high on web managers’
priority list, unless publishing in open data format becomes a requirement on councils.- The understanding of even the basics of linked data and the steps to publishing public data in this way is very, very limited
Publishing information as machine-readable data is not new, but the standards and methods for publishing it on the web so that anyone can consume it have only recently moved out from academic circles.
Because of this, even those experience with core web technologies have little familiarity with it, and the core principles behind it (using web addresses, or resource URIs, as identifiers for objects and relationships).
In addition, there are few tools for the publishing of linked data, meaning much has to be done by hand and checked manually, which is time-consuming and error-prone. (This is similar to the early days of the web, before the creation of content management systems and web-authoring tools.)
The Open Election Data Project has increased knowledge in local government of this by an order of magnitude, as well as providing concrete examples of publishing of open data by councils and establishing an informal network of people who have the knowledge and are willing to pass it on.- The tools for and knowledge of consuming linked data are also limited
Although linked data exposes data in a richer form, consuming it is not as well-known or straightforward as other formats (e.g. XML, JSON) requiring a greater investment in time and knowledge by data users. While it is appropriate for election data (where it is critical that electoral areas and parties are properly identified), it may not be as essential for all data sets, and should not necessarily be the first format supported, as it may delay publication and consumption.- Publishing RDFa conflicted with some existing setups
A common query councils had was validation errors caused by a conflict between the RDFa and HTML Doctype (which defines the version of HTML), either because councils were unable to change the Doctype, or because their existing code would break if they did. Although the validation errors were insignificant (it would not affect either browsers or screenreaders), some councils used their strict policies about validation as reasons for not taking part (even though there were invariably validation errors on other pages).
Another problem (for data users, rather than publishers) was where the HTML included the base url, and there were also problems when consuming data that had line breaks within the RDFa.- Getting councils to publish data in open, linked formats will require dedication of a range of resources and application of a mix of skills
The success of the Open Election Data Project, through which some 20 councils successfully published results as open, linked data at minimal cost to them required the following resources:
- A project leader, in this case, OpenlyLocal (which also funded the project), and endorsement and in kind support from ‘recognised’ authorities, in this case, LGA and Socitm.
- Access to ‘official’ information resources (e.g. The Electoral Commission, lined data experts at the Cabinet Office) to ensure the subject was properly understood and represented in the data.
- communications skills to explain the issues and benefits and identify appropriate channels to persuade the necessary people that their organisation should participate in the project
- communications resources including a ‘campaign banner’ and an interactive communications ‘hub’ for the project (see below)
- research and publication of resources required for implementation by webmasters, eg resource URIs for political parties
- technical expertise around web coding and mark-up and standards and resource URIs and the resource to deploy this as advice to councils
- Councils need a dedicated, interactive, information and advice source
L’intero articolo, raggiungibile all’indirizzo http://data.gov.uk/blog/publishing-local-open-data-important-lessons-open-election-data-project prosegue nelle indicazioni dirette a sviluppatori e addetti ai lavori che vogliano contribuire alla pubblicazione dei dati con codice RDfa , puntualmente riportato nel post, semplicemente cliccando sul link How to mark up your election data del sito dedicato al progetto http://openelectiondata.org
La lezione prosegue
- Publishing each type of data will require appropriate resource URIs if data is to be published as open, linked data, and furthermore require councils to be able to use those URIs
Every data set to be published as linked data – be it expenditure data or information about recycling facilities or numbers of free school meals – will need to have a appropriate resource URIs agreed, published and communicated across local authorities. It is worth noting that one of the barriers the Open Election Data website solved was that councils in general had no idea of the Office of National Statistics SNAC ids used to identify the councils and the wards, and therefore needed to be told them by the website. It is to be expected that similar knowledge gaps will exist in other areas.- Many content management systems are inflexible and made the adding of RDFa to HTML at best problematic at best, and sometimes completely impossible
One of the biggest problems local authorities faced with publishing their election data as linked data was that their content management systems did not support this, and at times made this impossible. One council which had recently bought an expensive new proprietary content management system spent many hours trying to add RDFa markup to the HTML, and eventually gave up.On the other hand, other content management systems used this as an opportunity to show the flexibility of their systems, as the ‘plug-in’ created for the Jadu CMS by one of its customers (Lichfield District Council) in connection with the Open Election Data project illustrates.
Local authorities that have or acquire content management products that do not allow use of RDFa and XHTML will find it very difficult to meet the Government’s goal of publishing data in open, linked formats wherever possible.
- Upgrading council software systems to generate ‘open linked data’ automatically may be possible but will not always be straightforward
During the Open Elections Data Project, the supplier of the Modern.gov committee/democratic data publishing system agreed to modify the system so that it would publish linked data (it published this as a web service using the XML version of RDF rather than RDFa). As a result, data published on the website directly from this system would not need further modification for it to be published as open, linked data.However, even this brought its own issues, as the existing data in the Modern.gov systems didn’t include standard ward identifiers and some councils were unable to expose the web service sue to the setup of their firewalls.
- Outsourced web publishing contracts often make it difficult and expensive to publish open, linked data
A common comment from councils wanting to participate was “We’d like to do this, but we’ve outsourced our web systems and it will cost us £XXX just to pick up the phone to our supplier.” Even aside from the specific issues of the Open Election Data project, with the web developing so quickly this sort of outsourcing agreement doesn’t seem capable of keep pace with those changes.
In addition, it also became clear that many outsourcing suppliers had no more knowledge about linked data than the average council. This is clearly a problem that goes beyond the requirement for open data and is about the wider fitness for purpose of websites commissioned from external suppliers, and the need for appropriate website commissioning skills within councils.
Problemi e difficoltà, quelli elencati sopra, non hanno precluso e meno che mai frenato lo slancio di coloro che vedono nei Linked Open Data una rappresentazione più evoluta di quella cultura democrazia che deve essere diffusa tra i cittadini e senza la quale nessuna democrazia politica sarebbe tale.
Approfittando dello spunto sempre esemplare giunta da oltremanica, ho cercato sul sito istituzionale del Ministero dell’Interno – Dipartimento per gli Affari Interni e Territoriali – archivio storico, dati relativi alle elezioni in Italia.
Sono disponibili dati relativi alle recenti elezioni in Italia, rigorosamente rappresentati da numeri , manco elencati in un elementare formato csv, consultabili, ergo, solo on line; dati ancora relativi alle recenti elezioni e non aggiornati; dati relativi SOLO al referendum del 1946 e se la memoria non mi inganna, per esempio, si è votato nel giugno del 2006 il referendum sulla Riforma Costituzionale.
E’ tempo di studiare !
La Manica
This precious stone set in the silver sea,
Which serves it in the office of a wall
Or as a moat defensive to a house,
Against the envy of less happier lands .
William Shakespeare, Riccardo II Atto II, Scena I

Politecnico di Torino leads the LAPSI Network: make PSI in Europe more accessible, re-usable and exploitable.
Information generated and collected by public sector entities represents a veritable minefield; it might make a much greater contribution to EU economies and societies, if current legal barriers to access and re-use were removed.
The LAPSI (Legal Aspects of Public Sector Information) project intends to build a network apt to become the main European point of reference for high-level policy discussions and strategic action on all legal issues related to the access and the re-use of the PSI namely in the digital environment.
The debate is to be organized around four focal points: (1) implementation and deployment issues; (2) design of the incentives for public bodies and private players, both in the for-profit and non-profit sectors, to make available and, respectively, to re-use public data; (3) special consideration of infra- and supra-national levels of access and re-use policies and practices, intended to enlist the dynamic forces of regulatory competition and to bring out the full potential of cross-border, EU-wide services; and crucially (4) strategic vision and occasions for out-of-the box thinking for the next steps ahead in policy making.
LAPSI will take part in the GSDI 12 World Conference (Global Spatial Data Infrastructures) that will take place in Singapore from 19 to 22 October 2010.
( via LAPSI public wiki page – an abstract submitted by Katleen Janssen, Roger Longhorn and Gabor Remetey-Fullop to provide the GSDI-conference with a brief introduction to LAPSI project)
The GLOBAL SPATIAL DATA INFRASTRUCTURE ASSOCIATION, is an inclusive organization of organizations, agencies, firms, and individuals from around the world. The purpose of the organization is to promote international cooperation and collaboration in support of local, national and international spatial data infrastructure developments that will allow nations to better address social, economic, and environmental issues of pressing importance.