It’s been a few months since we first announced our Democratic Commons project under the banner of “shared code, data and resources where anyone can contribute, and anyone can benefit” — but if we’ve been silent since then, it’s certainly not for a lack of activity.
Quite the reverse, in fact: we’ve been busy bringing new team members on board and getting stuck in with the time-consuming and often fiddly process of data gathering and sharing.
When we’re in the midst of all this hard work, it’s sometimes hard to remember to talk about how everything’s going; but it’s always interesting, so here’s a snapshot of where we are now.
Those of us working on Democratic Commons are only a small team within the smallish organisation mySociety. Gathering in-depth data on politicians all around the world takes more time and more local knowledge than we have ourselves, so we’re working with partners located within our target countries.
Distintas Latitudes have been handling Latin America – they’ve been great at gathering data and explaining the various differences between the political systems in each country we’ve worked together on.
In India, Factly and Gender And Politics have done the most amazing job in gathering a full national and state level dataset for politicians right across the country. We were astounded, as that is a LOT of data (over 3,500 records and counting so far).
And in South East Asia we’re working with OCF, with whom we’ve had a long association (you may remember TICTeC Taiwan, for example). OCF have helped us with data for Taiwan and South Korea so far, and are set to work with us on seven more countries before December 2018.
Finally, a special mention goes to OpenLeb of Lebanon, who are working hard to start finding data in a country where data is not usually open. We genuinely could not do this work without our partners and we are eternally grateful for their help.
As is probably clear from the above, we often select which countries to work on by our ability to find a community or organisation that will extend help. A nice side effect of this is that we’re strengthening the connections and bonds between mySociety and organisations with similar missions in many different places.
Growing the community of such organisations across the world is going to be the primary focus of our new Community Manager Georgie, whom you will no doubt hear a lot from over the next few months.
She’s going to be finding out who’s already using data like this, who’s maintaining it, who’s interested in running projects with it or doing research — and seeing if there’s also an appetite there to keep the data up to date. This is because the data will really only be useful to people if it’s well maintained and current!
Working with Wikidata
Early on, we recognised that improving the political data available in Wikidata, rather than ringfencing it all within EveryPolitician, was going to be an efficient way to maximise the benefits of the Democratic Commons project.
What does this mean in practice? Well, in our first phase we’ve targeted 13 places in which to locate the data and load it into Wikidata: Mexico, Brazil, Colombia, Paraguay, Chile, Canada, Italy, Estonia, Lebanon, India, S Korea, Taiwan and Hong Kong.
Ultimately, we want to help make any and all information about politicians at every level freely and openly available via the Commons; but for now, our initial scope looks at representatives at national, regional, and city legislatures.
Now that a lot of this work is being done on or in Wikidata, we’re creating tools to make processes smoother and faster. The main ones of these are around verifying data and creating statements in Wikidata; we hope that when we’ve completed these they’ll be valuable to the whole Wikidata community beyond just the Democratic Commons project.
Step by step
We’re focusing on getting what we’re terming ‘Outline Data’ for each place loaded into Wikidata first. This type of data helps us model the political system, as it tells us what the legislature looks like — for example, whether it is unicameral, bicameral or different to those; what it calls members of the legislature; what term the legislature is on and how long that lasts; and often how many seats that legislature contains.
Once we have that outline data, we then need some information about people holding seats in those legislatures. We try and start with five examples of each type of role at each level, then we can send this ‘Seed Data’ off to hopefully crowdsource the rest of the data: more on that in a bit!
Meanwhile, our GIS expert Will is working on boundaries. Boundary data is hard — like, really hard! This is one of the most challenging areas of the project but it’s also one of the most important. Without electoral boundaries we don’t know what area a politician is representing and a lot of the tools we think this data would be useful for just won’t work.
However, boundaries aren’t always released openly or completely, especially when it comes to local level constituencies, and even when we do find them, understanding whether we have all the data we need to represent politicians correctly can be really tricky.
Because we like to keep really busy, we’ve also been starting to collaborate with other organisations such as Open Knowledge and CLEA on how to raise the visibility of the availability (or lack, more likely) of open sources of official boundary data.
Working with Facebook
You may remember our ongoing work to connect Facebook users with their politicians after an election, in countries around the world.
We’re also working with Facebook to run some crowdsourcing experiments that will gather more data on politicians. I mentioned ‘Seed Data’ above. For each country, this gets fed to Facebook, and allows them to create questions which they can send to users to ask them who their representatives are at different levels of government.
We then get this data back and our partners help us verify it and put it into Wikidata so it becomes open and available for anyone to use. Facebook has a reach we would never be able to manage on our own.
So that’s where we are
As I’ve hopefully demonstrated in this post, the work is extremely challenging. That’s why we’re sometimes a little slow in updating where we’ve got to — but we genuinely believe that that having this data out there in the open will pave the way for so many exciting new political data-based projects and research. And so, onwards!
Image: Ben White (Unsplash)