What we’ve done — and what we want to do
Wikidata now has up-to-date and consistent data on political position holders in current national legislatures for at least 39 countries (and work in progress for over 60 countries), thanks to work by volunteer community members on the Wikiproject every politician. mySociety worked as part of this project with a Wikimedia Foundation grant in 2017-18.
There is now a real possibility for Wikidata to become the definitive source of data about democracies worldwide — but only if that data can be maintained sustainably. A significant risk is that elections and other major political changes quickly render data on political position holders and legislatures in Wikidata out-of-date.
We’re proposing a Wikidata post-election updating toolkit project, which aims to ensure that data on elected representatives is substantially correct and complete within a month following an election, leading to improved quality and consistency of data in Wikidata over time. We’ll work as part of the Wikidata community to create and signpost tools and pathways that help contributors to quickly, easily and consistently update data following an election or other political change.
How community members can get involved in the project
If you’re already active around data relevant to political position holders, legislatures, or elections in Wikidata, we’d like your feedback and help to test the new tools and guidance and ensure that they are consistent with the emerging consensus around modelling these types of data.
In particular, if you live in a country or major region that has an upcoming election, please talk to us about piloting the tools! We’d like for you to test the project tools and guidance to update data following your country’s election, and to give us feedback on the value and appropriateness of the approach in your context and political system.
In general, we’re keen to encourage discussion and evaluation of Wikidata as a source of current position holder data.
Please review our proposal
If you’re interested in this, and are active on Wiki projects, please have a look and review our proposal here.
Image: Mike Alonzo
Earlier this year, we were fortunate enough to be contacted by Brian Keegan, Assistant Professor in Information Science at the University of Colorado Boulder, who specialises in the field of network analysis.
Brian and his team were planning to mine the official biographies of every legislator published by the Library of Congress – going back to the first Congress in 1789 – and add the information as structured data to Wikidata. Having heard of our involvement with WikiProject Every Politician, they wanted to understand more about contributing.
The research team, which included professors from the Libraries, Political Science and Information Science departments, planned to combine this biographical data with more common data in political science about voting and co-sponsorship, so that interesting questions could be asked, such as “Do Ivy League graduates form cliques?” or “Are medical doctors more likely to break with their party on votes concerning public health?”. Their hypothesis was that the biographical backgrounds of legislators could play an important role in legislative behaviours.
However, the first big step before questions could be asked (or SPARQL queries made) was supporting undergraduate students to enter biographical data for every member of Congress (going right back to the first) on Wikidata. This has not generally made it into the datasets that political scientists use to study legislative behaviour, and as students began to enter data about these historical figures, it quickly became apparent why: non-existent nations, renamed cities, and archaic professions all needed to be resolved and mapped to Wikidata’s contemporary names and standardised formats.
Nine months on, the team and ten undergraduates have revised over 1,500 Wikidata items about members of Congress, from the 104th to the 115th Congresses (1995-2018) and the 80th– 81st Congresses (1947-1951), which is 15% of the way through all members dating back to the first Congress in 1789!
They started running SPARQL queries this summer.
Joe Zamadics, a political science PhD student who worked on the project explained the potential of combining these data: “One example we tried was looking at House member ideology by occupation. The graph below shows the ideology of three occupations: athletes, farmers, and teachers (in all, roughly 130 members). The x-axis shows common ideology (liberal to conservative) and the y-axis shows member’s ideology on non-left/right issues such as civil rights and foreign policy. The graph shows that teachers split the ideological divide while farmers and athletes are more likely to be conservative.”
The team are keen to highlight the potential that semantic web technology such as Wikidata offers to social scientists.
For the full Q + A with Brian and Joe see the mySociety Medium post.
You may remember that thanks to a grant from the Wikimedia Foundation, mySociety has been working to support increasingly authoritative data on the world’s politicians, to exist on Wikidata as a key part of developing the concept of the Democratic Commons.
And, this summer mySociety welcomed two members of staff to support with the community work around both Wikidata and the Democratic Commons. In May, I (Georgie) joined in the role of ‘Democratic Commons Community Liaison’ and in late June I was joined by Kelly, mySociety’s first ever ‘Wikimedia Community Liaison’… and it’s about time you started to hear more from us!
I’ve been climbing the learning curve: exploring the potential moving parts of a global political data infrastructure, finding out how the communities of Wikidata and Wikipedia operate, attempting to take meaningful notes at our daily meetings for the tool the team developed to improve political data on Wikidata and making sense of the complexity in creating interface tools to interpret the political data already in Wikidata. Oh, and supporting a “side-project” with Open Knowledge International to try and find every electoral boundary in the world (can you help?).
And if you are in any of the relevant open Slack channels (what is Slack?), you may have seen my name on the general introduction pages, as I have been shuffling around the online community centres of the world — off Wikidata Talk that is — trying to find the people interested in, or with a need for, consistently and simply formatted data on politicians, but who aren’t already part of the Wikidata community.
That’s because, the issue the Democratic Commons seeks to address is the time-consuming business of finding and maintaining data on politicians, work that we suspect is duplicated by multiple organisations in each country (often all of them having a similar aim), that is slowing down delivering the stuff that matters. This has certainly been mySociety’s experience when sharing our tools internationally.
And the solution we propose — the Democratic Commons — is that if people and communities worked together to find and maintain this data, it would be better for everyone… ah the paradox of simplicity.
Update on efforts to support the Democratic Commons concept
With each interaction and conversation that we’ve had about the Democratic Commons with partners, we’ve continued to learn about the best role for us to play. Here are some initial actions and thoughts that are shaping the work; please feel free to comment, or even better, get involved 🙂
Making sure the concept is a good fit through user research
We have set a goal to carry out user research on the concept of the Democratic Commons. So far, we have lined up calls with campaign staff (who are interested in using and supporting open political data through their UK campaigning work) and journalists in Nigeria (who have expressed a need for the data) and I am lining up more calls — if you have a need for or can contribute political data, let’s talk.
Bringing the Open Data/Civic Tech and Wiki communities together?
From my experience to date, the Civic Tech and Wiki communities appear to operate quite separately (I am very open to being proved wrong on this point!).
I am just getting started within the Wikidata/ Wikimedia communities (that’s more for Kelly) but on the Open Data/ Civic tech side, there are questions about data vandalism and the potential to trust the data from Wikidata, arguments on the benefit of using Wikidata (especially where you already have a lot of useful data) and on whether there is a need to invest time in learning SPARQL, the query language that allows faster retrieval and manipulation of data from databases.
Misconceptions are not unusual in communities online or offline, but it is a gap that our work focus, communications and tools hope to help close. If you have ideas on blogs, video tutorials or articles to share to read around these concepts, please get in touch.
Working openly in existing global communities (off Wiki)
We are aware that, off-Wiki mySociety is leading the work to develop the Democratic Commons, however, we know that we need to be delivering this work in the open for it to be owned by other people outside of mySociety, and finding the right homes to talk about it (off Wiki) has been important. In order to work openly, we have a shared #DemocraticCommons Slack channel with mySociety and Code for All; see ‘Get involved’ below to find out how to join the conversation.
We also plan to document the learning involved in the process through blog posts and documentation, to be uploaded publicly.
And, supporting local communities to develop, where possible
A global network such as Code for All is very useful in supporting a concept like the Democratic Commons, however, the bulk of need for the data will likely be country-specific. Together with our partners and collaborators, we are exploring what is needed and how to support local communities:
- Through the remainder of our Wikimedia Foundation Grant, we are supporting community events and editathons: in Lebanon with SMEX, in France with newly formed organisation F0rk, and in Spain with Wikimedia España.
- Some groups we are working with, such as Code for Pakistan, plan to set up a channel on their Slack instance and use their Whatsapp community to discuss the data use and maintenance.
- In my own country, the UK, we are talking to mySociety’s community and collaborators to understand how the Democratic Commons could benefit organisations and work in practice here. If you want to be involved in this work, please contact me.
- We are listening to understand what support is needed with collaborators in the global South, as we’re well aware that it is a lot to ask people to work on a voluntary basis and that adequate support is needed. I hope we can share the learning and use it to shape any future projects that may emerge.
How to get involved in the Democratic Commons?
- Contribute to the Wikidata community: If you are Wikidata user, or keen to learn, visit the Wikidata project page on political data. If you need guidance on tasks, do feel free to add to the Talk page to ask the community, or get in touch with Kelly, our Wikimedia Community Liaison: email@example.com.
- Join the conversation on Code for All Slack: If you would like to join the Slack conversation, join here: https://codeforall.org/ (scroll down and find the ‘Chat with us’ button).
- Look for electoral boundary data: We are working with Open Knowledge to find electoral boundary data for the whole world. See more about that here.
- Keep up to date and subscribe to our Medium blog: Sometimes these Democratic Commons posts are a bit too in-depth for the general mySociety readership, so for those who are really interested, we plan to share all we are learning here.
- Share the concept with contacts: Please share the message on your platforms and encourage potential users to take part in research and get involved. We recognise that our view — and reach — can only be anglo-centric, and we’d so appreciate any translations you might be able to contribute.
- Tell us (and others) how you think you would use the data: This can’t just be about collecting data; it’s about it being used in a way that benefits us all. How would the Democratic Commons help your community? We would love people to share any ideas, data visualisations, or theories, ideally in an open medium such as blog posts. Please connect with Georgie to share.
- Something missing from this list? Tell us! We’re @mySociety on Twitter or you can email firstname.lastname@example.org or email@example.com .
Image: Toa Heftiba
March 3 is Open Data Day, and groups all around the world will be using Open Data in their communities, to show its benefits and to encourage the sharing of more data from government, business and civil society.
Obviously, that depends on their having some good-quality data to work with — and we’d like to help make that happen. Or, more accurately, help you to help make that happen.
Just as with Global Legislative Open Week last October, we’ll support groups who would like to run a workshop, getting together with other like-minded people to improve the open political data available for your country in Wikidata.
Funding and support available
Thanks to the Wikimedia Foundation, we’re able to offer some support to individuals/groups who are interested in running Wikidata workshops during February. If you’d like to hold an event like this, it’s pretty simple: all you need is a space, and someone with some existing Wikidata skills who can show others how to add or improve data. Then you just have to pick a date, and put out the word for people to join you.
We can help with a few things, so let us know once you’ve decided to take part, and we’ll chat with you about what might be useful. Here’s what we can offer:
- A small amount of funding to help cover event costs
- In-person support during your event – we may be able to send one of our EveryPolitician/Wikidata team to your event to present, participate and advise
- A review of your country’s existing political information in Wikidata and some pointers about possible next steps
- Ideas for how you and your attendees can:
a) Use the data for interesting research and projects, and
b) Improve the data for future research queries/projects
Workshops can take place at any time until the end of February.
So, if you’d like to be part of this push to improve and use political information in Wikidata in order to contribute to the Democratic Commons, we’d be thrilled to hear from you. Please do get in touch: firstname.lastname@example.org
If you know your way around Wikidata, we’d love you to join in with the global string of events taking place for GLOW next week.
We’re very keen to get as many people as possible helping to improve the quality of Wikidata’s information on politicians. Why? Well, let’s take a quick look at a recent story that hit the news.
A new Bundestag
With Germany’s new parliament gathering for the first time on October 24, der Spiegel took the opportunity to examine their male-to-female balance, in the context of legislatures across the world. At around 31% female, they noted, the Bundestag now sits at the better end of the scale: parliaments almost everywhere are male-dominated.
How were they able to make such an assessment? As they note at the foot of their article, they used data on politicians’ gender from our EveryPolitician project.
A further exploration looked at age — they discovered that on average their parliamentarians were very slightly younger than in previous years — and they note as an aside that here in the UK, we have in Dennis Skinner the oldest MP in Europe, while Mhairi Black is the second-youngest by a whisker.
These are the kind of insights we seek to increase through our work with Wikidata as we help to boost the quality of their politician data: we consider such analysis not only interesting, but important. Whether or not countries wish to encourage fair representation across age groups and gender — not to mention many other categories — their decisions should at least be based on facts.
As things stand, there are only a handful of countries where data is good enough to be able to make such comparisons: in our vision, journalists, researchers — and anyone else — will be able to turn to Wikidata to find what they need. The forthcoming Global Legislative Openness Week (GLOW) gives us all an opportunity to put a rocket under the quality and quantity of data that’s available to people making analyses like these, that stand to benefit us all.
How to get involved
GLOW runs from next Monday until the 30th November, and we’re encouraging people — wherever you live in the world — to get together and improve the data on national-level politicians for your country.
We’re already expecting a good number of groups to run events. Get-togethers are confirmed in Slovenia, Bulgaria, Italy, Greece, Spain and more — once final details are firmed up, we think there’ll be action in other countries across the globe. Now how about you? As we said in our post last month, a concentrated effort from a small group of people can really make a difference.
We’re especially keen to encourage folk who have some experience of contributing to Wikidata: we reckon that, for this particular drive, you need to already know your way around a bit. So if that’s you, do come forward!
Start by having a look at this page, which outlines what we hope to achieve; we’ll be adding more detail this week too. You can add your country to the list if you’d like to, or explore what’s missing in the data of those countries already listed.
Or, if Wikidata’s all new to you, why not put out some feelers and see if there’s anyone who can show you the ropes while you work together? One good way is to see if there’s a Wikimedia User Group local to you.
What exactly will you be doing?
Here’s a bit more detail on what a workshop will look like.
The idea is to improve information in Wikidata about members of your country’s legislature. The ‘Progress Indicators’ on this page will give you guidance: typically you’ll be working through tasks like adding any missing “position held” statements and biographical data. We’re asking folk to prioritise current politicians, with information for historic members an added bonus if time permits.
Once sufficient data is available in Wikidata, the real fun begins! Your workshop attendees will be able to query the data to answer questions such as:
1) Can the gender breakdown and average age of members of the current legislature be calculated?
2) Can that be broken down per political party/group, or (where appropriate) by region?
3) Can you compare those figures for the legislature vs. the cabinet?
4) How far back can you generate those for?
And if the ideas start to flow, building queries and visualisations to answer other questions will also be very useful.
Let us know if you have any questions before the week begins — we’re going to be very busy during GLOW, but we’ll do our absolute best to help.
Image: Alex Iby (Unsplash)
Global Legislative Openness Week (GLOW) celebrates open, participatory legislative processes across the globe.
Back in 2015 we marked GLOW by setting up a challenge: could we get politician data for 200 countries up on EveryPolitician within the week, with the help of the global community? The answer was a resounding yes, and the challenge was a massive success. We ended up with data for 201 countries in the end, thanks to help from awesome people from all over the world.
This year, we’re running another challenge: to get as many Wikidata workshops focusing on political data to happen during GLOW as possible.
Fancy helping with this challenge? Read on…
This is all part of our Wikidata/EveryPolitician project.
The project aims to improve political data in Wikidata, so that it can be used more easily for projects, research or investigations that hold politicians to account. Examples of where good political data is vital include in parliament-tracking websites (like in Zimbabwe, Nigeria and Ukraine) and in cross-border journalistic investigations (like the Panama Papers).
Providing this data in consistent and structured formats across countries means the people running these accountability projects spend less of their time gathering the data and more on actually using and interpreting it, to keep tabs on those in power. This project is all part of our mission to contribute to the Democratic Commons.
One of the best ways to improve and use political data on Wikidata is to get people together in person to work on their country’s data. So, that’s the aim with this latest GLOW challenge, and we’d love for as many groups around the world to host Wikidata workshops as possible!
The aim of these Wikidata workshops is to:
- Improve political information in Wikidata so that developers, researchers and journalists (or anyone!) can use the data in their investigations and accountability projects.
- Use and query existing political data in Wikidata to see what interesting questions can be answered when data is available in consistent and structured formats.
Workshop attendees will go away with:
- Increased knowledge of how Wikidata works and how to contribute to it
- A better understanding of why good political data is so vital and how it can be used
- New connections to the global community of people who care about accountability issues
- A warm, fuzzy feeling of satisfaction that they’ve helped with the global accountability movement 😉 (We hope so anyway!)
Not sure what such an event might look like? Read up on our recent Wikifying Westminster workshop: it really showed us how much can be done when a few people get together in a room.
Funding and support available
Thanks to the Wikimedia Foundation, we’re able to offer some support to individuals/groups who are interested in running Wikidata workshops like these during GLOW. This will differ on a case by case basis but includes:
- A small amount of funding to help cover event costs
- A review of your country’s existing political information in Wikidata and some pointers about possible next steps
- Ideas for how you and your attendees can:
a) Use the data for interesting research and projects, and
b) Improve the data for future research queries/projects
- In-person support during your event – if you’d like one of our EveryPolicitian/Wikidata team to come to your event to present and participate, we can do this (if our budget allows!)
- Access to a dedicated Slack channel which connects you with other groups around the world who are also running events during GLOW.
Workshops can take place at any time within GLOW week, which is from 20th-30th November 2017 (yes, that is a long week!).
So, if you’d like to be part of this global challenge to improve and use political information in Wikidata, we’d be thrilled to hear from you. Please do get in touch: email@example.com
Last Saturday (August 19th) at Newspeak House in London, mySociety and Wikimedia UK held the “Wikifying Westminster” workshop, a day-long event to encourage people to get involved with Wikidata, but also to give a taste of what people can build with the data that is already there.
The vision: one day, complex investigations which currently take researchers a lot of time, such as “how many MPs are descended from people who were also MPs” or “how many people named X were MPs in year Y”, will be answerable with data from Wikidata using a single SPARQL query…
…but we’re not quite there yet. Currently, some data is scattered all over separate databases (which sometimes get shut down or disappear); some is just plain missing; and most frustrating of all, some is in place but there’s no apparent way to get it out of the database.
In order to make this vision a reality, we need to experiment with the data, find ways to check how complete it is, and explore what questions we can currently answer with it. Events like Wikifying Westminster are the perfect opportunity to do just that.
After a brief introduction to Wikidata and the EveryPolitician project, we split into two groups: one focused on learning how to use Wikidata, while the other focused on working on mini-projects.
Here’s a taste of what happened…
The learning track began by introducing new users to the basic Wikidata editing principles (or “getting data into Wikidata”). Participants were able to put their new skills into action immediately, by adding missing data on British MPs, who were mostly lacking dates and places of birth.
By the end of the first session, good progress had been made, particularly on obtaining dates of birth for current British parliamentarians. For some reason, though, it proved much harder to find these for women than for men: we can only speculate as to why that might be (do some still adhere to the idea that a woman shouldn’t reveal her age?!).
We were also given an introduction to SPARQL, a language used to query information on databases (or “getting data out of Wikidata”). Lucas Werkmeister introduced the Wikidata query service and explained a few tricks to help with using it. Participants were later able to put this to the test by running progressively difficult test queries such as “All current UK MPs” or “Who is the youngest current MP?”
Also, Navino Evans showed us the potential of reusing data, talking about Histropedia, which he co-created with Sean McBirnie. Histropedia is an awesome tool that lets you visualise thousands of topics on interactive timelines: you can browse through existing ones or create a new one from scratch.
This group both worked on improving data and looked at how well we could answer some simple “stepping stone” queries (i.e. small questions to which we already knew some of the answers) as a heuristic of how good the data in Wikidata already is. You can see and contribute questions to the list of test queries here.
Some more details:
Improving data. The focus here was on the Northern Ireland Assembly, for which Wikidata now has full membership history back to the foundation of the Assembly, and on adding academic degrees of cabinet ministers. Starting from an excellent spreadsheet of the undergraduate universities and subjects of UK politicians and ministers (going back to John Major’s cabinets), we tried to upload that data on the relevant items, adding the qualifier “academic major” (P812) to the property “educated at” (P69). In this case, the key problem we found was that we weren’t sure how to model when people did joint subjects, like “Maths and Politics”, convincing us to concentrate on the more obvious subjects first.
Answering some unusual and/or intriguing questions. Inspired by a prior finding that there are more FTSE 100 CEOs named John than there are female ones, and that John is historically the most common name of UK parliamentarians, we thought we’d find out when exactly the John-to-female balance was toppled amongst the UK’s MPs (hint: not until 1992).
Going back further in history, we queried the first time each given name was recorded in Parliament, this was inspired by a recent news article about an MP who claimed he was the first “Darren” in the Commons.
Some ideas were also born that we weren’t able to see through, for various reasons. For example, could we discover which, if any, MPs are descended from people listed in the UCL’s ‘Legacy of British Slave-owners’ database? An interesting question, but at the moment, the answer is ‘no’, partly because child-parent relationships are currently inconsistently modelled in Wikidata, and partly because of the nature of Wikidata and ancestry: if there is someone who doesn’t exist in Wikidata (e.g. Grandad Bob, the painter) in the family chain, Wikidata can’t bridge the gap between a present day MP and the slave owner who might be their ancestor.
This is just the beginning
Work, of course, is still ongoing: all pre-1997 UK data is still to be inserted or improved on Wikidata, and so much more is missing – family connections, academic degrees, links to other databases, and all sorts of “unusual stuff” that can be used for interesting queries.
This data is crucial if we want to be able to answer the really big questions which Wikidata should one day be capable of helping us explore, about what politicians do.
We can do that together!
We hope that events like this give people an easy way in to Wikidata and also show them what’s already possible to achieve with the data. Over the coming months, we are hoping to support more events of this type around the world. If you are interested in getting involved, here’s how:
- Want to improve your country’s data? Events like this can be a great way to help kickstart activities and find other people who share your goals. We are happy to help out and support people in other countries to do so.
- Are you already organising or planning to organise a similar workshop around Wikidata? Make sure it is listed on the Wikidata Event page!
- Do you want to attend future workshops? Follow us on Twitter to stay updated about events that we are running, and ones that other people are too!
We’re also always looking for feedback and suggestions on workshop and event formats that might also work. Have you already run similar workshops? Let us know your impressions and suggestions on firstname.lastname@example.org!
Feature image credits: Mark Longair
This post is by Tony Bowden and Lucy Chambers from the EveryPolitician team. Today we officially launch our collaboration with Wikidata – here’s what to expect…
The story so far
You might have been following the progress: since 2015, through our project EveryPolitician, we’ve been gathering data for every national legislature in the world, from thousands of sources, and sharing it.
Now, two years on, we’ve started to see some great results. For example:
- It’s much easier to build simple Parliament Tracking sites (as mySociety partner organisations have in Zimbabwe and Nigeria). Those running the sites can work on providing information and context to hold politicians accountable — and don’t have to worry about wrangling data and software.
- Tools that allow citizens to write to their representatives (like Majlis Nameh in Iran, based on the WriteInPublic software, or Oxfam’s UK and Australia Campaigning Tool) can now be deployed in days rather than weeks, allowing groups to focus on local customisation.
- Vote-tracking sites like TheyVoteForYou in Australia can be more easily adapted to other countries — such as Ukraine, again without worrying about the major task of sourcing the politician data.
- Projects that highlight politicians’ activity can be augmented to show extra information against those politicians — so for example, Politwoops can show party affiliation next to politicians’ tweets.
- Investigative journalists can cross-match our lists of politicians to other sources (e.g. investigating shell companies for a recent Private Eye article or with the Panama Papers).
- mySociety’s collaboration with Facebook makes it easy for people to connect with their newly elected representatives.
All of this is possible because of two main tenets of EveryPolitician: having gathered the data, we structure it consistently, and we share it freely.
Not just current data
We’ve also discovered that there is a huge value for everyone in retaining historic information, too:
- Old data tends to disappear from many official sites, for example, when a new government comes into power and / or when parliaments decide to remodel their websites.
- Sometimes even the official sources no longer exist: for example, in Burkina Faso the Parliament building was burned down by protesters in 2014 and many crucial documents were lost.
- Research projects and sites run by civil society organisations sometimes run out of funding and have to shut down — meaning that the data people were relying on can vanish overnight.
The future: going one step further
So that’s all great, but we think EveryPolitician can do still more to help the worldwide community of Civic Tech coders and activists.
In particular, we want to go beyond data showing who the politicians are, and also provide information on what they do. That’s because we can see real value in the ability to answer questions like:
- When we look at politicians who vote on issues such as gay marriage, smoking bans, tax on sugary drinks, etc, anywhere in the world — are there any broad correlations like age or gender?
- As countries elect more women MPs, do those women also gain equivalent representation in committees? Does this affect attendance and participation rates?
- Are there standard political career paths that can be observed anywhere in the world? So for example, do certain Cabinet positions limit future progression; and if so, would it be feasible to spot politicians who are on the way up, or on their way out?
- Do politicians who move from the lower to upper house act, and vote, differently?
- Do politicians change their voting activity after certain types of intervention, for example after receiving funding from oil companies?
… and there are undoubtedly millions more questions, each one just as interesting and with answers that could enrich our understanding of the world. We are aiming for a future in which each should be answerable within minutes, rather than form the basis of a multi-year post-graduate research project.
How we plan to get there
You may remember our recent proposal to integrate more deeply with Wikidata. We’re delighted to say that our proposal was accepted, and that makes EveryPolitician’s path very clear.
Wikipedia is fast becoming one of the best sources of political information in many countries: it’s often updated more quickly than major news outlets or official parliamentary sites are.
Which is great, but there’s an issue when it comes to using that information for projects like the ones we’ve mentioned above: Wikipedia contains largely unstructured information (that is to say, information that comes in a wide variety of different formats) — as you’d expect from any project with multiple contributors and, often, a free-text input.
There are also a lot of differences between Wikipedias in different countries. Some countries’ data (particularly countries where the Wikipedia community is larger) gets updated very quickly. Smaller language Wikipedias can’t rely on such a large pool of editors, and it tends to be longer before they are updated.
Additionally, and as you might expect, Wikipedia content will appear fastest in the countries most directly affected by the change being documented, so for example when there are elections in Estonia, the Estonian Wikipedia may show the results almost immediately, but it can take a while for those changes to trickle into all languages.
But of course, nothing’s ever quite that simple. While Wikipedia has a wealth of unstructured political information, on Wikidata there’s still an awful lot of data missing. You may recall that we recently ran a drive to ensure that every country at least had its head of government entered, but that’s just the beginning: in order to answer the kinds of questions we mention above, we really need to ensure there’s consistently-structured information for all legislators, all elections… and much more.
What you’ll see in the coming months
While we plan for EveryPolitician to retain its own identity and keep its own front end, we’re excited to say that over the next few months, we’ll be teaming up with Wikidata communities across the world.
Our first objective is see if we can bring Wikidata up to the same level as EveryPolitician for as many countries as possible.
And once we hit that target, we plan to go much further. The beauty of Wikidata is that you can add pretty much any information you want to add to politicians (indeed, to anything!), so local communities can decide for themselves which information is most pertinent and make sure that it’s included.
To help us make all of this happen, we’re expanding. If you’ve read this far and you’re still finding the project interesting, the chances are you’d be a great addition to the team! We’re looking for a new community co-ordinator, working from anywhere in the world (compatible timezones allowing): you can see all the details here.
How you can get involved
There are plenty of ways that you can help with this drive to improve the structured information on Wikidata. Here are the obvious ones:
- There are active missions always ongoing on the Wikiproject Heads of State and Government. There’s one there now – check them out!
- Would you be interested in organising a push to Improve Wikidata for your country? Get in touch on email@example.com and we’ll do what we can to help.
- If this all sounds interesting, but you’re not sure where to start, or you’re unfamiliar with Wikidata – drop us an email and tell us what you’re interested in. We’re more than happy to help you get started.
Last you heard from from the EveryPolitician team we were talking about integrating EveryPolitician more deeply with Wikidata.
Here’s a quick update on what to expect in the coming months and a note on how you can get involved – we are going to need a lot of help!
Before we can make headway on a deeper integration, there are some pretty foundational pieces of the puzzle we need to put in place.
Some key data is missing, and you may be surprised to discover that it is pretty basic stuff. Take a look at this for example:
This is from a report we have generated to highlight gaps in the links between items. Notice how not even all of the offices have been filled in on the country page, let alone who occupies those offices?
This is the type of foundational data we are hoping to get in place over the next little while, and you can help…
How you can get involved
Over the coming weeks we’re going to be conducting a few experiments in public and trying to get data into Wikidata. We’ll need as much help as we can get!
The experiments will focus on using Wikidata to attempt to answer some questions we find interesting and see how we can expose gaps and inconsistencies in the data. In doing this, we’ll be pointing to specific reports we have generated and asking you to help us fill in the gaps.
Our first question will be:
“What is the gender breakdown of heads of government across the world?”
We’ll be blogging about the investigation over on Medium. The first post is already there:
Keen to dive straight in? Help us fill in the blanks you see in this report! If you are familiar with Wikidata, you will probably be able to get started straight away, but there are also tips and pointers in the Medium post if you want a bit more guidance.
You can get notified of the challenges to complete the data as soon as they are published by following the EveryPolitician bot on Twitter.
Want even more details on our plans?
We should also mention that we’ve updated our proposal to the Wikimedia Foundation to make a couple of things clearer about the problem we are trying to solve and our proposed solution (the proposal is still being reviewed). If you are interested in the nitty-gritty, that’s the best place to get the full overview of what we are planning.