Some rather random thoughts about open data, open source, open access in archaeology from a recent interview for OpenScience …
There’s a lot of debate in the wider world about digital data – issues of access and privacy, the case of Aaron Swartz and open access to knowledge, the Ed Snowden revelations, and, at the personal level, the way that we all leave data trails behind as we traverse the Internet. Surrendering our personal data is difficult to avoid, even if we forswear Facebook, Google, and their like who build their business models on their ability to capture data about us.
In a recent paper, Richard Mortier et al. (2015) argue that this new world of data requires a new kind of study of human-data interaction, looking at the implications of the data we generate in all kinds of different ways, knowingly or unknowingly.
There’s an interesting project run out of Durham and Newcastle Universities by Bob Simpson and Robin Humphrey, Writing Across Boundaries, which started off as a series of workshops to look at challenges faced by researchers writing a thesis employing qualitative data but has broadened out somewhat thereafter. Simpson and Humphrey suggest that in recent years
“there has been acceleration in the way researchers progress from one kind of writing to another – doctoral thesis to articles to monograph to more accessible forms of dissemination. The pressure to do in a couple of years what an earlier generation might have done in a couple of decades has a variety of external drivers. These mostly come down to funding and the competition for scarce resources on the one hand, and the demonstration of public accountability on the other.”
As increasing amounts of archaeological data become available online, they frequently arrive without an accompanying awareness of context. Far from being a problem, this is often seen as an advantage in relation to ‘big data’ – indeed, Chris Anderson has claimed that context can be established later, once statistical algorithms have found correlations in large datasets that might not otherwise be revealed.
In relation to the Portable Antiquities Scheme (PAS) database, David Gill has asked on his ‘Looting Matters’ blog: “How far can we trust the information supplied with the reported objects? Are these largely reported or ‘said to be’ findspots?”
Spatial information is frequently cited as a problem in relation to open archaeological data – but the focus tends to be on the risks it poses for looting (for example, Bevan 2012, 7-8; Kansa 2012, 508-9).