• Encoding a "box" of code is not simply a technical matter. It has social dynamics, including influencing how people make sense of culture and texts. Sometimes the technical and the social are at odds.
• Still, the shift from syntagm-focused print text to paradigm-oriented digital text need not imply "dumbing down" a text (e.g., all links go to denotations in the dictionary) or determining how a reader interprets it. (See Module 1 on curious relationships. Here, it might be productive to think of Stein's style through "boundary" or "hybrid" objects, like the Scythian Lamb. Often, the words in her poetry fit, quite purposefully, in multiple categories simultaneously. Her writing's wonderfully monstrous.)

What Now?: Applications
• Select a specific paradigm for reading "A BOX"—a rule for reading, if you will. This will be your generative constraint for encoding your interpretation into the poem. For an example, let's look at William Gass's etymology cluster for the poem.
• Now let's review an example of a page written in XHTML. Note how the text is written in nested "boxes." Again, the boxes, once opened, must be closed. (A minimal sketch of such a page appears after this list.)
• And let's review an example page in CSS. Note how CSS stylizes the boxes written in XHTML.
• In Notepad, practice encoding "A BOX" in XHTML and CSS (in two separate files) for the web. In the XHTML, include, at a minimum, the <html>, <head>, <title>, and <body> tags and elements. In your CSS, stylize the XHTML <body> and at least one other element.
• After your encoding, in the XHTML file, please write a sentence or two explaining what your generative constraint for encoding was.
• How did encoding the text influence your interpretation of it? How did that interpretation manifest in the encoding? How would your encoding influence how a reader interprets the poem?
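Here is that minimal sketch of the two files. Everything specific to it—the file names (box.html and box.css), the class name, and the styling choices—is an illustrative assumption rather than a requirement, and the poem's opening words stand in for whatever passage you encode.

A possible box.html:

<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN"
  "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">
<html xmlns="http://www.w3.org/1999/xhtml">
  <head>
    <!-- The head box opens, holds its contents, and closes. -->
    <title>A BOX, encoded</title>
    <link rel="stylesheet" type="text/css" href="box.css" />
  </head>
  <body>
    <!-- Boxes nest: body sits inside html, and each p sits inside body. -->
    <p class="redness">Out of kindness comes redness . . .</p>
  </body>
</html>

A possible box.css:

/* The CSS stylizes the boxes written in the XHTML. */
body { background-color: #ffffff; font-family: serif; }
p.redness { color: red; } /* one way of making a generative constraint visible */

Note that every opened box is closed, and that the CSS never redraws the boxes; it only stylizes what the XHTML has already nested.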
What's Next?: Modules Ahead
• Collecting Idea Pockets, Do You Believe in Angels?

What to Consider during Future Modules
• For a module in the near future, you'll start thinking about refashioning a print-based project you've already started. How might paradigms and syntagms play a role in this refashioning?

Stillman, in Paul Auster's City of Glass:
My brilliant stroke has been to confine myself to physical things, to the immediate and tangible. My motives are lofty, but my work now takes place in the realm of the everyday. That's why I'm so often misunderstood. But no matter. I've learned to shrug these things off. . . . You see, I am in the process of inventing a new language.

Learning Outcomes for the Module
• Understand how new media can be integrated into collecting information for, and collaborating in, digital humanities research projects.
• Practice some basics in WordPress and Google Books, Maps, & Reader.

The Paris arcades (iron and glass structures popular in the 1820s and 1830s) are, according to Walter Benjamin (1892–1940) in The Arcades Project:
• "a center of commerce in luxury items" (3)
• "a world in miniature" (Illustrated Guide to Paris, qtd. in the text, 3)
• "buildings that serve transitory purposes" (4)
The collector and collecting play prominent roles in the arcades.

Benjamin on collecting:
• "What is decisive in collecting is that the object is detached from all its original functions in order to enter into the closest conceivable relation to things of the same kind" (204).
• "Collecting is a form of practical memory, and of all the profane manifestations of 'nearness' it is the most binding" (205).
• "The true method of making things present is to represent them in our space (not to represent ourselves in their space)" (206).
• "The collector dreams his way not only into a distant or bygone world but also into a better one—one in which, to be sure, human beings are not better provided with what they need than in the everyday world, but in which things are freed from the drudgery of being useful" (9).

For The Arcades Project, Benjamin's method is collecting: snippets of writing put into juxtaposition, pockets of ideas that are contrived. (See Module 1 on contrivances, hybrid objects, and practicality, as well as Module 2 on association blocks and paradigms.) This method corresponds with the form of Benjamin's book (see the hard copy), not to mention his research practices. In a way, Benjamin gave theory a new language, with his dictionary of collections.

Implications for blogging and digital humanities research projects in this class
• Research as Wunderkammer-making (see Module 1)
• Relevance of the everyday to academic research and new media
• The habit of documenting work (archive it now, arrange it later, delete nothing)
• Articulating thoughts through paradigms first, then organizing the syntagms (e.g., compiling things before making a claim ("X causes Y"), rather than making a claim and finding the evidence to "fill it in" or support it) (see Module 2 on paradigms and syntagms)
• Embracing a type of experimentation in your academic work—as you collect, being open to change, flexibility, and failure and avoiding the "theory hammer," where everything in sight becomes a nail
• Class blog: a collaborative collection of microcontent in a networked space, which offers juxtapositions across our individual collections
• Conjecturing (per Willard McCarty in Humanities Computing): "a collecting or throwing together of particulars" in an attempt to make sense of them (47)

What Now?: Applications
• Log in to the class blog. (I'll give you your username and password.)
• Post your first entry, categorized under "introductions" and tagged as you find appropriate. Before you publish it:
o Introduce yourself to the class in whatever way you wish.
o Provide a link to your XHTML and CSS exercise (which should be at students.washington.edu/[yourUWnetID]/chid498/).
o Include an image of the book or text you encoded in your exercise. (If you can't find one, then tell me. We'll think of something relevant.)
Of note, all images on the blog must be 400 pixels or less in width. You can always use a program to shrink them accordingly.
• When you are finished, I will also show you how to post a video. Of note, all videos on the blog must be 200 pixels or less in width.
• Now log in to the class Google account ("mappingthedigitalhumanities"):
o Note how a majority of our online class content is aggregated at iGoogle. Peruse it to see what's there.
o In Google Books, add a book that you'll likely be using this quarter or that you think is relevant to the class.
o In Google Maps, add something (e.g., a comment, an image, or a video) to the class map. We'll also have to decide by what standards we'll be collaborating to map the campus this quarter.
o Time permitting, in Google Reader, add a relevant snippet from the web.
• How is each of these a form of collecting? Of research? Of everyday life?
• How is each of these a form of collaboration? And what kind of collaboration, exactly? Consider other ways you've collaborated that might differ from what we're doing here.

What's Next?: Modules Ahead
• Do You Believe in Angels?, Oh How Reductive

What to Consider during Future Modules
• When you have so much to collect for a given research project, how do you refine your options? Data, but how to gather it?

From Steve Tomasula's The Book of Portraiture:
The unexamined life is not worth living —Socrates, and www.homecams.com, the site that lets you see inside 1,024 private homes….

Learning Outcomes for the Module
• Explore media differences between print and digital texts and the implications of these differences for remediation and intermediation projects.
• Examine the distinctions between "remediation" and "intermediation" through some examples.

Let's take a look at an animation of the first newsreel from John Dos Passos's The 42nd Parallel in tandem with a digitized version of it and its print version. Now, let's unpack the relations between these three "versions" of the text through two terms: remediation and intermediation.

Per Jay David Bolter and Richard Grusin, remediation is
• "the representation of one medium in another" (45)
• nearly synonymous with "'repurposing': to take a 'property' from one medium and reuse it in another" (45)

Per N. Katherine Hayles, intermediation
• names the "complex transactions between bodies and texts as well as between different forms of media" (7)
• includes "interactions between systems of representations, particularly language and code, as well as interactions between modes of representation, particularly analog and digital" (33)
• "denotes mediating interfaces connecting humans with the intelligent machines that are our collaborators in making, storing, and transmitting informational processes and objects" (33)

How do the two terms offer different readings of our three versions of Dos Passos? Consider what they emphasize (e.g., "medium," "representation," "bodies," and "collaborators"). To help us along, we might consider what Hayles, in a different text, says are the characteristics of computer-mediated text.
It
• is "layered" (e.g., a layer of text on a screen and a code layer) (163)
• "tends to be multimodal" (e.g., including "text, images, video, and sound") (164)
• exists such that "storage is separate from performance" (e.g., store files on a server in Seattle, read them in Santiago) (164)
• "manifests fractured temporality" (e.g., the reader does not control "how quickly the text becomes readable") (164)

Implications for Your Digital Humanities Project
When thinking of "remediating" or "intermediating" print, the characteristics of computer-mediated text should factor into what remediation or intermediation will afford—how either invites or pressures certain readings and engagements. (See Module 1 on curious relationships and Module 3 on the class blog as a collection.)

What Now?: Applications
• Check out Marsha's Throne Angels! As a parody of old-school, low-tech personal web pages, what media does it remediate? How does it achieve humor in this remediation?
• In the above line, what happens to our interpretations when we revise "remediating" and "remediation" to "intermediating" and "intermediation"?

What's Next?: Modules Ahead
• Oh How Reductive, Making Swervy Things

What to Consider during Future Modules
• How might these angels, not to mention these distinctions between intermediation and remediation, inform your project? Which of the two terms do you prefer? Why?

From Marianne Moore's "The Student":
"When will your experiment be finished?" "Science is never finished."
And from her "People's Surroundings":
there is something attractive about a mind that moves in a straight line—

Learning Outcomes for the Module
• Explore the implications of "reduction" and classification in digital humanities research.
• Consider ways you might use specific data elements to methodically reduce the primary text(s) in your research project.

Franco Moretti is a cartographer of sorts. He makes literary maps, with a science. In Graphs, Maps, Trees, he writes: "What do literary maps do . . . First, they are a good way to prepare a text for analysis. You choose a unit—walks, lawsuits, luxury goods, whatever—find its occurrences, place them in space . . . or in other words: you reduce the text to a few elements, and abstract them from the narrative flow, and construct a new, artificial object . . .
And with a little luck, these maps will be more than the sum of their parts: they will possess 'emerging' qualities, which were not visible at the lower level" (53). Literary maps also afford what Moretti calls a "distant reading," "where distance is however not an obstacle, but a specific form of knowledge: fewer elements, hence a sharper sense of their overall interconnection. Shapes, relations, structures. Forms. Models" (1).

To flesh out "distant reading," let's look at a couple of examples (1 and 2) from Moretti's Atlas of the European Novel: 1800–1900. What's mapped? What's not?

One catch: how to avoid assuming that a distant reading fully accounts for its territory. Alfred North Whitehead called this slippage "misplaced concreteness." Abstractions such as maps—in their richness and utility—are used to explain the territory. They come to be treated as objectifying media that always generate reliable results (e.g., facts from maps) or uniform products (e.g., the same houses from a single blueprint). As Matthew Fuller observes: "The ruse of concrete misplacedness, of an ideally isolatable element, produces its offspring—but they are unruly" (104). Frankenstein's creature animates this very unruliness (e.g., the uncontrollable monster of science), as does Stein's poetry (e.g., "a rose is a rose is a rose," where the definition of a rose is historically and culturally dependent). (See Module 2.) So does the still image of Astaire's unruly movement; he looks positioned in the still shot, but photography needn't give us the illusion that this event is isolatable and easily repeated. (I certainly couldn't pull it off.) Consider, too, syntagms from Module 2. This shot of Astaire is in a sequence of shots. What comes before and after is crucial.

Abstraction here is not what Ezra Pound means when he writes (in Poetry, 1913), "Go in fear of abstractions." Pound's on a different register. For him, the idea is to avoid writing in imprecise language what someone else already wrote precisely. Treat the thing directly. Use the exact word. For Whitehead and Moretti, abstractions are quite useful for collecting elements and showing their relations. It is when they are understood as the causes that produce homogeneous territories that misplaced concreteness occurs. (Consider, too, Nietzsche on how the cause is generated after the effect.)
Implications of the reductive method for your digital humanities project:
• Textual/literary maps are not only geographical maps. Think broadly about how to map the space of your text(s) (e.g., places in a novel, recurrence of concepts in a poem, publication dates in a genre/corpus).
• Novel questions, complex issues, and creativity can emerge from reduction and classification. (Consider Oulipo!) In fact, reduction and classification can help generate interpretations you may have never considered. Moretti writes, "I had found a problem for which I had absolutely no solution. And problems without a solution are exactly what we need . . . we are used to asking only those questions for which we already have an answer" (26).
• Reduction is a practical way of narrowing rich research projects, of keeping them simple. It forces you not only to isolate elements of the text, but also to articulate how you isolated them and how you are assessing/quantifying them.
• Distant reading runs contrary (in some ways) to "close reading" in the humanities. Keep this in mind. How will some audiences object to the distant reading you're conducting?

What Now?: Applications
• In your clusters, work together so that each student selects three data elements that reduce the primary text(s) of her/his project. These elements would ostensibly lead to a textual mapping.
• On the blog, list your three elements and address three things about each: (1) what kind of interpretation would it afford? (2) what of importance might it ignore?
(3) how does it relate to—or join—the other two elements?

What's Next?: Modules Ahead
• Making Swervy Things, Mapping in Stakes

What to Consider during Future Modules
• How does the kind of map you ultimately produce influence your choice of data elements, and vice versa?

From Donna Haraway's Modest_Witness@Second_Millennium.FemaleMan_Meets_OncoMouse: Feminism and Technoscience:
In Greek, trópos is a turn or a swerve; tropes mark the nonliteral quality of being and language. Metaphors are tropes, but there are many more kinds of swerves in language and in worlds. Models, whether conceptual or physical, are tropes in the sense of instruments built to be engaged, inhabited, lived.

Learning Outcomes for the Module
• Consider the implications of modeling for humanities research through examples from the Google Visualization API.
• Become familiar with how digital models enable the organization of difference and patterns.
• Explore some possible options for modeling the data from your own project.

According to Willard McCarty, a model is "either a representation of something for purposes of study, or a design for realizing something new" (24). These two understandings of models correspond with Clifford Geertz's "denotative 'model of', such as a grammar describing the features of a language, and an exemplary 'model for', such as an architectural plan" (24). Here, models relate to maps. McCarty suggests that, like modeling, mapping "can be either of or for a domain, either depicting the present landscape or specifying its future—or altering how we think about it, e.g., by renaming its places. A map is never entirely neutral, politically or otherwise" (33). (For more, see his "Modeling: A Study in Words and Meanings.")

McCarty also suggests that there are two features of modeling as a practice:
• Take knowledge for granted and just start modeling. Eventually, meaningful surprise occurs when the model generates an occurrence that cannot be explained (e.g., something is where it shouldn't be), or when the model fails to generate the expected occurrence (e.g., something isn't where it should be) (25-26). Both of these examples could also be called "contrivances," or the bringing about of unintended events. (See Module 1 on knowledge production, curiosity, and the Wunderkammer.)
• Perceive the manipulability of information. Models are repeatedly altered and must be interactive (26). Digital models are arguably more flexible, interactive, and manipulable than print ones.

How, then, does a map become Haraway's nonliteral swervy thing, or Geertz's "model for"? How might it alter common perceptions of history, of landscape, of culture, of literature? Or how might it become a vehicle for humor or political action? (We're really going to unpack these questions in the next module.)

Implications for your digital humanities research projects
Modeling entails the
• Introduction of, and interaction between, media layers (e.g., the spreadsheet, the motion chart, the notes, the text, and the essay) in the stages of research and collecting data. (See Module 4 on intermediation and remediation, and Module 3 on collecting and conjecturing.)
• Mobilization of theory through what McCarty calls "the continual process of coming to know by manipulating things" (28). In other words, the swervy thing is also a theory thing: it's a material object (that has force and is used by people in certain ways) and a concept repeatedly put into action.
• Integration of quantitative approaches and classifications into critical approaches to history, culture, and literature.
• "Distant reading" of texts and discovering a problem without a solution. (See Moretti's comments in Module 5.)
• Challenges of:
o (1) Synthesizing various modes of perceiving, storing, and transmitting information,
o (2) Selecting the most effective data elements (for a swervy thing),
o (3) Finding the most persuasive model for your audience(s) and purpose(s), and
o (4) Determining whether you are representing information ("model of") or designing for the realization of the new ("model for").

What Now?: Applications
• Check out the Google Visualization API gallery. Scroll through the options (e.g., motion chart, geomap, and annotated time line) with your project in mind.
• For each that interests you, look (at least) at the examples provided, the data format, and the configuration options. Considering the aims of your project, as well as your elements (from Module 5), do any of the visualizations work for you? Why or why not?
• As a class, we'll work through an example motion chart using a spreadsheet as a data source. (A rough sketch of such a page appears after this list.)
• When we are finished, on the blog, respond to the following in your own entry:
o (1) Given this cursory look at modeling, what obstacles do you foresee?
o (2) For your project, are you more invested in modeling for or modeling of? Why?
o (3) How do the visualizations affect your perception of your elements (from Module 5)? What might need to change from that last module?
o (4) What other kinds of visualizations or models would you like to work with in class?
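For reference, here is a rough sketch of the kind of page the motion chart exercise might produce, using the Google Visualization API's JavaScript loader. The data—two invented texts, two invented years, and made-up counts for two hypothetical data elements—is illustrative only; in class, the rows would come from our shared spreadsheet rather than being typed in by hand.

<html>
  <head>
    <script type="text/javascript" src="http://www.google.com/jsapi"></script>
    <script type="text/javascript">
      // Load the Visualization API and the motion chart package.
      google.load('visualization', '1', {packages: ['motionchart']});
      google.setOnLoadCallback(drawChart);

      function drawChart() {
        // A motion chart expects a string column (the entity), a time
        // column, and one or more numeric columns (your data elements).
        var data = new google.visualization.DataTable();
        data.addColumn('string', 'Text');
        data.addColumn('number', 'Year');
        data.addColumn('number', 'Places Named');      // hypothetical element
        data.addColumn('number', 'Letters Exchanged'); // hypothetical element
        data.addRows([
          ['Text A', 1900, 10, 2],  // invented values, for illustration only
          ['Text A', 1910, 14, 3],
          ['Text B', 1900, 4, 7],
          ['Text B', 1910, 6, 12]
        ]);

        var chart = new google.visualization.MotionChart(
          document.getElementById('chart_div'));
        chart.draw(data, {width: 600, height: 300});
      }
    </script>
  </head>
  <body>
    <div id="chart_div" style="width: 600px; height: 300px;"></div>
  </body>
</html>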
What's Next?: Modules Ahead
• Mapping in Stakes, What's Data?

What to Consider during Future Modules
• Soon, you'll be submitting data for your project. Regardless of whether you are modeling for or modeling of, how will you make your data interesting, and how will it be organized? What audience(s) do you have in mind, and what matters to them?

Protagonist, from Ralph Ellison's Invisible Man:
All things, it is said, are duly recorded—all things of importance, that is. But not quite, for actually it is only the known, the seen, the heard and only those events that the recorder regards as important that are put down, those lies his keepers keep their power by. . . . Where were the historians today? And how would they put it down?

Learning Goals for the Module
• Become familiar with some critical approaches to technology and how to apply one or two of those approaches to your own project, especially to how you are gathering data.
• Determine—through examples and an assessment of your data elements—how those critical approaches might help you increase the stakes of your project.

What or who a map excludes, as well as what or who it enables, are arguably its most important aspects. Often, humanities research projects attend to how objects, such as maps, function in certain social or cultural domains—how, for example, maps render invisible certain people, places, and events, and how to change existing maps or create new ones accordingly. Indeed, maps are ways of writing and classifying history, of putting it down. A question, then, is how to recognize what's missing from your own work, why what's missing matters, and how to revise, if need be.

Before we start there, let's look at an example mapping project, "Queering the Map: The Productive Tensions of Colliding Epistemologies," by Michael Brown and Larry Knopp. Here's the abstract from their article:

"Drawing on and speaking to literatures in geographic information systems (GIS), queer geography, and queer urban history, we chronicle ethnographically our experience as queer geographers using GIS in an action-research project. We made a map of sites of historical significance in Seattle, Washington, with the Northwest Lesbian and Gay History Museum Project. We detail how queer theory/activism and GIS technologies, in tension with one another, made the map successful, albeit imperfect, via five themes: colliding epistemologies, attempts to represent the unrepresentable, productive pragmatics, the contingencies of facts and truths, and power relations. This article thus answers recent calls in the discipline for joining GIS with social-theoretical geographies, as well as bringing a spatial epistemology to queer urban history, and a cartographic one to queer geography."

With this project as a case study, how might "Queering the Map" have emerged from different critical approaches to the map as a technology? Below are five possible approaches, broadly framed and adapted from Roel Nahuis and Harro van Lente's "Where Are the Politics? Perspectives on Democracy and Technology."

• Intentionalist: How is a map (as an artifact representing the values of mapmakers and specific social groups) a materialization of power and authority?
• Proceduralist: How is mapping (as a set of social practices with rules and agreed-upon guidelines) a negotiation between interested groups? And whom do these groups represent?
• Actor-Network: How is the map (as an artifact that affords and forbids certain actions) the result of a struggle between forces or programs, and how does it affect people's actions on a local level?
• Interpretivist: How are the map (as a text with multiple meanings) and the mapmaker (as a participant with certain investments) influencing and influenced by the discourse in which they are embedded?
• Performative: How is the setting of mapping practices (as activities influenced by particular biases) enabling people to act the way that they do, and what other approaches to the setting would somehow surprise or lay bare biased mapping practices?

As a class, let's unpack these approaches a bit. Then, in your clusters, you can decide—in the context of the "Queering the Map" case study—which two critical approaches you find most relevant. After you chat and blog (with one entry per group) about your decisions, we'll reconvene and discuss.

Implications for your digital humanities research projects
• Digital projects that are motivated by and well aware of their specific critical approaches to technology will be more persuasive—they will have higher stakes—than those projects where the critical approach is loosely articulated or even nonexistent.
• Critical approaches to technology allow digital humanities projects to do more than simply "represent" information in new forms (e.g., digitize print texts). They allow them to produce new knowledge.
• Note how these five critical approaches relate to Module 6 (on modeling "of" and "for") and Module 1 (on emergent media and knowledge production).
• Selecting one or two of the approaches above and mobilizing them in your own work might be a way of focusing your project.
• These critical approaches affect both how projects are theorized and how they are practiced (e.g., your project as an idea and your project as a process of gathering and organizing data).

What Now?: Applications
• Return to your data elements from Module 5 and to your workflow. In your own blog entry, please respond to the following questions:
o How, if at all, are your data elements emerging from one or several of the critical approaches listed above, and to what effects? If they don't appear to be emerging from one of these approaches, then explain why you think that is the case.
o If you were to revise your data elements along the lines of one of these approaches, then what would change? (For example, would you cut an element? Add one? Revise them so that they relate differently? Change how they are worded?)
• Time permitting, let's discuss your entries in your clusters and as a class.

What's Next?: Modules Ahead
• What's Data?, Close Reading

What to Consider during Future Modules
In the next module, you'll be gathering data based upon the data elements you selected in your workflow. Given this module, what kind of data do you expect?
How might you make that data more interesting? Riskier? More provocative?

From Linda Nagata's Limit of Vision:
Virgil squeezed his eyes shut, wondering if they ever would have the power to heal death. The human body was a machine; he knew that. He had looked deep into its workings, all the way down to the level of cellular mechanics, and there was no other way to interpret the processes there than as the workings of an intricate, beautiful, and delicate machine. Machines, though, could be repaired. They could be rebuilt, copied, and improved—and sometimes it seemed inevitable that all of that would soon be possible for the human machine too.

Learning Outcomes for the Module
• Understand how data elements (as categorizations of data) are imbricated in material practices, which are associated with actual people and places.
• Consider the importance of scope in assessing your data.
• Learn some "textured" language for assessing your own data and data sources.

In Nanovision, Colin Milburn writes about how nanotechnologists and nanoscientists can "fashion their work as a mapping practice, an effort to contain novel territory within a representational topography that is pictorial, rhetorical, and numerical all at the same time—a 'data map,' a visual rendering, and a descriptive survey of the landscape that transforms its various physical properties into property as such" (65-66).

Put broadly, nanovision—for instance, a researcher's ability to see objects and bodies at the atomic level—translates the microscopic world into a landscape to be explored, mapped, and territorialized: to visualize it, give it a language, and quantify it. The world as we know it is rendered strange through a new scale. For one, bodies and objects behave differently when we zoom in, when we use technologies such as scanning tunneling microscopes to see what the human eye cannot. What's more, if we can now map what we cannot see with the naked eye, then we can also start to manipulate and shape it. In short, the nanoworld becomes a world of new affordances and possibilities. And as Milburn points out: "Indeed, a vocabulary of western exploration and 'Manifest Destiny' plays a powerful epistemic role in nanoscience research" (67). Expand vision? Expand human control and domain over the world (67). (Martin Jay, among others, refers to this as "ocularcentrism.") Perhaps a video spells it out better. Let us see.

Implications for your digital humanities research projects
• With maps, we tend to think of how to make things that are larger than us (e.g., the whole world) smaller than us (e.g., a map of the world). Yet nanotechnology demonstrates how mapping is really a matter of scope—of expanding our scale (e.g., applicability) and range (e.g., breadth) of knowledge, whether that means seeing the entire world or seeing the minute inner workings of the body. The scope of your data (and not necessarily the amount of it) is thus always something to consider.
Of course, thinking big isn't always the best option, and your acute knowledge of your project's scope—of why you are setting its scale and range the way that you are—will only enhance how persuasive audiences find it.
• While nanotechnologies afford us increasing freedom (e.g., of choice, of movement), freedom is not the same as control. For instance, our bodies still function in ways we cannot see, let alone grasp. Increased access to information about them does not imply that all material problems will be easily remedied. Put another way, political issues cannot be resolved technologically. (See Wendy Chun and Module 7 here.) Persuasive digital projects often recognize that knowledge does not exist in objects, bodies, technologies, or information alone, but rather in the material relationships between them. (Some refer to these relationships as ecologies.)

What Now?: Applications
• For this module, I asked you to bring in some data. More specifically, I asked you to actually cut up your print project—to cut into print, gather what you need, and consequently cut out the rest. I also asked you to arrange your data according to your data elements. Now, with that arranged data in front of you, let's ask the following questions of what we'll call your data's "texture." These metaphors, borrowed in part from Sorting Things Out by Bowker and Star, will be a means of reminding ourselves of your data's materiality and its scope. Comparable to how nanotechnologists speak of carbon nanotubes, let's speak of your data as threads:
o How "thick" of a thread is it? (That is, how well does it account for the range of possibilities suggested by your data elements?)
o How "durable" of a thread is it? (That is, how would it hold up to critique? To what critical approaches (see Module 7) is it accountable?)
o How "tightly or loosely woven" is it? (That is, how broadly or narrowly does it describe the place, people, or things it's describing?)
o How well are your data sets "knotted" or "tied" together? (That is, how do they relate, and how do they contradict/complement each other?)
• With these questions in mind, please, in your own entry, blog about miscellany. But by "miscellany," I'm being quite specific. After conducting the above material assessment of your data's scope:
o What do you think you "cut out" from the data sources and archive you've been working with? What's in the remnants? In "zooming in" on specific elements of the text, what did your nanovision occlude, and to what effects on your project? Especially consider how tightly or loosely woven the data is.
o What are the limits of your data sources and archive? Their limits of vision? Do you need to look to more texts? Why, or why not? Especially consider the thickness and durability of your threads.
o Now that you have some data, how, if at all, did the data elements (as constraints) help you gather data that surprised you? Put another way, what, if anything, did you think you had under control and all mapped out that, in fact, you do not? Especially consider the ties and knots across your data sets. If you were not surprised, then why?

What's Next?: Modules Ahead
• Close Reading, Assessing Your Project

What to Consider during Future Modules
• In the near future, you'll be producing a data model, which is essentially an abstraction of how you are organizing and processing your data. In composing such an abstraction, what are some ways to remind yourself of your data's texture? Of its material embeddedness and implications? Good luck, humans.

From The Verbal Icon, by W.K. Wimsatt and Monroe C. Beardsley:
One must ask how a critic expects to get an answer to the question about intention. How is he to find out what the poet tried to do? If the poet succeeded in doing it, then the poem itself shows what he was trying to do.

Learning Outcomes for the Module
• Understand what might be some critiques of "distant reading" and how to engage those critiques.
• Collaboratively annotate a text that has been popular in the class thus far and see what collaborative annotation affords.
• Recognize some possible tensions between "distant reading" and "close reading" and articulate why that tension is productive.

Put this possibility on the table: For the entire quarter, you've been compiling data on an author's entire corpus—let's say Virginia Woolf's. More specifically, you're studying what places are referenced in her novels, and you're locating those places, together with relevant quotes from the texts, on a single map. When the quarter's finished, it's quite possible that you haven't read—in its entirety—a single book by Virginia Woolf.

My first suggestion? Read a book by Virginia Woolf. My next suggestion? Consider what someone (e.g., a literary critic, a fan of Woolf) would value as "close reading," where careful attention is paid to the words and ideas of a text (and often just the text alone). Select passages of the text are then scrutinized in a work of criticism. (You've likely done this, no?)

Actually, for this module, let's conduct a close reading of a text that's been popular in the class. For now—of course, subject to change—I'll go with Martin Heidegger's "The Question Concerning Technology," first published in 1954. I select it primarily because it's essentially a canonical (or ubiquitous) text as far as the culture, philosophy, history, and sociology of technology are concerned. Regardless of the text (which should be only a chapter or an article), we'll go through it, in class, line by line, and annotate it using Microsoft Word. I'll then circulate that annotated text for your future reference. During the module, it might not be a bad idea for a number of us to play the role of transcriber, taking down the annotations, in the margins, as they emerge. After all, transcription is a matter of interpretation, and it's labor-intensive. Switching up transcribers will thus give people breaks and generate a broader range of experiences and questions during the exercise.
Once the text is annotated, we'll ask what we've learned from the close reading and how it differs, if at all, from the work you've been doing all quarter.

Implications for your digital humanities research projects
• Distant readings are often, fairly enough, critiqued as ignoring the principles and benefits of close reading. While assessing your project and speaking to it, keeping these critiques in mind is a smart practice.
• Rather than eschewing close reading for distant reading (or vice versa), a more complex response is to note how the two differ, to what effects, and why. For instance, a literary historian might be more invested in a distant reading, while a New Critic might be more invested in a close reading. Both afford distinct and (when done persuasively) equally important readings.
• If you've been asked to conduct close readings in the past, then you might consider how your project for this class has shaped your learning and humanities research differently.
• Collaboratively annotating a text, where a screen and document are shared, is one digital humanities practice that highlights how subordinating individual investments toward a shared goal (e.g., annotating a text as a group and collectively determining the benefits of close reading) becomes the vehicle for mutual, technology-focused learning. (See Chris Kelty here.)

What Now?: Applications
• As a class, we'll create a document that puts our annotated text into conversation with your individual projects. In so doing, we should draw upon each of them for evidence and address the following:
o What does a distant reading afford humanities research, especially digital humanities research? How?
o What does a close reading afford humanities research, especially digital humanities research? How?
o How are the two approaches coextensive or complementary? In tension?
o How, if at all, do computers and new media figure into the above questions?
• If we have time, then you should, in your own blog entry, respond to this exercise with your own thoughts. Things to consider: What concerns do you have about distant reading? How, if at all, is it at odds with other ways you've practiced reading and criticism? What approach(es) do you prefer, and why?

What's Next?: Modules Ahead
• Assessing Your Project

What to Consider during Future Modules
• For the last module, you'll be thinking through how to assess your project. How might this conversation between close and distant reading figure into your assessment? By focusing, perhaps, on what your project is not doing, what have you learned about what it is doing persuasively?

Walter Benjamin, in "Theses on the Philosophy of History":
Thinking involves not only the flow of thoughts, but their arrest as well.

While it's tempting to spend the balance of the quarter aggregating data and piling on media, I say we stop for a second and start building things. But!
This one’s not the whole idea. It’s a thought piece. And it should consist of the following: • As a field of study, what you think the digital humanities does, • How you think its practitioners do what they do, and • Initial and interesting ideas for at least one digital humanities project that you could develop this quarter. At least one. By “you,” I mean you in particular. Be selfish, people. How you shape this information is up to you. You can essay, diagram, video, draw . . . The medium is not the matter. Pick what you prefer. However, you should figure this in: your medium will influence how you (and your audience) create and think through a message. (Consider “remediation” and “intermediation” from Module 4, as well as “syntagms” and “paradigms” from Module 2.) And remember: A thought piece is a riff. The point is to conjecture. Speculate. Toss out a rich idea or two or three, and later we’ll talk about making the whole thing happen. Outcomes Your thought piece should: • Demonstrate a general understanding of how Modules 1 through 4 relate to the digital humanities as a field and a set of practices (e.g., apply some of the concepts from the modules, think through how to use new media for new forms of scholarship, or unpack the distinctions between print and digital texts). • Give your audience (that is, your 498 peers and me) a sense of why your project(s) would be filed under “digital humanities” and what’s interesting—provocative, even—about your idea(s). Before and during the process, consider: • Reviewing the visualization/diagram of the class (in the syllabus). What’s familiar? What isn’t? • Giving the class modules another gander. What appeals? What confounds? • Looking back at some of your old work from other classes. What have you written on? Studied? What do you care about? What’s curious, and what could be developed? Conversation Coming Soon Your thought piece is due—on the class blog (embedded, via a link, or as text)—before class on Wednesday, April 15th. It will serve as a vehicle for conversation during your first conference with me. Which is to say: I’ll attend to it before we meet. That way, we don’t start cold. I swear. The thought piece will be graded on the 4.0 scale, and it can be revised once. It’s part of your individual project grade. If you have problems with the blog, then let me know. http://books.google.com/books?id=AFJ7dvSdXPgC&dq=illuminations&ei=Q4e6Sd6_O5WWkATG6oiCDA&pgis=1 http://mappingthedigitalhumanities.org/wp-content/uploads/2009/01/module-2.pdf http://mappingthedigitalhumanities.org/wp-content/uploads/2009/01/module-4.pdf Ishmael, in Herman Melville’s Moby Dick: God keep me from ever completing anything. . . . Oh, Time, Strength, Cash, and Patience! Michel Eyquem de Montaigne in “Of Cannibals”: I am afraid our eyes are bigger than our bellies and that we have more curiosity than capacity. We grasp at all, but catch nothing but wind. You’ve made a thought piece. We’ve talked about it. Now it’s time to sketch out what—aside from time, strength, cash, and patience—is needed to put a thought in motion. Of course, a thought moving isn’t a thought complete. Keep that pithy line in mind as you respond to this prompt. Or, to contextualize: The goal for the quarter isn’t to finish a research project; it’s to build one worth developing in the future. Recall Shelley Jackson, from Module 1: “there can be no final unpacking.” Determine, then, what you can grasp—what’s feasible—between now and June-ish. How practical, especially for humanists. 
Let’s give such practicality a name: “needs assessment.” However! As opposed to the image below, your “needs” here won’t simply be downloaded for regurgitation later. You’ll have to come up with them on your own, with some guidelines. As with the first prompt, the medium is yours. But please respond to the following: • What do you want from your emerging project? Or, what is your objective, and what’s motivating it? • What do you need (e.g., knowledge, experience, materials, and practice) to pull everything off? Or, to return to Moretti and Module 5 for a sec: For now, what knowledge are you taking for granted? • Where are you going for evidence or data? That is, what texts will you be working with? Outcomes Your needs assessment should be: • Specific, pointing to the particular knowledge you need and want (e.g., XHTML, GIS, literature review, and media theory/history) and what materials you should have (e.g., software, time, and books). • More refined and focused than your thought piece. (If the thought piece was about broad possibilities, then your needs assessment is about concrete ones.) • A way of responding to your first conference with me. (Reference our conversation and expound upon it.) • Aware that its audience consists of your peers and me. (Feel free to use names or speak to particular bits from class.) Before and during the process, consider: • What is realistic for a quarter? • How do you avoid reinventing the wheel? What did you learn from another course or project that could be developed and re/intermediated? • When the spring’s finished, what kind of project will be most useful for you? Think before and beyond now. http://books.google.com/books?id=cYKYYypj8UAC&dq=moby+dick&ei=Boq6SbL7K5r6kASbl6yJCA http://mappingthedigitalhumanities.org/wp-content/uploads/2009/01/module-1.pdf http://mappingthedigitalhumanities.org/wp-content/uploads/2009/01/module-5.pdf http://mappingthedigitalhumanities.org/wp-content/uploads/2009/01/module-4.pdf Mapping the Digital Humanities Assignment 2, Page 2 __________________________ Critiques Soonish Your needs assessment is due—on the class blog (embedded, via a link, or as text)—before class on Monday, April 20th. You will share it during in-class critiques. During those critiques, you’ll also respond to your peers’ assessments. The needs assessment will be graded on the 4.0 scale, and it can be revised once. It’s part of your individual project grade. If you still have problems with the blog, then let’s talk. I might need to revise or address something. Carl von Clausewitz in On War: Everything in strategy is very simple, but that does not mean that everything is very easy. Fair enough, Carl, but that doesn’t mean we can’t at least try to make things a tad easier, right? Despite the fact that plans and thoughts and needs and life are all subject to change, sketching out an agenda, through some simple elements, is rarely a bad idea. The key is—to borrow from Chris Kelty—“planning in the ability to plan out; an effort to continuously secure the ability to deal with surprise and unexpected outcomes” (12). So how about what we’ll call a “workflow”? Again, the medium is yours, but please transmit the following: • What is your research question? (Try one that starts with “how.”) • What are the data elements for your project? (We have already discussed these in class; and, if all’s on par, then you should have already drafted them.) 
• How are you animating these elements (e.g., through what medium—a motion chart, a geomap, a timeline—are you shaping information)?
• What do you expect to emerge from this animation (e.g., what will the information look like, how will the audience interpret it, or what might you learn from it)?
• Ultimately, what are you going to do with it (e.g., how will it influence your current work, how might you use it in other classes, how will it persuade audiences, or how will it change the ways in which you perceive the text(s) you're working with)?

Outcomes
Your workflow should:
• Be driven by a concrete and provocative research question, which emerges from your responses to Prompts 1 and 2.
• Be very specific about the data elements you are using. Name them. List them out.
• Be very specific about the kind of animation you are using, including some knowledge of how that animation allows you and your audience to produce knowledge—or how that animation is a "swervy thing."
• Demonstrate that you are aware of why you are using your particular data elements and animation and what the implications of your decisions might be (e.g., what are the benefits and deficits, or the hot ideas worth some risk and the not-so-hot possibilities that are deterring you).
• Be aware that its audience consists of your peers and me.

Before and during the process, consider:
• How your digital project—through computational animation—demands a different mode of thought than, say, writing a paper. How might you take advantage of this difference? What does it afford?
• What options you have for animation, what you are most comfortable with, and—again, again, again—what seems feasible for a quarter.
• In your previous work, what terms or concepts pop up most often, which ones interest you the most, which ones you'd rather do without, and how those terms would translate in a computational approach.

Toward Making Animation Matter
Your workflow is due—on the class blog (embedded, via a link, or as text)—before class on Monday, April 27th. In class, we'll get theoretical and address the "stakes" of your animation and data elements, or how you can make them matter and for whom.

The workflow will be graded on the 4.0 scale, and it can be revised once. It's part of your individual project grade. Keep me posted with questions and quibbles.

From Dyeth, in Samuel R. Delany's Stars in My Pocket Like Grains of Sand:
Someone once pointed out to me that there are two kinds of memory (I don't mean short- and long-term, either): recognition memory and reconstruction memory.
The second is what artists train; and most of us live off the first—though even if we're not artists we have enough of the second to get us through the normal run of imaginings.

A constant challenge in academic work, then, is to model something that reshapes the material with which you and others are already familiar—to re-construct and re-imagine history, culture, texts, territories, and places through new paradigms, without simply recognizing them as what you already know, using the same blueprints, strategies, and maps as before. To produce a contrivance. To project a world and animate it. To swerve. I'm not saying it's easy. It's not. But give it a whirl. You've thought about your project (in your Thought Piece), assessed its possibilities (in your Needs Assessment), made it elemental (in your Work Flow), and speculated on what might happen come June (during in-class workshops). Now's the time to give people the classification system for your information collecting and some results—that is, your data model and some data.

This time around, the medium isn't yours. Sorry. Please complete the data model worksheet. However, when you provide your data, you can choose the medium. For instance, feel free to use a spreadsheet, provide copies of a log, or complete the table I provide at the end of the worksheet.

Outcomes
Your data model should be:
• Extremely specific, providing your audience with exact details for each of your data elements, following the form provided, and leaving no necessary field blank.
• A cogent means of giving a reader who is not familiar with your project a sense of how you are collecting and organizing your data.

Your elaboration on your data model should be:
• A mobilization of terms and concepts from class (e.g., classification, paradigms, re/intermediation, collecting, affordance, intent, procedures, bias, discourse, animation, and distant reading), putting them to work in the context of your project.
• Concrete and situated in your project. Abstract language should be avoided. Responses to each question should be based on examples from and exact instances in your project.
• Aware of the limits and benefits of the decisions you are making and how those decisions will affect your target audience and your own learning. Remember: you can't do everything, but you should be able to account for how you are mapping your project.

Your data should be:
• Well-organized and specific, based upon the framework outlined in your data model.
• Sufficient to—at this juncture in your project—allow you to make some preliminary findings based upon your research. (However, the data does not need to be complete. You might still be in the process of collecting more. In the worksheet, I require three rows of data. I recommend collecting much more, if possible. For some projects, twenty to forty rows will be necessary.)

http://books.google.com/books?id=ngHQ_ZghbbYC&printsec=frontcover&dq=stars+in+my+pocket&source=gbs_summary_r&cad=0#PPA183,M1
http://mappingthedigitalhumanities.org/wp-content/uploads/2009/04/data-model.doc

Before and during the process, consider:
• What you expect to emerge from your animation at the quarter's end. How do those expectations resonate with your data model?
• Returning to what you churned out in response to Prompts 1 through 3. What's your trajectory, collector?
• How, broadly speaking, this approach to humanities work relates to your previous coursework and experiences, and to what effects.
• Revisiting the modules and contacting me and/or your peers with any questions you have about the terms and concepts used.

Another Review Coming Soon
Your data model worksheet is due—on the class blog (embedded, via a link, or as text)—before class on Monday, May 11th. During that class, your worksheet will be peer reviewed, and I will grade your worksheet based on that peer review. The data model will be graded on the 4.0 scale, and it can be revised once. It's part of your individual project grade. Hope all's coming along well. As always, let me know about your concerns.

http://mappingthedigitalhumanities.org/?page_id=235

From Hervé Le Tellier's "All Our Thoughts": I think the exact shade of your eyes is No. 574 in the Pantone color scale.

Ah . . . the abstract: the oh so academic act of summarizing work that's often still in progress. Your project's not finished, you're still not sure if everything coheres, and the thing's so deep you can't dare reduce it to a single paragraph. I know this. I don't particularly enjoy writing abstracts, either. But abstracts are necessary beasts. Aside from giving your readers a quick snapshot of your research, they also force you to articulate—in a precise fashion and in exact terms—what, exactly, you are up to. To the details, then. Your abstract should include:
• The aim of your project and its motivation/purpose,
• Your research question (although it does not need to be articulated as a question),
• Your method (how you did what you did),
• Your results (what you learned),
• The implications of your results (or why your research matters), and
• The trajectory of your project (what you plan to do with it in the future).

This one should be in words. Despite Blake's abstract of humans (above-right), we're going with the industry standard here.

Outcomes
Your abstract should:
• Be no more than three hundred words.
• Be one concise and exact paragraph.
• Include a title for your project, three keywords for it, and a one-sentence tagline describing it. (The keywords and tagline are not part of the three-hundred-word limit.)
• Be written for educated, non-expert audiences (e.g., academic types who might not be familiar with the digital humanities) and avoid jargon.
• Summarize your work as it stands, instead of becoming an idea hike into unventured regions (that is, avoid speculations).
• Mobilize terms and concepts from the class, again, for educated, non-expert audiences.
• Demonstrate, through clear language, how your project's motivation, question, method, results, and trajectory are related.
• Follow the form below on page two.

Before and during the process, consider:
• How your data model is one way of thinking through your method.
• Returning to your response to Prompt 3, which asked you for your research question, and to Prompt 2, which asked you what you want from your project.
• Module 7 (on making your project matter) and how it speaks to your project's motivation and the implications of your results.
• How to write for people who would have absolutely no clue what, exactly, the digital humanities is.
• How terms common in the course thus far (e.g., paradigm, syntagm, model, distant reading, remediation, and intermediation) might be helpful when articulating your project.
• When terms should be defined.
Contextualizing the Thing
Your abstract is due—on the class blog (attached as a Word document)—before class on Wednesday, May 20th. On May 27th, we'll consider how to integrate your abstract into the presentation of your project. An abstract is nothing without what it's abstracting. The abstract will be graded on the 4.0 scale, and it can be revised once. It's part of your individual project grade. If you need help condensing, then let me know.

Form for the Abstract
Project Title
Your Name, Your Major
Tagline
Three keywords
Body of abstract (300 words, one paragraph)

Examples
View some sample abstracts (which do not necessarily follow the format and outcomes for this prompt, but are nevertheless good references).

http://mappingthedigitalhumanities.org/wp-content/uploads/2009/01/assignment-3.pdf
http://mappingthedigitalhumanities.org/wp-content/uploads/2009/01/assignment-2.pdf
http://mappingthedigitalhumanities.org/wp-content/uploads/2009/01/module-7.pdf
http://mappingthedigitalhumanities.org/wp-content/uploads/2009/04/userguide.pdf
http://www.sccur.uci.edu/sampleabstracts.html

From DJ Spooky's Rhythm Science: As George Santayana said so long ago, "Those who cannot remember the past are condemned to repeat it." That one's scenario. But what happens when the memories filter through the machines we use to process culture and become software—a constantly updated, always turbulent terrain more powerful than the machine through which it runs? Memory, damnation, and repetition: That was then, this is now. We have machines to repeat history for us. . . . The circuitry of the machines is the constant in this picture; the software is the embodiment of infinite adaptability, an architecture of frozen music unthawed.

Reflection, reflection, reflection. Instructors often like the word. I'm not sure it fits here, though. The purpose of this project assessment isn't for you to ruminate on whether you're good enough or smart enough. We know you are, and people like you. It's for you to articulate what—over the course of the quarter—ultimately emerged from your project and what you think of it. The thing began as an idea. You then converted it into an agenda, with a model, compiling pieces of data, and ultimately animating those pieces. That said, I hope you collected something you're happy with. The project goal was for you to think through "generative constraints" as strict as computation and data models to produce provocative questions, new knowledge, and reconfigurations of literature, culture, and history. After all, the hardware of history needn't determine its interpretation, and the wiring of culture is never neutral. Infinite adaptability.

With that adaptability in mind, please unpack this list, without, of course, the brazen assumption that your unpacking is final. The quarter just so happens to be over. (And I'm really sad about that.)
• How—for better and for worse—does your animation project differ from an academic paper (especially one intended for print)? What does it ask of audiences and to what effects?
• How does your project produce new knowledge, and about what?
• Considering the brevity of a quarter, how was your project a success? What did you learn from it? What will others?
• How could you improve your project? What do you want to continue learning from it?
• How, if at all, do you plan on developing (or using) your project in the future? Do you plan to circulate it to others or make it public?
Why or why not?

Unless you are going for writing credit, I've decided to let you choose the medium or media here. You can make—or blend together—video, a website, audio, word docs, or what-have-you. Be creative. Just do me two favors:
1. With your assessment, include three outcomes upon which I should assess your project and your assessment of it. Those outcomes should include references to your method for collecting data, your awareness of your own bias/intent/procedures, your project's design, and how your project produces knowledge (instead of just re-presenting known information).
2. Provide me with your final animation project. Upload it to the blog, provide a link, or the like. (See more below.)

Outcomes
By focusing on your project as a process, your project assessment should:
• Be composed for educated, non-expert audiences (e.g., academic types who might not be familiar with the digital humanities).
• Demonstrate your understanding of the digital humanities as a field, using material from the class when appropriate.
• Reference specific aspects of your project and draw upon it for evidence.
• Exhibit critical approaches to your own project (e.g., show that you know how you did what you did, what worked, and how you could have done things differently).
• If applicable, include a works cited page of texts quoted, paraphrased, or the like.

Before and during the process, consider:
• Returning to your responses to all prompts. How has your project—and your framing of it—changed since then?
• Returning to the course syllabus and assessing what you've learned in the class since day one of the quarter.
• Returning to the user's guide for CHID 498.
• Circulating a draft assessment to me and your peers. (Use the blog!)
• How to write for people who would have absolutely no clue what, exactly, the digital humanities is.
• Doing something that will keep you interested. It's finals week, in spring, just before summer, y'all.

This One Will Not Be Revised
Your project assessment and final portfolio are due—on the class blog (filed under your name)—by the end of the day, Wednesday, June 10th. Here's what (ideally) should be uploaded to your author page on the blog:
• Mapping 1,
• Thought Piece (First Draft and Revision, if applicable),
• Needs Assessment (First Draft and Revision, if applicable),
• Work Flow (First Draft and Revision, if applicable),
• Mapping 2,
• Data Model (First Draft and Revision, if applicable),
• Abstract (First Draft and Revision, if applicable),
• Animation (all versions, including the one presented on June 3rd),
• Project Assessment, and
• Anything else you think is relevant.

As a reminder, here's how your work in 498 will be graded:
• Class participation (30% of the grade)
• Blogging and collaborative mapping (20% of the grade)
• HTML quiz (5% of the grade)
• Final exhibition (5% of the grade)
• Individual project (40% of the grade)
These five components of the class will each be graded on a 4.0 scale and then, for your final grade, averaged according to the percentages I provide above.
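If you'd like to see exactly how that averaging works, here's a minimal sketch in Python; the component grades are invented for illustration (they're nobody's actual marks):

    # Weights from the list above; grades are hypothetical 4.0-scale values.
    weights = {"participation": 0.30, "blogging": 0.20, "quiz": 0.05, "exhibition": 0.05, "project": 0.40}
    grades = {"participation": 3.8, "blogging": 3.4, "quiz": 4.0, "exhibition": 4.0, "project": 3.6}

    # Multiply each component grade by its weight, then sum.
    final = sum(weights[k] * grades[k] for k in weights)
    print(f"Final grade: {final:.2f}")  # about 3.66 with these numbers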
And here's how the portfolio is graded:
• Thought piece (10% of portfolio, can be revised once after it's graded),
• Needs assessment (10% of portfolio, can be revised once after it's graded),
• Work flow (10% of portfolio, can be revised once after it's graded),
• Data model (15% of portfolio, can be revised once after it's graded),
• Abstract (15% of portfolio, can be revised once after it's graded), and
• Final prototype and assessment (40% of portfolio, cannot be revised after it's graded).

See me with questions! Have a rad summer break, people. It's been a pleasure, and—to reiterate—make this last bit interesting. After all, CHID 498 was, from the get-go, an experiment.
Dombrowski, Q 2020 Preparing Non-English Texts for Computational Analysis. Modern Languages Open, 2020(1): 45, pp. 1–9. DOI: https://doi.org/10.3828/mlo.v0i0.294

ARTICLE – DIGITAL MODERN LANGUAGES

Preparing Non-English Texts for Computational Analysis
Quinn Dombrowski
Stanford University, US
qad@stanford.edu

Most methods for computational text analysis involve doing things with "words": counting them, looking at their distribution within a text, or seeing how they are juxtaposed with other words. While there's nothing about these methods that limits their use to English, they tend to be developed with certain assumptions about how "words" work – among them, that words are separated by a space, and that words are minimally inflected (i.e. that there aren't a lot of different forms of a word). English fits both of these assumptions, but many languages do not. This tutorial covers major challenges for doing computational text analysis caused by the grammar or writing systems of various languages, and ways to overcome these issues.

Introduction
Most methods for computational text analysis involve doing things with 'words': counting them, looking at their distribution within a text or seeing how they are juxtaposed with other words. While there's nothing about these methods that limits their use to English, they tend to be developed with certain assumptions about how 'words' work – among them, that words are separated by a space, and that words are minimally inflected (i.e. that there aren't a lot of different forms of a word). English fits both of these assumptions, but many languages do not.

Depending on the text analysis method, a sufficiently large corpus (on the scale of multiple millions of words) may sufficiently minimize issues caused by inflection, for instance at the level commonly found in Romance languages. But for many highly inflected Slavic and Finno-Ugric languages, Arabic, Quechua, as well as historical languages such as Latin and Sanskrit, repetitions of what you think of as a 'word' will be obscured to algorithms with no understanding of grammar, when that word appears in different forms, due to variation in the number, gender or case in which that word occurs. To make it possible for an algorithm to count those various word forms as the same 'word', you need to modify the text before running the analysis. Likewise, if you're working with Japanese or Chinese, which don't typically separate words with spaces, you need to artificially insert spaces between 'words' before you can get any meaningful result. For example, 'I went to Kansai International Airport' is written in Japanese as 関西国際空港に行きました. The lack of spaces between words means that tools dependent on spaces to differentiate (and then count) words will treat this entire sentence as a single 'word'. Segmentation – the process of adding spaces – is not always an obvious or straightforward process; on one hand, it's easy to separate 'to' and 'went' from the
name of the airport (関西国際空港 'Kansai International Airport' に 'to' 行きました 'went'), but depending on what sorts of questions you are attempting to answer with the analysis, you may want to further split the proper name to separate the words 'international' and 'airport', so that they can be identified as part of a search, or contribute to instances of those words in the corpus: 関西 'Kansai' 国際 'international' 空港 'airport' に 'to' 行きました 'went'.

Goals
This tutorial covers major challenges to doing computational text analysis caused by the grammar or writing systems of various languages, and offers ways to overcome these issues. This often involves using a programming language or tool to modify the text – for instance by artificially inserting spaces between every word for languages such as Chinese that aren't regularly written that way, or replacing all nouns and verbs with their dictionary form in highly inflected languages such as Finnish. In both of these situations, the result is a text that is less easy to parse for a human reader. Removing inflection may have the effect of making it impossible to decipher the meaning of the text: if a language has relatively flexible word order, removing cases renders it impossible to differentiate subjects and objects (e.g. who loved whom). But for some forms of computational text analysis, the 'meaning' of any given sentence (as readers understand it) is less important; instead, the goal is to arrive at a different kind of understanding of a text using some form of word frequency analysis. By modifying a text so that its 'words' are more clearly distinguishable using the same conventions as found in English (spaces, minimal word inflection etc.), you can create a text derivative that is specifically intended for computation and will lead to much more interpretable computational results than if you give the algorithm a form of the text intended for human readers. While this lesson provides pointers to code and tools for implementing changes to the text in order to adapt it for computation, the landscape of options is evolving quickly and you should not feel limited to those presented here.

Audience
Text analysis methods are most commonly used in research contexts, and frequently appear as part of 'an introduction to digital humanities' and similar courses and workshops. While these courses are taught worldwide, the example texts are, most often, in English, and the application of these text analysis methods may not be as straightforward for students working in other languages. This tutorial is intended for instructors of such workshops, to help them be better informed about the challenges and needs of students working in other languages and to provide them with pointers for how to troubleshoot issues that may arise.

For instructors of modern languages, text analysis methods can also have a place in intermediate to advanced language courses (see Cro & Kearns). For instance, while many digital humanities researchers now use more nuanced methods than word clouds, they can still be employed in a language pedagogy context to provide a big-picture visualization of word frequency – starting with the generic and obvious (prepositions, articles, pronouns etc.) and becoming more and more related to the content of the text as students apply and refine a stopword list (a list of words that should be removed prior to doing the word counts and generating the visualization).
Depending on the text, even a word cloud may make visible the impact of inflection, as it may contain multiple forms of a given 'word', which can spur discussion about what constitutes a 'word'. Intuitively, we think of saber ('to know' in Spanish) as the 'same word' as sé 'I know', sabemos 'we know', sabía 'knew' and so on, but what do we gain and lose if we treat them as 'different words', the way a computer would by default?

Text encoding
Text encoding – or how the low-level information about how each letter/character is actually stored on a computer – is important when working with any text that involves characters beyond unaccented Latin letters, numerals and a small number of punctuation marks.[1] It may be tempting to think languages that use the Latin alphabet are safe from a particular set of challenges faced by other writing systems when it comes to computational text analysis. In reality though, many writing systems that use the Latin alphabet include at least a few letters with diacritics (e.g. é, ñ, or ż), and these letters cause the same issues as a non-Latin alphabet, albeit on a smaller scale. While a text in French, Spanish or Polish may be decipherable even if all of these characters are mangled (e.g. ma□ana for mañana is unlikely to cause confusion, and even a less obvious case such as a□os for años is often distinguishable by context), issues with text encoding may cause bigger problems later in your analysis – including causing code to not run at all. For languages with a non-Latin alphabet, text encoding problems will render a text completely unreadable and must be resolved before doing anything at all with the text. Unicode (UTF-8) encoding is the best option when working with text in any language, but particularly non-English languages.

What is Unicode?
Unicode is the name of a computing industry standard for encoding and displaying text in all writing systems of the world. While there are scripts that are not yet part of Unicode as of 2020 (including Demotic and some Egyptian hieroglyphs), researchers affiliated with the Unicode consortium have done a tremendous amount of work starting in the late 1980s to differentiate characters (graphemes, the smallest units of a writing system) versus glyphs (variant renderings of a character, which look a little different but have the same meaning) for the world's writing systems, and assign unique code points to each character. With some writing systems – including Chinese and various medieval scripts – the decision of what constitutes a character as opposed to a glyph is at times controversial. Scholars who disagree with previous decisions or who feel that they have identified a character that is not represented in Unicode can put forward proposals for additions to the standard. While the Unicode consortium that shapes the development of the standard is primarily made up of large tech companies, scholars and researchers play a significant role in shaping decision-making at the language level (Anderson).

Why is Unicode important?
Before Unicode was widely adopted, there were many other standards that developed and were deployed in language-specific contexts. Windows-1251 is an encoding system that was widely used for Cyrillic and is still used on 11% of websites with .ru (Russian) domain names (W3Techs). A competing, but less common, Cyrillic encoding for Russian was KOI8-R, and a similar one, KOI8-U, was used for Ukrainian.
For Japanese, you may still encounter websites using Shift JIS encoding. For Chinese, you can find two major families of encoding standards prior to Unicode, Guobiao and Big5. A major advantage of Unicode, compared to these other encoding standards, is that it makes it possible to seamlessly read text in multiple languages and alphabets. Previously, if you had a bilingual parallel edition of a text on a single webpage with languages that used two different writing systems, you would have to toggle between multiple text encodings – reducing one side of the text, then the other, to gibberish as you switched between them.

If you work in a language with a non-Latin alphabet, odds are good that you'll encounter text that doesn't use Unicode encoding at some point in your work. Long-running digital text archives, in particular, are likely candidates for not having migrated to Unicode. If you try to open a text file using the wrong kind of encoding, you won't see text in the alphabet you're expecting to see, but rather a kind of gibberish that will soon become familiar. (For instance, Windows-1251 Cyrillic looks like Latin characters with diacritics: "Äîñòîåâñêèé Ôåäîð Ìèõàéëîâè÷. Ïðåñòóïëåíèå è íàêàçàíèå" for "Достоевский Федор Михайлович. Преступление и наказание" – Dostoevsky Fyodor Mikhailovich. Crime and Punishment.)

Making sure your text uses Unicode encoding
Most computational text analysis tools and code assume that the input text(s) use UTF-8 (Unicode) encoding. If the input text is not in UTF-8, you may get an error message, or the tool may provide an 'analysis' of the unreadable gibberish (Figure 1).

Figure 1. Voyant 'analysis' of Windows-1251 encoded Russian text.[2]

It is not obvious what encoding a text file uses: that information isn't included in the file properties available on Windows or Mac. There isn't even an easy way to write Python code to reliably detect a file's encoding. However, most plain text editors have some way to open a text file using various encodings until you find one that renders the text readable, as well as some way to save a text file with UTF-8 encoding. A plain text editor is software that natively reads and writes .txt files, without adding in its own additional formatting (which Notepad does in Windows). Atom is a cross-platform (Windows/Mac/Linux) plain text editor that you can install if you don't already have a preferred editor.[3] There are numerous packages (add-ons) for Atom that provide additional functionality. One of these is called convert-file-encoding.[4] Download and install this add-on following the instructions in the Atom documentation.[5] Once you've installed the convert-file-encoding package, open your text file in Atom. By default, Atom tries to open everything as UTF-8. If everything displays correctly, your file already uses Unicode encoding. If the text is gibberish, go to Edit > Select encoding, and choose a possible candidate encoding. The encodings are listed in Atom by what languages they cover, so you can try different options for your language if you're not sure. Once your text appears normally, go to Packages > Convert to encoding and select UTF-8. Then save your file.

[1] Note that 'encoding' here refers to the comparatively low-level technical process of standardizing which bits represent which letters in various alphabets. This is a different use of the term than the 'encoding' in the Text Encoding Initiative (TEI), https://tei-c.org, which captures structural and/or semantic features of text in a potentially machine-readable way.
[2] Voyant Tools, https://voyant-tools.org/.
[3] Atom is available for download at https://atom.io/.
[4] The convert-file-encoding package is available at https://atom.io/packages/convert-file-encoding.
[5] Atom documentation is available at https://flight-manual.atom.io/using-atom/sections/atom-packages/.
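If you would rather script the conversion than click through an editor, a minimal Python sketch along these lines can work. The file names are placeholders, and, as noted above, there is no fully reliable automatic detection: the loop simply shows you a sample decoded with each candidate encoding so that you, a human reader, can judge which one is right.

    # Try a few candidate encodings and print a sample of each attempt.
    candidates = ["utf-8", "cp1251", "koi8-r", "shift_jis"]
    for enc in candidates:
        try:
            with open("mystery.txt", encoding=enc) as f:
                print(enc, "->", f.read(200))
        except UnicodeDecodeError:
            print(enc, "-> could not decode")

    # Once you've identified the right encoding (say, Windows-1251),
    # read the file with it and re-save the text as UTF-8.
    with open("mystery.txt", encoding="cp1251") as f:
        text = f.read()
    with open("mystery_utf8.txt", "w", encoding="utf-8") as f:
        f.write(text)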
Segmentation
For Chinese and Japanese text, you need to segment your text, or artificially insert spaces between 'words', before you can use it for computational text analysis. For Chinese, some scholars treat every character as a 'word'. This destroys compounds but is more predictable than using a segmenter. For both Chinese and Japanese, segmenters work best when the text does not contain a lot of jargon or highly specialized vocabulary, or non-standard orthography (e.g. Japanese children's writing, which often uses the hiragana syllabary where a fully literate adult would use kanji).

Stanford NLP (natural language processing) provides a Chinese segmenter[6] with algorithms based on two different segmentation standards.[7] For Japanese, segmentation is available through the mecab software.[8] Rakuten MA is a Javascript-based segmenter that supports Chinese and Japanese.[9] There is also a Python implementation, Rakuten MA Python.[10] If you have trouble with mecab but aren't comfortable writing Python code yourself, there's a Jupyter Notebook available for segmenting Japanese.[11] See the Programming Historian tutorial 'Introduction to Jupyter Notebooks' (Dombrowski et al.) for a description of Jupyter Notebooks and how to use them.

[6] The Stanford NLP segmenter can be downloaded at https://nlp.stanford.edu/software/segmenter.shtml.
[7] This Chinese part-of-speech tagger tutorial begins with a step-by-step guide to segmenting with the Stanford NLP segmenter: https://github.com/quinnanya/dlcl204/blob/master/chinese/pos_chinese.md.
[8] Mecab can be downloaded at https://taku910.github.io/mecab/.
[9] Rakuten MA is available at https://github.com/rakuten-nlp/rakutenma.
[10] Rakuten MA Python is available at https://github.com/ikegami-yukino/rakutenma-python.
[11] The Jupyter notebook for running Rakuten MA Python is available at https://github.com/quinnanya/japanese-segmenter.
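For a feel of what segmentation looks like in code, here is a small Python sketch using two packages that are alternatives to the tools footnoted above: jieba for Chinese and fugashi (a wrapper around MeCab) for Japanese. These two packages are my suggestions rather than the tutorial's own examples, and both must be installed first (fugashi also needs a dictionary package such as unidic-lite).

    import jieba                 # Chinese segmenter
    from fugashi import Tagger   # MeCab-based Japanese tokenizer

    # Chinese: jieba.cut() yields the 'words' it identifies;
    # joining them with spaces produces a segmented text.
    zh = "我去了关西国际机场"  # 'I went to Kansai International Airport'
    print(" ".join(jieba.cut(zh)))

    # Japanese: the tagger splits the article's example sentence;
    # each token's .surface attribute is the word as written.
    ja = "関西国際空港に行きました"
    tagger = Tagger()
    print(" ".join(word.surface for word in tagger(ja)))

How the proper name 関西国際空港 gets split (or not) will depend on the tool and its dictionary, which is exactly the interpretive choice discussed above.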
Stopwords
Stopwords are words that are filtered out as the first step of text analysis. Many tools have a configuration option where you can define which words should be treated as stopwords.[12] Stopword removal is essential for some methods (including word clouds and topic modelling), to avoid having your results flooded with articles, copulas, prepositions and the like. Other methods, such as word vectors (which analyse words in their context as a way to explore semantic relationships within large corpora), rely on stopwords for important information about the semantic value of words, and stopwords should be retained in the text.

Stopwords are language specific, and more nuanced use of stopwords can involve text-specific lists that also exclude things like character names (which are likely to occur with high frequency, but that frequency may or may not be meaningful depending on your research question). If you're using a tool that supports the use of stopword lists, you should check to make sure that a default, almost certainly English, stopword list isn't being applied to your non-English text.

Some tools provide reasonable built-in stopword lists for multiple languages. Voyant offers generally reasonable lists for thirty-four languages, along with a combined 'multilingual' setting, and an option for defining your own list. These lists are not identical: the Russian list includes the words for many numbers (including пятьдесят 'fifty'), the Spanish list has no numbers but does include various forms of emplear 'use', and the Czech list includes no numbers whatsoever but does have a number of words related to news (e.g. články 'articles'), hinting at the domain and context of its origins. Is it the right thing to do to eliminate written-out numbers from a Russian text, or any references to 'articles' in a Czech text? It all depends on what you're trying to learn from the text analysis. Students should examine – and, if necessary, modify liberally – any stopword list before applying it to their text. If you're a digital humanities instructor, be careful about uncritically recommending stopword lists for languages you can't read yourself. As an initial vetting step, at least run any list you find through Google Translate first, and read through it. There are many resources online that aggregate stopword lists for any number of languages, without considering that many of those lists were developed for very particular use cases, and might, for instance, remove all words about computers, along with the more-expected prepositions.

Your stopword list should be influenced by other changes you make to your text. In general, stopword lists are all lower case, due to the lower-casing that is typically part of the text analysis process. If you lemmatize your text (as described below), you won't need to include every possible form of pronouns: just the lemma. If you don't plan to lemmatize your text before the stopword list is applied, you'll need to work through every number, gender and/or case of undesired pronouns, adjectives, verbs and so forth, to ensure they are all excluded. Remember, these methods are matching, character-for-character, what you put on the list, and including the dictionary form of a word does not by extension include all conjugations, declensions or other variant forms.

[12] See the settings for the Topic Modeling Tool (https://senderle.github.io/topic-modeling-tool/documentation/2018/09/27/optional-settings.html) or the general purpose text exploration environment Voyant (https://voyant-tools.org/docs/#!/guide/stopwords).

Lower-casing
Capital letters and lower-case letters, in bicameral writing systems (those that have the concept of capitalization, unlike Japanese, Hebrew, Georgian or Korean), are different characters from the point of view of text analysis algorithms. Dad, dad and Sad are all treated as separate words, where the latter two are both parsed as having a different first letter from the first. To address this issue, texts are commonly 'lower-cased', or converted to all lower-case characters, before they are further processed with stopword removal or used for analysis. Most text analysis tools (e.g. with graphical user interfaces, like Voyant and the Topic Modeling Tool) handle this automatically, even for non-Latin alphabets. If you're writing analysis code yourself, don't forget this step.
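If you are writing that code yourself, lower-casing and stopword filtering together amount to only a few lines of Python. The tiny Spanish stopword list here is invented for illustration; a real list would be far longer and would deserve the critical review described above.

    # A toy stopword list; real ones are much longer and need vetting.
    stopwords = {"el", "la", "los", "las", "de", "y", "que"}

    text = "El ingenioso hidalgo Don Quijote de la Mancha"
    words = text.lower().split()   # lower-case first, so 'El' matches 'el'
    content_words = [w for w in words if w not in stopwords]
    print(content_words)   # ['ingenioso', 'hidalgo', 'don', 'quijote', 'mancha']

Note that str.lower() also handles non-Latin bicameral scripts such as Cyrillic.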
Lower-casing Capital letters and lower-case letters, in bicameral writing systems (those that have the con- cept of capitalization, unlike Japanese, Hebrew, Georgian or Korean), are different characters from the point of view of text analysis algorithms. Dad, dad and Sad are all treated as separate words, where the latter two are both parsed as having a different first letter from the first. To address this issue, texts are commonly ‘lower-cased’, or converted to all lower-case characters, before they are further processed with stopword removal or used for analysis. Most text analy- sis tools (e.g. with graphical user interfaces, like Voyant and the Topic Modeling Tool) handle this automatically, even for non-Latin alphabets. If you’re writing analysis code yourself, don’t forget this step. Punctuation removal What we easily recognize as punctuation is just another character from the point of view of most algorithms. This leads to problems when the following are all treated as different ‘words’: • cats • “cats • “cats, • (cats) • cats! • cats!! • cats?! • cats. Some tools automatically remove punctuation as part of pre-processing, some tools include punctuation on the stopwords list and others require you to remove it from the text yourself. Dombrowski: Preparing Non-English Texts for Computational Analysis Art. 45, page 7 of 9 For tools that remove punctuation automatically, you should check to make sure that all the punctuation present in your language is being removed successfully. Punctuation removal may be based on English, so punctuation not found in English (such as « » or 「 」, the Russian and Japanese quotation marks, respectively) may not be included. Running the text through a tokenizer algorithm (such as the one provided by the Stanford NLP library for Python, which currently supports fifty-three languages) can also separate punctuation from text, but may make other changes you haven’t anticipated. For instance, in English, a contrac- tion like ‘she’s’ gets split into two ‘words’, she and ’s, which is a reasonable choice reflecting the word’s origins, but can lead to initial confusion when you discover the ‘word’ ’s in the results of your analysis. Lemmatizing If you’re working with a highly inflected language (i.e. if your language has multiple gram- matical cases, or a complex verbal system where different persons and numbers have dif- ferent forms), you may need to lemmatize your text to get meaningful results from any text analysis method. Lemmatization attempts to convert the word forms actually found in a text into their dictionary form. For languages with less inflection (including Romance languages), many scholars don’t feel the need to lemmatize because some methods, such as topic mod- elling, end up successfully clustering together different forms of a word, even given a small amount of variation. It could be a worthwhile activity with students to compare text analysis results with and without lemmatization for these languages. A lot of work goes into developing NLP code for lemmatizing text, and not all lemmatizers perform equally well on all kinds of text: the informal language of tweets and the formal lan- guage of newspapers are different, to say nothing of literary and historical language. English is by far the best-resourced language, given the longstanding academic and commercial inter- est in improving NLP tools for at least modern English. Many languages lack effective lem- matizers, or any lemmatizers at all. 
Lemmatizing
If you're working with a highly inflected language (i.e. if your language has multiple grammatical cases, or a complex verbal system where different persons and numbers have different forms), you may need to lemmatize your text to get meaningful results from any text analysis method. Lemmatization attempts to convert the word forms actually found in a text into their dictionary form. For languages with less inflection (including Romance languages), many scholars don't feel the need to lemmatize because some methods, such as topic modelling, end up successfully clustering together different forms of a word, even given a small amount of variation. It could be a worthwhile activity with students to compare text analysis results with and without lemmatization for these languages.

A lot of work goes into developing NLP code for lemmatizing text, and not all lemmatizers perform equally well on all kinds of text: the informal language of tweets and the formal language of newspapers are different, to say nothing of literary and historical language. English is by far the best-resourced language, given the longstanding academic and commercial interest in improving NLP tools for at least modern English. Many languages lack effective lemmatizers, or any lemmatizers at all. If there's no lemmatizer for the language that you want to work with, another possibility is to look for a stemmer. Stemmers are a shortcut to the same fundamental goal as lemmatizers: reducing variation within a text, in order to more effectively group similar words. Rather than replacing the word forms in a text with the proper dictionary form, a stemmer looks for patterns of letters to chop off at the beginning and/or end of words, to get to something similar to (but often distinct from) the root of the word. Stemmers don't effectively handle suppletive word forms (e.g. 'children' as a plural of 'child'), or other word forms that diverge from the usual grammatical 'rules', but they may work well enough to reduce overall variation in the word forms present in a text, if no lemmatizer is available. The truncated forms produced by a stemmer may, however, be harder to recognize and connect back to the original form when you're looking at the results of your analysis.

The current state-of-the-art (whatever state that may be) for lemmatizing most languages is usually not available through an easy-to-use tool: you should expect to use the command line and/or write code. As a few illustrative examples:
• For Russian, Yandex (the major Russian search engine) has released software called MyStem for lemmatizing Russian.[13] A wrapper is available that makes this code usable in Python, PyMyStem.[14] (A minimal usage sketch follows this list.)
To even begin making sense of the output of com- putational text analysis, it is important to understand how the input text was processed, and to take precautions to ensure that default settings derived from English were not applied to languages with very different grammar or orthography. Fortunately, there is a growing community of scholars working on computational text anal- ysis, and other digital humanities methods, as applied to languages other than English. For scholars working with digital humanities methods, a community has begun to form around the mailing list and resources posted on the Multilingual DH website (https://www.multilin- gualdh.org), which is applying to become a special interest group of the Alliance of Digital Humanities Organizations. These resources, and their applications to digital humanities research as well as language pedagogy, continue to be refined, and self-identified ‘newcom- ers’ are welcome and encouraged to join the conversation. Author Information Quinn Dombrowski supports digitally-facilitated research in the Division of Literatures, Cultures & Languages at Stanford University in the USA. In addition to working on digital humanities projects for a wide variety of non-English languages, Quinn serves on the Global 15 Eustagger-lite is available at http://ixa2.si.ehu.es/eustagger/. 16 KoNLPy is available at http://konlpy.org/en/latest/, along with a tutorial for how to use it for text pre-process- ing at https://lovit.github.io/nlp/2019/01/22/trained_kor_lemmatizer/. 17 The Classical Languages Toolkit is available at http://cltk.org/. 18 At the same time, see this discussion about attempts to decompose characters into radicals as if the radicals were lemmas: https://www.quora.com/Does-the-Chinese-language-have-concepts-of-lemmatization-and-stemming- just-as-English-has. http://cltk.org https://www.multilingualdh.org https://www.multilingualdh.org http://ixa2.si.ehu.es/eustagger/ http://konlpy.org/en/latest/ https://lovit.github.io/nlp/2019/01/22/trained_kor_lemmatizer/ http://cltk.org/ https://www.quora.com/Does-the-Chinese-language-have-concepts-of-lemmatization-and-stemming-just-as-English-has https://www.quora.com/Does-the-Chinese-language-have-concepts-of-lemmatization-and-stemming-just-as-English-has Dombrowski: Preparing Non-English Texts for Computational Analysis Art. 45, page 9 of 9 Outlook::DH executive board and leads Stanford’s Textile Makerspace. Quinn’s publications include “What Ever Happened to Project Bamboo?” about the failure of a digital humanities cyberinfrastructure initiative, “Drupal for Humanists”, and “Crescat Graffiti, Vita Excolatur: Confessions of the University of Chicago” about library graffiti. References Anderson, Deborah. The Script Encoding Initiative, the Unicode Consortium, and the Character Encoding Process. Signa nr. 6 April 2004. https://www.signographie.de/cms/upload/pdf/ SIGNA_Anderson_SEI_1.0.pdf. Accessed 30 January 2020. Ataman, Duygu, Matteo Negri, Marco Turchi and Marcello Federico. ‘Linguistically Motivated Vocabulary Reduction for Neural Machine Translation from Turkish to English’. Prague Bulletin of Mathematical Linguistics, vol. 108, no. 1, 2017, pp. 331–42. DOI: https://doi. org/10.1515/pralin-2017-0031 Cro, Melinda A. and Sarah K. Kearns. ‘Developing a Process-Oriented, Inclusive Pedagogy: At the Intersection of Digital Humanities, Second Language Acquisition, and New Litera- cies’. Digital Humanities Quarterly, vol. 14, no. 1, 2020. http://www.digitalhumanities. org/dhq/vol/14/1/000443/000443.html. 
Accessed 30 April 2020. DOI: https://doi. org/10.46430/phen0087 Dombrowski, Quinn, Tassie Gniady and David Kloster. Introduction to Jupyter Notebooks. The Programming Historian. 12 December 2019. https://programminghistorian.org/en/les- sons/jupyter-notebooks. Accessed 30 January 2020. Ezeiza, Nerea, Iñaki Alegria, Jose Maria Arriola, Ruben Urizar and Itziar Aduriz. ‘Combining Stochastic and Rule-Based Methods for Disambiguation in Agglutinative Languages’. Pro- ceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, vol. 1, 1998, pp. 380–4. DOI: https://doi.org/10.3115/980845.980909 Kim, Hyunjoong. 말뭉치를 이용한 한국어 용언 분석기 (Korean Lemmatizer), 22 January 2019. https://lovit.github.io/nlp/2019/01/22/trained_kor_lemmatizer/. Accessed 30 January 2020. Mao, Lei. ‘Byte Pair Encoding’. Lei Mao’s Log Book, 2019. https://leimao.github.io/blog/Byte- Pair-Encoding/. Accessed 30 January 2020. W3Techs. Distribution of character encodings among websites that use .ru. Updated 30 January 2020. https://w3techs.com/technologies/segmentation/tld-ru-/character_ encoding. Accessed 30 January 2020. How to cite this article: Dombrowski, Q 2020 Preparing Non-English Texts for Computational Analysis. Modern Languages Open, 2020(1): 45 pp. 1–9. DOI: https://doi.org/10.3828/mlo.v0i0.294 Published: 28 August 2020 Copyright: © 2020 The Author(s). This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (CC-BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. See http://creativecommons.org/licenses/by/4.0/. OPEN ACCESS Modern Languages Open is a peer-reviewed open access journal published by Liverpool University Press. https://www.signographie.de/cms/upload/pdf/SIGNA_Anderson_SEI_1.0.pdf https://www.signographie.de/cms/upload/pdf/SIGNA_Anderson_SEI_1.0.pdf Introduction Goals Audience Text encoding What is Unicode? Why is Unicode important? Making sure your text uses Unicode encoding Segmentation Stopwords Lower-casing Punctuation removal Lemmatizing Conclusion Author Information References Figure 1
Ergonomic Design of a Main Control Room of Radioactive Waste Facility Using Digital Human Simulation

Baekhee Lee (1), Yoon Chang (2), Kihyo Jung (3), Ilho Jung (4), and Heecheon You (1)
(1) Division of Mechanical and Industrial Engineering, Pohang University of Science and Technology, Pohang, South Korea
(2) Department of Production System, LG Electronics, Pyeongtaek, South Korea
(3) School of Industrial Engineering, University of Ulsan, Ulsan, South Korea
(4) Department of Nuclear, Power & Energy Plant Division, Hyundai Engineering, Seoul, South Korea

The present study evaluated a preliminary main control room (MCR) design of a radioactive waste facility using the JACK® digital human simulation system. Four digital humanoids (5th, 50th, 95th, and 99th percentiles) were used in the ergonomic evaluation. The first three were selected to represent 90% of the target population (Korean males aged 20 to 50 years), and the last to reflect the secular trend of stature over the next 20 years in South Korea. The preliminary MCR design was assessed by checking its compliance with ergonomic guidelines specified in NUREG-0700 and by conducting an in-depth ergonomic analysis of a digital prototype of the MCR design with the digital humanoids in terms of postural comfort, reachability, visibility, and clearance. For identified design problems, appropriate design changes and their validity were examined using JACK. The revised MCR design suggested in the present study would contribute to effective and safe operation of the MCR as well as to operators' health in the workplace.

INTRODUCTION
A radioactive waste facility (RWF) is a facility for managing radioactive waste, which is usually a by-product of nuclear power generation and of other applications of nuclear fission or nuclear technology, such as research and medicine. Most radioactive waste in South Korea has been stored in temporary facilities at nuclear power plants (NPPs), so the Korean government planned to establish an RWF in Gyeongju by the year 2012, considering the projected saturation of these temporary facilities (KRMC, 2009). The main control room (MCR) of the RWF needs to be designed with ergonomics in mind at the initial design stage, both for effective monitoring by operators and for reduction of development cost. Hwang et al. (2009) analyzed three usability issues (the operating interface of the displays and controls in the MCR, the usability of procedures, and the layout of the MCR) through ergonomic evaluation of an MCR. Ku et al. (2007) evaluated the MCRs of several NPPs (units 1-4 of the Kori NPP and units 1-2 of the Yeonggwang NPP) by applying an ergonomic evaluation checklist as part of the periodic safety review (PSR). Evaluating an MCR after it has been developed is useful for identifying design improvements; on the other hand, developing an improved MCR at that point requires considerable time and cost. Ergonomic evaluation at the initial design stage is therefore needed for effective MCR design and development.

Digital human simulations (DHS) using humanoids have been used for ergonomic design of workplaces. Lee et al. (2005) and Park et al. (2008) carried out ergonomic evaluations using DHS and analyzed design improvements of an overhead crane and a helicopter cockpit, respectively (Figure 1).
Ergonomic design and evaluation using virtual mockups in DHS at the initial design stage has been recommended as a useful way to reduce development time and cost (Chaffin, 2005; You, 2007).

Figure 1. Ergonomic evaluation using digital human simulation: (a) overhead crane, (b) helicopter cockpit

The present study evaluated preliminary designs of the MCR of the RWF and analyzed design improvements. 3D virtual mockups of the MCR of the RWF were developed for use in DHS. We used JACK® for DHS and generated four representative human models (5th, 50th, 95th, and 99th percentiles) based on the anthropometric data of Size Korea (2004) and the secular trend of stature over the next 20 years. The preliminary designs of the MCR of the RWF were evaluated in terms of four ergonomic aspects (postural comfort, reachability, visibility, and clearance) and analyzed to identify the design components needing improvement and the direction of that improvement.

METHODS

Representative Human Models
Four representative human models were generated for ergonomic evaluation using DHS, chosen to achieve an accommodation percentage of 90% (5th ~ 95th percentiles) for the target population and to reflect the secular trend of stature over the next 20 years. The target population, consisting of males aged 20 to 50, was determined in consideration of workforce planning in the MCR of the RWF.
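As an illustration of how such percentile-based models are derived, here is a short Python sketch with a synthetic stature sample standing in for the Size Korea data (which is not reproduced here); the mean and spread are chosen only to roughly echo the values reported below.

    import numpy as np

    rng = np.random.default_rng(0)
    # Synthetic stature sample (cm); the real study used n = 1,992 measurements.
    stature = rng.normal(loc=170.2, scale=6.0, size=1992)

    for p in (5, 50, 95, 99):
        print(f"{p}th percentile stature: {np.percentile(stature, p):.1f} cm")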
Generated representative human models Reference Posture for Evaluation The present study established an operators’ monitoring posture referring to existing studies related to computer workstation postures for DHS evaluation as shown in Figure 4. The existing studies observed and analyzed reference postures at computer workstation (ANSI/HFES, 2007; Chaffin and Andersson, 1984; Grandjean et al., 1983; Salvendy, 1987). In this study, the reference posture for evaluation as shown in Figure 4 was chosen considering the operator’s posture, similar to postures at a computer workstation, for monitoring tasks in the MCR of the RWF. For example, the degree of shoulder abduction was determined as 13°, which is a median degree provided by Chaffin and Andersson (1984)’s recommended range (0 ~ 25°). 90º 95º 13º 80º10º 35º 13º (a) Side view (b) Front view Figure 4. Reference posture of operators in the MCR Ergonomic Evaluation Criteria The present study established a relationship matrix between four ergonomic evaluation criteria and seven design components in the MCR (Table 1). Ergonomic evaluation criteria were determined as postural comfort, reachability, visibility, and clearance which were used in the existing DHS studies (Bowman, 2001; Nelson, 2001; Park et al., 2008). Selected ergonomic evaluation criteria were selectively applied with target design components. For example, Table 1 shows that console being seated by the operator was evaluated using postural comfort and clearance, and large display panel (LDP) providing information about the RWF was analyzed using postural comfort and visibility. The design components of the MCR of the RWF were evaluated using NUREG-0700 design guideline. NUREG- 0700 design guideline (O’Hara et al., 2002) provides ergonomic design parameters of each design component in the NPP. For example, according to NUREG-0700, the console’s clearance should provide adequate height, depth, and knee clearance for the 5th to 95th percentile adults, LDP’s visibility should permit operators at the consoles a full view of all display panels, and LCD’s vertical viewing angle of visibility should not be more than 20° above and 40° below the operator’s horizontal line of sight. PROCEEDINGS of the HUMAN FACTORS and ERGONOMICS SOCIETY 56th ANNUAL MEETING - 2012 1913 Table 1. Relationship matrix between ergonomic evaluation criteria and design components (O: related, X: not related) No. Design component Postural comfort Reach- ability Visib- ility Clear- ance 1 Console O X O O 2 Large display panel (LDP) O X O X 3 LCD O X O X 4 Security access control sub- console O O X X 5 CCTV master control rack O O X X 6 Main fire control panel O O X X 7 Printers O O X X RESULTS In this study, we show ergonomic evaluation results of three major design components (console, LDP, and LCD) of the MCR of the RWF. Console’s minimum clearance which was analyzed as 1.6 ~ 6 cm for 4 humanoids was adequately evaluated in terms of the NUREG-0700. Minimum clearance was calculated as the least distance between operator’s leg and console. The more body sizes of humanoid increase, the more clearance of console decrease. For example, Figure 5 shows that 95th and 99th percentile’s minimum clearance were 3.5 cm and 1.6 cm respectively. (a) 95th percentile (b) 99th percentile Figure 5. Clearance of the console for operator’s upper leg LCD’s vertical gaze range (VGR) was analyzed satisfying with the NUREG-0700 design guideline. 
The LCD's VGR was calculated as the humanoid's vertical viewing angle when the humanoid, in the reference posture, looked at the top and bottom of the LCD. For example, as shown in Figure 6, the LCD VGRs of the 5th and 95th percentiles (5th percentile: -29 ~ 1°; 95th percentile: -34 ~ -4°) satisfied the range of -40 ~ 20° recommended by the NUREG-0700 design guideline. The LDP's VGR, by contrast, could cause postural discomfort when operators monitor for a long time, because it lies above the horizontal line of sight (0°). The LDP's VGR was calculated with the humanoid in the reference posture looking at the top and bottom of the LDP, which is mounted above the 125-cm-high LCD. For example, as shown in Figure 7, the 5th percentile's LDP VGR (2 ~ 23°) is formed entirely above the top of the LCD (Figure 7.a).
Figure 6. Vertical gaze analysis, LCD: (a) 5th percentile, (b) 50th percentile, (c) 95th percentile, (d) 99th percentile
Figure 7. Vertical gaze analysis, LDP (125 cm): (a) 5th percentile, (b) 50th percentile, (c) 95th percentile, (d) 99th percentile
The LDP VGRs of all humanoids (-1 ~ 23°) met the NUREG-0700 design guideline that the LDP should permit operators at the consoles a full view (Figure 7). However, the current LDP design, whose VGR lies above the horizontal line (0°), could cause fatigue and postural discomfort during long monitoring tasks according to existing studies of recommended display gaze ranges (-26 ~ -2°, Grandjean et al., 1983; -56 ~ -1°, Kim et al., 1991; -40 ~ 20°, O'Hara et al., 2002). To improve the LDP's VGR by lowering the LDP, the analysis showed that the LCD's height would have to be lowered along with it. It was found that the LDP's VGR could be improved by reducing the LDP's height, but interference between the LDP's and the LCD's VGRs could then appear, as shown in Figure 8. To resolve this interference, we designed a groove recessed into the console, as shown in Figure 9. When the LDP's height was reduced to 115 cm by means of the 10-cm-deep LCD installation groove, the LDP's VGR improved to -3 ~ 19° (Figure 10). As a result, the improved LDP VGR is lower than the existing one (-1 ~ 23°); for example, the 5th percentile's LDP VGR improved from 2 ~ 23° to 0 ~ 19°. Meanwhile, the LCD's VGR (-31 ~ 2.5°) still satisfied the NUREG-0700 design guideline (-40 ~ 20°) in the improved design.
Figure 8. Vertical gaze interference between LCD and LDP
Figure 9. Installation groove of the LCD in the console: (a) LCD installation groove, (b) installed LCD
Figure 10. Vertical gaze analysis, improved LDP (115 cm): (a) 5th percentile, (b) 50th percentile, (c) 95th percentile, (d) 99th percentile
The LDP's horizontal gaze range (HGR) was found to satisfy the NUREG-0700 design guideline that the operator's HGR should be within 30° of the center of the LDP. The MCR of the RWF is planned to be staffed by an operator (operating the 7 consoles on the left) and a supervisor (operating the 3 consoles on the right), as shown in Figure 11. The LDP's HGR was calculated as the horizontal gaze interval when the operator and the supervisor each monitored the LDP's left and right points from the center of the LDP. The operator's and the supervisor's HGRs were 12 ~ 27° and 14 ~ 26°, respectively, according to their assigned console positions.
Figure 11. Horizontal gaze analysis, LDP: (a) operator, (b) supervisor
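Since several of the results above amount to checking a computed gaze range against a guideline band, a small helper like the following could automate the comparison. The thresholds are the NUREG-0700 values quoted in the text; the code structure is illustrative and is not the software used in the study.

```python
# NUREG-0700 bands quoted in the text (degrees, relative to the horizontal line of sight).
GUIDELINES = {
    "LCD_VGR": (-40.0, 20.0),  # vertical viewing angle for the LCD
    "LDP_HGR": (0.0, 30.0),    # horizontal gaze within 30 deg of the LDP center
}

def check(criterion: str, observed: tuple) -> bool:
    lo, hi = GUIDELINES[criterion]
    ok = lo <= observed[0] and observed[1] <= hi
    print(f"{criterion}: {observed} within ({lo}, {hi})? {ok}")
    return ok

check("LCD_VGR", (-29.0, 1.0))  # 5th percentile result reported above
check("LDP_HGR", (12.0, 27.0))  # operator result reported above
```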
DISCUSSION
The present study analyzed the preliminary design of the MCR of the RWF through ergonomic evaluation in a digital environment using JACK, with reference to the NUREG-0700 design guideline. The evaluation of the MCR of the RWF considered four ergonomic aspects (postural comfort, reachability, visibility, and clearance), the NPP design guidelines provided by NUREG-0700, and references on ergonomic computer workstation design. For the design components identified through digital human simulation as needing improvement, ergonomic solutions were developed and evaluated to analyze their effects. The improved preliminary design from this study can contribute to the future MCR design of the RWF.
The present study applied representative human models when creating humanoids in JACK, taking Korean anthropometric characteristics and the secular trend of stature into account. Three representative human models were generated, based on the demographic characteristics of the operators in the MCR of the RWF, to accommodate 90% (5th ~ 95th percentiles) of males aged 20 to 50 in Size Korea (2004). Additionally, one representative human model at the 99th percentile was generated for the next 20 years, reflecting the secular trend of operator stature based on Korean stature data from 1979 to 2004.
The present study used estimates of three anthropometric variables (hand breadth, head length, and thumb-tip reach) provided by JACK; these variables are highly correlated with other variables. JACK generates a humanoid from 27 body dimensions, and any dimensions not provided as input are estimated automatically. The present study conducted a post hoc analysis of the 3 missing anthropometric variables through stepwise regression (p_in = 0.05, p_out = 0.1) against the other 24 anthropometric variables, using US Army anthropometric data (Gordon et al., 1988). The resulting regression equations for the 3 missing variables had high adjusted coefficients of multiple determination (adj. R² = 52%, hand breadth; 83%, head length; 84%, thumb-tip reach).
The present study established the reference posture for evaluation based on the computer workstation postures reported in existing studies. However, the reference posture in the MCR may differ from recommended postures at a computer workstation (which has only one display), because more than two displays (LDP and LCD) are installed in the MCR. Consideration of the monitoring tasks for both the LDP and the LCD may therefore be needed for a more appropriate evaluation of the MCR of the RWF.
ACKNOWLEDGMENTS
This research was supported by Korea Power Engineering Company (KOPEC).
REFERENCES
ANSI/HFES (2007). Human Factors Engineering of Computer Workstations. California, USA: Human Factors and Ergonomics Society.
Arcaleni, E. (2006). Secular trend and regional differences in the stature of Italians, 1854-1980. Economics and Human Biology, 4, 24-38.
Bielicki, A. and Szklarska, A. (1999). Secular trends in stature in Poland: national and social class-specific. Annals of Human Biology, 26(3), 251-258.
Bowman, D. (2001). Using digital human modeling in a virtual heavy vehicle development environment. In Chaffin, D. B. (Ed.), Digital Human Modeling for Vehicle and Workplace Design. Warrendale, PA: SAE International.
Chaffin, D. B. (2005).
Improving digital human modeling for proactive ergonomics in design. Ergonomics, 48(5), 478-491.
Chaffin, D. B. (2001). Digital Human Modeling for Vehicle and Workplace Design. Pennsylvania, USA: SAE International.
Chaffin, D. B. and Andersson, G. (1984). Occupational Biomechanics (2nd ed.). New York, USA: Wiley-Interscience.
Gordon, C. C., Bradtmiller, B., Churchill, T., Clauser, C., McConville, J., Tebbetts, I. and Walker, R. (1988). 1988 Anthropometric Survey of US Army Personnel: Methods and Summary Statistics (Technical Report NATICK/TR-89/044). US Army Natick Research Center: Natick, MA.
Grandjean, E. (1987). Ergonomics in Computerized Offices. Philadelphia, USA: Taylor & Francis.
Grandjean, E., Hunting, W. and Pidermann, M. (1983). VDT workstation design: Preferred settings and their effects. Human Factors, 25, 161-175.
Hedge, A. and Powers, J. A. (1995). Wrist postures while keyboarding: Effects of a negative slope keyboard system and full motion forearm supports. Ergonomics, 38, 508-517.
Hwang, S.-L., Liang, S.-F. M., Liu, T.-Y. Y., Yang, Y.-J., Chen, P.-Y. and Chuang, C.-F. (2009). Evaluation of human factors in interface design in main control rooms. Nuclear Engineering and Design, 239, 3069-3075.
Kim, C., Lee, N., Jang, M., and Kim, J. (1991). Research on Ergonomic Design and Evaluation Technology for VDT Workstation. Korea Research Institute of Standards and Science.
Korea Radioactive Waste Management Corporation (KRMC) (2009). Radioactive Waste. Retrieved August 21, 2009 from http://www.krmc.or.kr.
Ku, J., Jang, T., Lee, J., and Lee, Y. (2006). A review of human factors criteria for the main control room MMI in nuclear power plants. In Proceedings of the 2006 Fall Conference of the Ergonomics Society of Korea.
Lee, S., Kwon, O., Park, J., Cho, Y., Lee, M., You, H., and Han, S. (2005). Development of a Workload Assessment Model for Overhead Crane Operation. In Proceedings of the 2005 Fall Conference of the Ergonomics Society of Korea.
NASA (2006). Man-system integration standards. Retrieved September 22, 2009 from http://msis.jsc.nasa.gov/Volume1.htm.
National Institute of Advanced Industrial Science and Technology (AIST) (2006). Secular change in Japan. Retrieved January 11, 2009, from http://www.dh.aist.go.jp/research/centered/anthropometry/secular.php.en.
Nelson, C. (2001). Anthropometric Analyses of Crew Interfaces and Component Accessibility for the International Space Station. In Chaffin, D. B. (Ed.), Digital Human Modeling for Vehicle and Workplace Design. Warrendale, PA: SAE International.
O'Hara, J. M., Brown, W. S., Lewis, P. M. and Persensky, J. J. (2002). Human-System Interface Design Review Guidelines (DC 20555-0001). U.S. Nuclear Regulatory Commission, Office of Nuclear Regulatory Research.
Park, J., Jung, K., Lee, W., Kang, B., Lee, J., Eom, J., Park, S., and You, H. (2008). Development of an Ergonomic Assessment Method of Helicopter Cockpit using Digital Human Simulation. In Proceedings of the 2008 Spring Conference of the Ergonomics Society of Korea.
Padez, C. and Johnston, F. (1999). Secular trends in male adult height 1904-1996 in relation to place of residence and parent's educational level in Portugal. Annals of Human Biology, 26(3), 287-298.
National Center for Health Statistics (2004). Hyattsville, Maryland, 1995.
Size Korea (2004). Statistics of Korean anthropometry. Retrieved September 26, 2009 from http://sizekorea.kats.go.kr.
You, H. (2007). Digital Human Model Simulation for Ergonomic Design of Tangible Products and Workplaces.
In Proceedings of the 2007 Fall Conference of the Ergonomics Society of Korea.
CITY, UNIVERSITY OF LONDON, MSC LIBRARY SCIENCE
INM 380 LIBRARIES & PUBLISHING IN AN INFORMATION SOCIETY, ERNESTO PRIEGO
MAY 2017, ASSIGNMENT OPTION 3
IDENTIFY THE MAIN WAYS IN WHICH TRANSFORMATIONS IN PUBLISHING ARE CHANGING THE WAY PEOPLE DO RESEARCH. WHAT ARE THE RELATIONSHIPS BETWEEN PUBLISHING AND DIGITAL SCHOLARSHIP? AND WHAT DO THESE RELATIONSHIPS MAKE POSSIBLE? WHAT ARE SOME CHALLENGES AND OPPORTUNITIES FOR PUBLISHERS AND/OR LIBRARIES IN THE CONTEXT OF THE NEW DEVELOPMENTS IN DIGITAL SCHOLARSHIP?
WORD COUNT 3499, INCLUDING TITLES; ESTIMATED READING TIME: 18 MIN
Mariana Strassacapa Ou
Publishing as Sharing: OBSERVATIONS FROM ORAL HISTORY PRACTICES IN THE DIGITAL HUMANITIES
Despite the evident general feeling that we experience an information deluge in our daily lives, whether ours is an 'information society' is a subject of great debate. The term implies that 'information' is the very defining aspect of today's society, rather than 'agriculture', for example (Bawden & Robinson, 2012); it also implies that at some point in the twentieth century a revolution took place, one that replaced a previous 'industrial society' with the current 'information society' as it fundamentally disrupted the technologies and cultural practices of human communication. Even though I am not convinced by the idea that we live in a 'new' kind of society, and rather prefer interpretations that identify all the continuities of modernist and capitalist developments through the last century, it is undeniable that in recent decades transformations in mediated communication have accelerated the production and dissemination of information enormously, increasing the complexity of the ways people interact (Borgman et al., 2008). The widespread use of the Internet and the World Wide Web through cheap, personal digital computing devices is largely responsible for these profound transformations; the term 'digital', originally applied as synonymous with discrete electronic processing techniques, came to refer to anything related to computers, from electronics to social descriptors (digital divides, digital natives) to emerging fields of inquiry (digital art, digital physics) (Peters, 2016). 'Digital scholarship' fits the latter category; according to Christine Borgman, it 'encompasses the tools, services, and infrastructure that support research in any and all fields of study' (2013). Clearly this is a quite broad definition, but it does express the essential idea that scholarly practices and research opportunities have been widened through many new forms of support. As I will argue here, a leading force defining digital scholarship has been the generalisation, in the digital milieu, of publishing as sharing.
'Sharing' as the new rhetoric of publishing
In the book Digital Keywords: A Vocabulary of Information Society & Culture, Nicholas John scrutinises the term 'sharing' in the meanings it has recently acquired through use in the digital realm.
Non-metaphorically, John explains, to share is to divide, and since at least the sixteenth century it has referred to the distribution of scarce resources; recently, though, it has also been attributed a more abstract communicative dimension: 'a category of speech, a type of talk, characterised by the qualities of openness and honesty, and commonly associated with the values and virtues of trust, reciprocity, equality, and intimacy, among others'; it has become 'the model for a digitally based readjustment of our interactions with things (sharing instead of owning) and with others' (John, 2016). Furthermore, 'sharing' would also mean a positive attitude with regard to future society; John talks in terms of the promise of sharing:
The promise of sharing is at least twofold. On the one hand, there is the promise of honest and open (computer-mediated) communication between individuals; the promise of knowledge of the self and of the other based on the verbalisation of our inner thoughts and feelings. On the other hand, there is the promise of improving what many hold to be an unjust state of affairs in the realms of both production and consumption; the promise of an end to alienation, exploitation, self-centred greed, and breathtaking wastefulness. (John, 2016)
Publishing after the digital boom, and specifically after the Internet and the World Wide Web took over a large share of our usual communication routines, has, I argue, a meaning that is becoming more and more intertwined with that of 'sharing' as discussed here. Digital publishing and 'sharing' are intertwined in that both follow a 'distributive logic' that is more sustainable than, and an alternative to, capitalist models of production and consumption (John, 2016); publishing has had its definition widened, along with its actors and subjects, and, just like 'sharing', it 'plays heavily on interpersonal relations, promising to introduce you to your neighbours, for instance, or to reinstate the sense of community that has been driven out by, say, the alienation supposedly typical of modern urban life' (John, 2016): it is now part of everybody's daily activities, and not just a specialised profession.
This new notion of 'publishing as sharing' accords with the new paradigm of openness in digital scholarship. Publishing processes had to be readapted, some of them radically, both to developments in digital technologies and to pervasive digital 'sharing'; when it comes to academic publishing and research practices, that means 'open scholarship', as in making your research data available in a repository for consultation and reuse; 'open access', as in making academic articles in digital journals, which would previously have been charged for, free to read; and 'open dissemination', as in the idea behind institutional websites like the Oxford University Research Archive (two screenshots below), a friendly, searchable repository of research outputs, including many open-access articles.
In this essay, I use the debates on Oral History in the Digital Humanities to support the presentation of some of the relationships between publishing and digital scholarship and their implications, as well as challenges and opportunities that should concern those involved in both publishing and library & information science.
NEW STANDARDS IN ORAL HISTORY: widening scholarship practices through digital publishing
The transformations in scholarship brought about by the universe of digital possibilities and the World Wide Web abound, but few fields have been impacted as much as oral history. In the introduction to Oral history in the digital humanities: voice, access, and engagement (Boyd & Larson, 2014), the authors provide an overview of the developments in oral history and highlight how they were heavily influenced by the changing recording technologies of the last decades; if affordable and accessible new analogue technologies helped establish oral history as a compelling methodology for historical research in the 1960s, the transcription of the audio recordings still posed a great challenge from the library/archival perspective: as text, transcripts were considered a more efficient form of communication than the recording, easier to go through when looking for specific bits of information; 'without the transcript, the archive might have no more information about an oral history interview on its shelves beyond a name, a date, and the association with a particular project', and oral history collections (of cassettes) were always under the threat of obscurity, with no prospect of use or discovery (Boyd & Larson, 2014). Digital technologies, however, came to solve not only these problems but, with the World Wide Web, also to give new and wider meanings to access; as the authors point out, 'Digital technologies posed numerous opportunities to explore new models for automating access and providing contextual frameworks to encourage more meaningful interactions with researchers as well as with community members represented by a particular oral history project'. In this essay, I present the main changes in publishing after the 'digital shift' (publishing = sharing) as we can identify them in oral history's new practices of research and dissemination:
1 • the 'democratic spirit'
Boyd & Larson talk about a 'democratic spirit' found in both oral history and the digital humanities as 'the sense that the materials created, shared, generated, or parsed belong to everyone—not just to the educated or the well-to-do, but to those outside the university walls as well as those within'. Indeed, oral historians are obviously interested in history from the 'bottom up', the kind that can be found and captured in common people's voices, and are thus characterised by adopting a more 'democratic' approach to historical inquiry, one that assumes collective participation in the creation of materials; in combination with the digital humanities, this inclusion of people in the creation process extends also to people's access to these materials (Boyd & Larson, 2014); oral history's 'democratic' values and preconditions are enhanced by, and find fertile ground in, digital publishing. As we can read in the Founding Statement of The Journal for MultiMedia History of the University at Albany, a website that used to publish oral history collections:
[it is] because so much of what we were doing as professional historians seemed so isolating that we wanted to 'get out on the Web', to reach not only academicians, but an entire universe of interested readers.
We wanted to bring serious historical scholarship and pedagogy under the scrutiny of amateurs and professionals alike, to utilise the promise of digital technologies to expand history's boundaries, merge its forms, and promote and legitimate innovations in teaching and research that we saw emerging all around us. (Zahavi & Zelizer, 1998)
I understand this 'democratic spirit', as Boyd & Larson put it, as a manifestation of one of the transitions in authorship in the digital realm, 'From Intellectual Property to the Gift Economy', suggested by Kathleen Fitzpatrick in her book Planned obsolescence: publishing, technology, and the future of the academy. If academics and publishers are to restore scholarly communication's origins and work towards genuinely open practices of producing and sharing academic content, she argues, then scholars must embrace Creative Commons licenses for their work, 'thus defining for themselves the extent to which they want future scholars to be able to reuse and remix their texts, thereby both protecting their right to be credited as the author of their texts and contributing to a vibrant intellectual commons that will genuinely "promote the Progress of Science and useful Arts"' (Fitzpatrick, 2011; citing the U.S. Constitution).
Oral history research output has always been a complicated type of material in terms of authorship, ownership, and rights; whole collections cannot be made accessible because of copyright issues, e.g. when the interviewer has died without leaving behind any documentation on the matter. But online, it is becoming more common to apply CC licenses to oral history interviews through the interviewees' consent forms; in the words of one oral historian, this approach 'clearly keeps the copyright in the hands of the oral history interview participant, but allows us to freely share the recording and transcript on our open-access public history website and library repository, where individuals and organisations may copy and circulate it, with credit to the original source' (Simpson, 2012). The 'democratic' solution seems to be already available to academics, but the challenge now is to promote the CC license as such; the academic and librarian Jane Secker seems to be on the right track when she frames 'copyright literacy' as closely related to information literacy and of concern to everyone who 'owns a device with access to the internet' (Secker, 2017).
2 • 'share your story': authorship, collaboration, crowdsourcing
Co-authorship in interviewing projects is nothing new, but collaborative work tends to become the norm when we consider oral history as related to, and part of, the digital humanities. If oral history has always been distinct from other practices in the humanities, in that it often carries a certain complexity with regard to authorship (who is the author of an interview: the interviewer, the interviewee, or both? Or neither?), this complexity has been successfully embraced in the digital realm. With crowdsourced websites like StoryCorps.org and AntiEvictionMappingProject.net (below), anyone is encouraged to 'share their story' and take part as an author of a larger narrative, composed of the collection of stories that assemble a shifting, growing whole.
Furthermore, as an oral history collection is published online and becomes a website, new roles arise which can arguably be said to correspond to that of an author: 'While there are always two (and sometimes more) participants in the initial recording of an oral history, I would argue that there are three primary players in the presentation and preservation of a digital oral history once it has been recorded—the oral historian, the collection manager, and the Information Technology (IT) specialist. These three roles may, in some programs, actually be represented by the same person, but there are specific concerns and responsibilities particular to each' (Schneider, in Boyd & Larson, 2014). In that sense, oral history is indeed in conformity with the basis of the digital humanities, understood in contrast to the essentially mono-authorial and monographic traditional processes and outputs of research in the humanities; as The DH Manifesto 2.0 states: 'Digital Humanities = Co-creation' (The Digital Humanities Manifesto 2.0, 2009; in Boyd & Larson, 2014).
This is not to say that the digital humanities have not been disruptive to previous practices in the humanities; on the contrary, it appears that the sciences have found continuity and enhancement of their procedures and methods in the digital realm, given that, as Gross & Harmon argue, in the sciences 'collaboration was already flourishing; the Internet greatly facilitated it, among not only networked scientists from around the globe but also armies of citizen-scientists participating through websites like GalaxyZoo' (Gross & Harmon, 2016). Knowledge in the humanities, in contrast, the authors argue, builds up as 'a chain of individual achievements. Even in the 21st century, collaboration in the humanities, though more common than previously, is not common at all. When it does occur, only two scholars are usually involved. There is a sense that these achievements ought to be individual.' The humanities seem to be lagging behind the sciences in embracing the web's possibilities, as we can see from some online journals: The Oral History Review by Oxford Academic, for example, presents no audio recording files or any other interactive feature, just the traditional authorial text article as a PDF. Institutional digital publishing in the humanities would greatly benefit from more 'digital' explorations of content and linking, but that obviously involves difficult changes in well-established mindsets and practices with regard to the notion of the strong individual author and the conventional, recognition-providing, text-based academic journal article.
3 • 'archive everything'
A habit that is being abandoned, thanks to the possibilities of digital archiving and storage, is that of discarding the audio recordings of oral history once they have been transcribed. Now, researchers are not only able to keep the audio recordings and their many versions and editions, but can also house and organise the interview collections using digital repositories and content management systems like CONTENTdm, and enhance access to the interviews with OHMS (Oral History Metadata Synchronizer), which connects search terms with the online audio or video (website screenshot below) (Boyd & Larson, 2014). Usability and discoverability issues are being sorted out by the 'archive everything' (Giannachi, 2016) trend that comes with publishing-as-sharing practices.
The 'archive everything' new paradigm is becoming such a norm in digital scholarship that Fitzpatrick talks about a 'database-driven scholarship', referring to new kinds of research questions made possible through the online availability of collections of digital objects (Fitzpatrick, 2014). Nyhan & Flinn also mention a 'rubric' in the present research agenda of the digital humanities as one that looks back at humanities questions long asked and attempts to ask them in new ways, and to identify new questions that could not be conceived of or explored before (Nyhan & Flinn, 2016); academic digital datasets, databases and archives are largely responsible for, and enablers of, these new opportunities. Gross & Harmon use a prize-winning monograph as an example of how current possibilities help 'historians see anew': Pohlandt-McCormick's research on the Soweto uprising uses 'photographs and official documents as an archive that can supplement, even interrogate the traditional historical archive. Her monograph contains 743 images and reproductions of some 200 written documents in all, a trove hard to imagine in a conventional book. These images and documents are reproduced in an "Archive" in her e-book, and select ones are integrated into the text and hyperlinked to supplementary information.' (Gross & Harmon, 2016).
Of course, database and archival academic websites are not just products of research; they are increasingly made available as opportunities for other researchers to develop new inquiries from them. That is one of the ideas behind making research data accessible as a requirement in journal publications; Gross & Harmon cite Science's stated policy as now typical: 'As a condition of publication, authors must agree to make available all data necessary to understand and assess the conclusions of the manuscript to any reader of Science'. With 'archive everything' practices and the emergence of digital collections of data and documents comes the increasing significance of the activity of curation, meaning 'making arguments through objects as well as words, images, and sounds' (Digital Humanities Manifesto 2.0, 2009). For Fitzpatrick, curation relates to another shift in authorship, which she identifies as 'from originality to remix':
CONCLUSION academic publishing should be about sharing Layers of London is a project being undertaken in the University of London’s Institute of Historical Research, funded by the Heritage Lottery Fund; It ‘will bring together, for the first time, digitised heritage assets provided by key partners across London including: the British Library, London Metropolitan Archives, Historic England, The National Archives, MOLA. These will be linked in an innovative new website which will allow you to create and interact with many different layers of London’s history from the Romans to the present day. The layers include historic maps, images of buildings, films as well as information about people who have lived and worked in London over the centuries.’ (screenshot below) (Layers of London, 2017). It is still being developed at this moment, but it is working hard on its dissemination, as ‘a major element of the project will be work with the public at borough level and city-wide, through crowd-sourcing, volunteer, schools and internship programmes. Everyone is invited to contribute material to the project by uploading materials relating to the history of any place in London. This may be an old photograph, a collection of transcribed letters, or the results of local research project’ (Layers of London, 2017). So, instead of an individual historical research on London mapping that would traditionally be published as textual product, Layers of London is an open, funded website being built in an academic institution as platform for voluntary contributions; it has a blog, a twitter account, and instead of an ‘author’, a team of director, development officer, administrator, and digital mapping advisor. It represents all shifts in authorship as proposed by Fitzpatrick: ‘from product to process’; ‘from individual to collaborative’; ‘from originality to remix’; ‘from intellectual property to the gift economy’; and ‘from text to… something more’ (Fitzpatrick, 2011); and just like contemporary oral history projects, its success will be ‘measured by metrics pertaining to accessibility, discovery, engagement, usability, reuse, and … impact on both community and scholarship.’ (Boyd & Larson, 2014). As an open digital humanities work that fully embraces the possibilities of the web, however, it faces all the challenges that this kind of academic digital publication today usually does, including the recognition that it might even count as academic research. Fitzpatrick points out: ‘The key, as usual, will be convincing ourselves that this mode of work counts as work—that in the age of the network, the editorial or curatorial labor of bringing together texts and ideas might be worth as much as, perhaps even more than that, production of new texts.’ (Fitzpatrick, 2011). This ‘convincing ourselves’ effort involves the difficult task of rethinking university practices and the academic career, which simply cannot afford to shy away from the disruptive impact of digital publishing as sharing. The humanities in special has been trying to work itself out with the digital humanities; according to Nyhan & Flinn, another ‘rubric’ of the DH ‘has a distinct activist mission in that it looks at structures, relationships and processes that are typical of the modern university (for example, publication practices, knowledge creation and divisions between certain categories of staff and faculty) and questions how they may be reformed, re-explored or re-conceptualised.’ (Nyhan & Flinn, 2016). 
It must be a concern and responsibility of the university to establish and guarantee academic publishing as sharing, addressing today’s unsustainable models of publishing and embracing the shifting, more open forms of scholarly communication and research; I agree with Fitzpatrick: ‘Publishing the work of its faculty must be reconceived as a central element of the university’s mission.’ (Fitzpatrick, 2011). Librarians have significant roles to perform on this mission; the web is not a library, but librarians can help ensure it is used in its full potential: as a world wide networked communication system. And can help to let publishing be about sharing. REFERENCES Antieviction Mapping Project: Documenting the dispossessions and resistance of SF Bay Area residents, (2014-2017). Home. [online] Available at: http://www.antievictionmap.com/#/we-are-here-stories-of- displacement-and-resistance/ [Accessed 02 May 2017]. Bawden, D. and Robinson, L. (2012). Introduction to Information Science. London: Facet. Borgman, C. (2013). Digital scholarship and digital libraries: past, present, and future. Keynote Presentation, 17th International Conference on Theory and Practice of Digital Libraries, Valletta, Malta. Available at: http:// works.bepress.com/borgman/273/ [Accessed 01 May 2017]. Borgman, C., Abelson, H., Dirks, L., Johnson, R., Koedinger, K., Linn, M., … Szalay, A. (2008). Fostering Learning in the Networked World: The Cyberlearning Opportunity and Challenge. National Science Foundation. Available at: https://www.nsf.gov/pubs/2008/nsf08204/nsf08204.pdf [Accessed 01 May 2017]. Boyd, D. and Larson, M. (2014) Introduction. In: Boyd. D. and Larson, M., eds., Oral history and digital humanities: voice, access, and engagement. New York: Palgrave Macmillan US. The Digital Humanities Manifesto. (2009). [online] Available at: http://manifesto.humanities.ucla.edu/ 2009/05/29/the-digital-humanities-manifesto-20/ [Accessed 04 May 2017]. Dougherty, J. and Simpson, C. (2012). Who owns oral history? a creative commons solution. In: Boyd, D., Cohen, Rakerd, S. and D. Rehberger, eds., Oral history in the digital age. Institute of Library and Museum Services. Available at: http://ohda.matrix.msu.edu/2012/06/a-creative-commons-solution/ [Accessed 02 May 2017]. http://www.antievictionmap.com/#/we-are-here-stories-of-displacement-and-resistance/ http://www.antievictionmap.com/#/we-are-here-stories-of-displacement-and-resistance/ http://works.bepress.com/borgman/273/ http://works.bepress.com/borgman/273/ https://www.nsf.gov/pubs/2008/nsf08204/nsf08204.pdf http://manifesto.humanities.ucla.edu/2009/05/29/the-digital-humanities-manifesto-20/ http://manifesto.humanities.ucla.edu/2009/05/29/the-digital-humanities-manifesto-20/ http://ohda.matrix.msu.edu/2012/06/a-creative-commons-solution/ Giannachi, G. (2016). Archive everything: mapping the everyday. Cambridge, Massachusetts: The MIT Press. Fitzpatrick, K. (2011). Planned obsolescence: publishing, technology, and the future of the academy. New York: New York University Press. Gross, A. and Harmon, J. (2016). The Internet revolution in the sciences and humanities. 1st ed. New York: Oxford University Press. John, N. (2016). Sharing. In: Peters, B., ed., Digital Keywords: A Vocabulary of Information Society & Culture. Princeton: Princeton University Press. The Journal for MultiMedia History, (2000, 2001). Current issue. [online] Available at: http://www.albany.edu/ jmmh/ [Accessed 01 May 2017]. Layers of London, (2017). Home. 
[online] Available at: https://layersoflondon.blogs.sas.ac.uk [Accessed 03 May 2017]. Nyhan, J. and Flinn, A. (2016). Computation and the humanities: towards an oral history of digital humanities. Springer Open. DOI 10.1007/978-3-319-20170-2 Oral History Metadata Syncronizer: enhance access for free, (2017). Home. [online] Available at: http:// www.oralhistoryonline.org [Accessed 01 May 2017]. Oxford University Research Archive, (2008). Home. [online] Available at: https://ora.ox.ac.uk [Accessed 03 May 2017]. Pohlandt-McCormick, H. (2002). ‘I saw a nightmare…’ Doing violence to memory: the Soweto uprising, June 16, 1976. [online] Columbia University Press and Gutenberg-e. Available at: http://www.gutenberg-e.org/pohlandt- mccormick/index.html [Accessed 03 May 2017]. Secker, J. (2017). Digital, information or copyright literacy for all? [Blog] Libraries, Information Literacy and E- learning: reflections from the digital age. Available at: https://janesecker.wordpress.com/2017/02/08/digital- information-or-copyright-literacy-for-all/ [Accessed 01 May 2017]. Schneider, W. (2014). Oral history in the age of digital possibilities. In: Boyd. D. and Larson, M., eds., Oral history and digital humanities: voice, access, and engagement. New York: Palgrave Macmillan US. StoryCorps. (2003). Stories. [online] Available at: https://storycorps.org/listen/ [Accessed 03 May 2017]. http://www.albany.edu/jmmh/ http://www.albany.edu/jmmh/ https://layersoflondon.blogs.sas.ac.uk http://www.oralhistoryonline.org http://www.oralhistoryonline.org https://ora.ox.ac.uk http://www.gutenberg-e.org/pohlandt-mccormick/index.html http://www.gutenberg-e.org/pohlandt-mccormick/index.html https://janesecker.wordpress.com/2017/02/08/digital-information-or-copyright-literacy-for-all/ https://janesecker.wordpress.com/2017/02/08/digital-information-or-copyright-literacy-for-all/ https://storycorps.org/listen/
work_afxycoctbrbcpkbkn7gub2qnpa ---- Raemy_Schneider_VKKS2019_quidproquo_AssigningPID_Art_Design 07.06.2019Raemy & Schneider 07.06.2019 ASSIGNING PERSISTENT IDENTIFIERS TO ART AND DESIGN ENTITIES Julien A. Raemy & René Schneider Fourth Swiss Congress for Art History, VKKS, Mendrisio Quid pro quo: linked data in art history research 07.06.2019Raemy & Schneider 1. Introduction to Persistent identifiers (PIDs) 2. Cool URIs and PIDs 3. The rationale and main results of the ICOPAD project 4. ICOPAD possible follow-up project: INCIPIT Agenda 1. INTRODUCTION TO PIDS 07.06.2019Raemy & Schneider Persistent identifiers (PID) A persistent identifier is a long-lasting and biunique reference to a digital resource. It usually has two parts: 1. A unique identifier (to ensure the provenance of a digital resource) 2. A location for the resource over time (to ensure that the identifier resolves to the correct location) https://www.slideshare.net/AustralianNationalDataService/fsci-persistent-identifiers https://www.slideshare.net/AustralianNationalDataService/fsci-persistent-identifiers 07.06.2019Raemy & Schneider In order to… https://www.interserver.net/tips/kb/404-error-fix/ - Create long lasting (not permanent) access - Avoid error messages https://www.interserver.net/tips/kb/404-error-fix/ 07.06.2019Raemy & Schneider PIDs are essential and indispensable to create fair data. F1 Principle: (meta)data are assigned a globally unique and eternally persistent identifier FAIRness http://www.dit.ie/dsrh/data/fairdata/ http://www.dit.ie/dsrh/data/fairdata/ 07.06.2019Raemy & Schneider § Publications § Data § Persons § Organisations § Citations and more: (antibodies, fictious characters, places, plants, e-books, …) PID ≠ PID 07.06.2019Raemy & Schneider « Persistence is not dependant on the identifier itself, but on legal, organisational and technical infrastructure ». (Hakala 2005) Persistence 07.06.2019Raemy & Schneiderhttp://andrew.treloar.net/research/diagrams/recording-to-archiving-architecture.jpg http://andrew.treloar.net/research/diagrams/recording-to-archiving-architecture.jpg 2. COOL URIS AND PIDS 07.06.2019Raemy & Schneider § Cool URIs don’t change: https://www.w3.org/Provider/Style/URI (Tim Berners-Lee, 1998) § Cool URIs for the Semantic Web: https://www.w3.org/TR/cooluris/ (W3C Interest Group Note, 2008) Cool URIs https://www.w3.org/Provider/Style/URI https://www.w3.org/TR/cooluris/ 07.06.2019Raemy & Schneider PIDs and cool URIs (Bazzanella, Bortoli, Bouquet 2013) Feature PIDs Cool URIs Resolver YES NO Authority YES NO Naming authorities YES NO Level of trust HIGH LOW Policies YES NO Persistence YES NO Actionability of IDs Partially YES Uniqueness YES NO Content change NO YES Content negotiation NO YES Cross linkage NO YES Effort for implementation HIGH LOW Costs for users Potentially HIGH LOW Sustainability issues MANY FEW Identified entities Mainly digital objects Everything Bridge metadata NO YES 07.06.2019Raemy & Schneider Motivation § SARI § ICOPAD PID LOD – cool URIs 07.06.2019Raemy & Schneider PIDs and LOD at the BnF 07.06.2019Raemy & Schneiderhttps://gallica.bnf.fr/ark:/12148/btv1b10542304w/f13.item https://gallica.bnf.fr/ark:/12148/btv1b10542304w/f13.item 3. 
3. THE RATIONALE AND MAIN RESULTS OF THE ICOPAD PROJECT
ICOPAD Project: § Identités de confiance pour les données de l'art et du design (ICOPAD) – June 2017 to December 2018 o Haute école de gestion de Genève (HEG-GE) – instigator and project manager o Zentralbibliothek Zürich (ZB) / Zurich Central Library o Zürcher Hochschule der Künste (ZHdK) / Zurich University of the Arts o Schweizerisches Institut für Kunstwissenschaft (SIK-ISEA) / Swiss Institute for Art Research o Goal: feasibility of a suitable PID model (prototype) o Requirements and workflow – linking research data and Linked Data by means of PIDs o Dedicated to the disciplines of art, design, and digital humanities, to derive conjectures o Transferability of the model to other disciplines (https://campus.hesge.ch/id_bilingue/projekte/icopad/index_fr.asp)
Swiss PID Landscape [figure; only recoverable label: ark]
ICOPAD use cases from our project partners:
Institution  Data set types/entities                                                      Needs
SIK-ISEA     Artists, artworks, dictionary entries                                        Diverse PIDs and links to normed data
ZB           Digital surrogates                                                           Fine level of granularity
ZHdK         Artists, artworks, events, films, glossary entries, projects, research data  Further development of applications such as eMuseum and Medienarchiv
Approaches (C(doi) denotes the content a DOI resolves to; a = ark, an Archival Resource Key provided by the California Digital Library):
DOI: C(doi) = x
DOI + 1: C(doi) = a
DOI + n: C(doi) = {x_1, x_2, …, x_n}
DOI + 1 + LD: C(doi) = a → owl:sameAs {x_1, x_2, …, x_n}
Archival Resource Key (ARK): § ARK identifiers are free § ARKs are built on a completely different theoretical model, consisting of a decentralised and domain-agnostic (i.e. DNS-agnostic) approach § ARKs make it easy to use LOD on top of them § ARKs can effortlessly be combined with other specifications such as the International Image Interoperability Framework (IIIF) canonical URI syntax
Solution approaches [figure: diagram of a Swiss PID Hub; recoverable labels: ark service to create PID, service request, ark request, DaSCH, Uni Bas, CDL existing ark service, multitude of PIDs, ark service to create own PID, attribution service if NOT DOI @ ETH | FORS AND if NOT data archived @ DaSCH]
Conclusion: o PIDs are a key element of the research data management process and should be assigned to entities as early as possible o Trusted identity, FAIR data o PIDs for the Semantic Web are possible o BnF platforms o A large variety of PIDs → DOIs are not sufficient o Most interesting complement: ARKs (LOD, free, decentral, granularity, etc.) o Need for an infrastructure/service in Switzerland o A national hub that can mint ARKs
4. ICOPAD POSSIBLE FOLLOW-UP PROJECT: INCIPIT
INCIPIT: § Infrastructure nationale d'un complément pour les identifiants pérennes, interopérables et traçables (INCIPIT) § Project submission (August 2019) § 3 phases: 1. Attribution service (by the end of 2019) – ArODES 2. Fusion of ArODES and SONAR (2020) 3. Creation of a Hub (2021). Partners welcome (see you at Bits and Bites)!
Julien A. Raemy, Research and Teaching Assistant in Information Science, julien.raemy@hesge.ch
René Schneider, Full Professor of Information Science, rene.schneider@hesge.ch
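To make the 'DOI + 1 + LD' approach from section 3 concrete, here is a small sketch that bridges a DOI to an ARK with owl:sameAs using the rdflib library. The two identifiers are real-looking examples taken from this deck, but the pairing itself is purely illustrative, not an assertion made by the project.

```python
from rdflib import Graph, URIRef
from rdflib.namespace import OWL

g = Graph()

# Hypothetical pairing: a DOI-identified resource declared equivalent to an ARK.
doi = URIRef("https://doi.org/10.2218/ijdc.v8i1.246")
ark = URIRef("https://gallica.bnf.fr/ark:/12148/btv1b10542304w")

# 'DOI + 1 + LD': one complementary identifier, bridged via owl:sameAs.
g.add((doi, OWL.sameAs, ark))

print(g.serialize(format="turtle"))
```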
Bibliography
• BAZZANELLA, Barbara, BORTOLI, Stefano and BOUQUET, Paolo, 2013. Can persistent identifiers be cool? International Journal of Digital Curation. 14 June 2013. Vol. 8, no. 1, p. 14–28. DOI 10.2218/ijdc.v8i1.246.
• BERMÈS, Emmanuelle, 2006. Des identifiants pérennes pour les ressources numériques : l'expérience de la BnF [online]. Paris, France: Bibliothèque nationale de France. [Accessed 20 May 2019]. Available from: https://web.archive.org/web/20181006042857/http://www.bnf.fr/documents/ark_presentation_bermes_2006.pdf
• ESPASANDIN, Kate, JAQUET, Aurélie, LEFORT, Lise and SCHNEIDER, René (dir.), 2018. TRMASID 14: Panorama et modélisation d'identifiants pérennes pour la création d'identités de confiance [online]. Genève, Suisse: Haute école de gestion de Genève. [Accessed 20 May 2019]. Available from: https://doc.rero.ch/record/309479
• EU. DIRECTORATE-GENERAL FOR RESEARCH AND INNOVATION, 2018. KI-06-18-206-EN-N: Turning FAIR into reality. Final Report and Action Plan on FAIR Data [online]. Brussels, Belgium. [Accessed 20 May 2019]. Available from: https://doi.org/10.2777/1524
• HILSE, Hans-Werner and KOTHE, Jochen, 2006. Implementing persistent identifiers: overview of concepts, guidelines and recommendations. London: CERL. ISBN 978-90-6984-508-1.
• LA TRIBUNE DES ARCHIVISTES, 2018. Choisir des URL persistantes pour la mise en ligne de sa base de données : ARK pas à pas... La Tribune des Archivistes [online]. 21 October 2018. [Accessed 20 May 2019]. Available from: http://latribunedesarchives.blogspot.com/2018/10/choisir-des-url-persistantes-pour-la.html
• MEADOWS, Alice, 2017. PIDapalooza – the open festival for persistent identifiers. Insights. 8 November 2017. Vol. 30, no. 3, p. 161–164. DOI 10.1629/uksg.393.
• NICHOLAS, Nick, WARD, Nigel and BLINCO, Kerry, 2009. A policy checklist for enabling persistence of identifiers. D-Lib Magazine [online]. January 2009. Vol. 15, no. 1/2. [Accessed 20 May 2019]. DOI 10.1045/january2009-nicholas. Available from: http://www.dlib.org/dlib/january09/nicholas/01nicholas.html
• PEYRARD, Sébastien, KUNZE, John A. and TRAMONI, Jean-Philippe, 2014. The ARK Identifier Scheme: Lessons Learnt at the BnF and Questions Yet Unanswered. International Conference on Dublin Core and Metadata Applications. 8 October 2014. P. 83–94.
• PRONGUÉ, Nicolas and RAEMY, Julien A., 2017. Revue de la littérature : identifiants pérennes (PID), Linked Data, Données de la recherche [online]. Carouge, Suisse: Haute école de gestion de Genève. [Accessed 20 May 2019]. Available from: https://campus.hesge.ch/id_bilingue/projekte/icopad/doc/Prongue_Raemy_Revue_Litterature_2017.pdf
• RAEMY, Julien A., 2018. Identifiants pérennes (PID) : Processus d'obtention, mapping et approches d'attribution, modélisation, glossaire [online]. Carouge, Suisse: Haute école de gestion de Genève. [Accessed 20 May 2019]. Available from: https://campus.hesge.ch/id_bilingue/projekte/icopad/doc/Raemy_PID_Processus_Approches_Modelisation_2018.pdf
• SCHNEIDER, René and RAEMY, Julien A., 2019a. Résultats du projet ICOPAD. ID Bilingue [online]. February 2019. [Accessed 20 May 2019]. Available from: https://campus.hesge.ch/id_bilingue/projekte/icopad/results_fr.html
• SCHNEIDER, René and RAEMY, Julien A., 2019b. Towards Trusted Identities for Swiss Researchers and their Data. 14th International Digital Curation Conference (IDCC) [online]. Melbourne, Australia. 6 February 2019. [Accessed 20 May 2019]. Available from: https://doi.org/10.5281/zenodo.2415995
• VAN DE SOMPEL, Herbert, KLEIN, Martin and JONES, Shawn M., 2016. Persistent URIs Must Be Used To Be Persistent. arXiv:1602.09102 [cs] [online]. 29 February 2016.
[Accessed 20 May 2019]. Available from: http://arxiv.org/abs/1602.09102
Ergonomic Assessment for DHM Simulations Facilitated by Sensor Data
Available online at www.sciencedirect.com (ScienceDirect). Procedia CIRP 41 (2016) 702–705. doi: 10.1016/j.procir.2015.12.098
48th CIRP Conference on MANUFACTURING SYSTEMS - CIRP CMS 2015
Dan Gläser, Lars Fritzsche, Sebastian Bauer, Vipin Jayan Sylaja (imk automotive GmbH, 09128 Chemnitz, Germany)
2212-8271 © 2015 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). Peer-review under responsibility of the scientific committee of the 48th CIRP Conference on MANUFACTURING SYSTEMS - CIRP CMS 2015.
Abstract
The digital factory with its innovative tools is gaining importance, not only in experimental but also in productive domains. One of these tools is the digital human model (DHM). In the field of production, the focus of using DHMs lies in the planning and evaluation of processes and products in terms of plausibility, productivity and ergonomics. Up to now, ergonomic assessments within DHM simulations have been mostly limited to static evaluations of reachability and postures. INTERACT is a running R&D project working on the main weak points of DHM software tools. The industry-driven requirements are mainly the reduction of input effort, the increase of movement quality, and a quick and intuitive way to create simulation variants in a workshop environment. The utilization of sensor data to create high-quality simulations is another point of development. Next to the addressed improvements in productivity and plausibility, these latest advancements also enable automatic ergonomic assessments, including process-oriented standards like EAWS, OCRA and the NIOSH lifting index. The inclusion of these standards will allow a more holistic ergonomic assessment and therewith expand the fields of application in the industrial environment. This paper gives an insight into the latest developments and the performance of current implementations of automatic ergonomic assessment within digital human models.
Keywords: Ergonomic assessment for DHM simulations facilitated by sensor data
1. Introduction
The interactive nature and the flexibility are the main advantages of digital simulations. Especially in the environment of process planning for manual work tasks, where the classic methods have used paper boxes as mock-ups and string to plan body postures and walking paths, the advantages of a virtual environment become clear. The creation of process variations within seconds, the exchange of objects in the workplace, or the shifting of tasks from one worker to another are just a few of many examples. In addition, software systems possess the ability to measure precisely when it comes to path lengths, times or joint angles. Thus, the full incorporation of ergonomic assessment methods into DHM software tools may improve evaluation efficiency, objectivity and validity. Nevertheless, the simulation of manual processes and the ergonomic assessment of these processes haven't been used widely in the past. The simulation of manual manufacturing processes has been very time-consuming work, since body postures and the
motions in between had to be defined at the level of individual limbs and joints. The massive time effort that was needed has hindered the digital human model as a technology from becoming the intuitive and interactive tool it could be. The INTERACT approach focuses explicitly on these weaknesses, to raise the digital human model to a higher level of intuitiveness and interactivity. This paper focuses mainly on the ergonomic assessment functions of the INTERACT software prototype. In the following, the three included assessment methods EAWS, NIOSH lifting index and OCRA are described, followed by the methodology and the implementation of the corresponding software modules.
2. Methodology
The automatic ergonomic assessment with the previously mentioned methods EAWS, NIOSH lifting index and OCRA requires a certain amount of information about the process: - Body postures - Handled loads - Forces applied to the body
These parameters have to be analyzed discretely, so that they can be assigned to one another at every point in the process. The body posture is retrieved by measuring joint angles and/or distances between joints, limbs and body landmarks, as required by the relevant ergonomic assessment method. The information on handled loads is retrieved from the geometry data, which include the mass of the geometry in use. Whether a load in the scene is being handled is derived from an 'attached'/'detached' flag for the right hand, the left hand or both hands. The forces are measured and interactively assigned to the process through sensor data. This can be done in advance of the simulation or interactively in the workshop environment. In addition, it is possible to assign forces manually to individual process steps. The three methods also allow 'extra points' to be defined for special ergonomic risks, such as throwback, sitting on hanging surfaces, walking on sticky floors, etc.
3. Ergonomic assessment modules
3.1. EAWS
The Ergonomic Assessment Work Sheet (EAWS) [1] is a widely used method in the German automotive industry. It is based on a holistic analysis of the work process, considering all executed work tasks in the context of a whole working day. EAWS is separated into 5 modules, which are assessed separately. One module is related to body postures, which are assessed as static (duration > 4 sec.) or dynamic (freq. > 2/min.). A posture is only assessed if no significant force (> 40 N) or load (> 3 kg) is applied to the worker during its occurrence; if a relevant force or load occurs, the related parts of the process are assessed with the corresponding modules. Another module addresses the extra points, which cannot be quantified (or at least not easily) within a 'standard' assessment. The last module is related to upper limb movements at high frequencies. This module results in an extra index, which is displayed separately. Due to its complex nature and its focus on body parts that are relatively difficult to observe, such as the wrist, this module isn't used widely.
Fig. 1. Skeleton of the INTERACT avatar
Fig. 2. Graphic representation of hand location
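The posture module's static/dynamic distinction described above lends itself to a simple rule: a posture event counts as static if held longer than 4 s, as dynamic if it recurs more than twice per minute, and it is routed to the force/load modules when a significant force or load is present. The following is only a schematic sketch of that screening logic, not the INTERACT implementation; the event structure is invented for illustration.

```python
from dataclasses import dataclass

@dataclass
class PostureEvent:
    name: str            # e.g. "trunk bent forward"
    duration_s: float    # how long the posture was held
    freq_per_min: float  # how often the posture recurs
    force_n: float       # external force applied during the posture
    load_kg: float       # handled load during the posture

def eaws_posture_screen(ev: PostureEvent) -> str:
    """Route a posture event to the EAWS module that should assess it."""
    if ev.force_n > 40 or ev.load_kg > 3:
        return "assess via force/load module"  # posture module not applicable
    if ev.duration_s > 4:
        return "static posture"
    if ev.freq_per_min > 2:
        return "dynamic posture"
    return "not assessed"

print(eaws_posture_screen(PostureEvent("trunk bent", 6.0, 0.5, 0.0, 0.0)))  # static posture
```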
3.3. OCRA

The OCRA system is a set of tools enabling different levels of risk assessment based on the desired specificity, variability, and objectives [3]. As mentioned above, it is part of ISO 11228. OCRA consists of three modules: the OCRA Mini-Checklist, the OCRA Checklist, and the OCRA Index. For an automatic assessment, the OCRA Index is the one that is used, because only the Index is developed to quantify the work-related exposure and risks on a detailed level. Like the NIOSH lifting index, the OCRA index is a quotient: the number of actual technical actions (ATA) divided by the number of recommended technical actions (RTA). The definition of technical actions is shown below (see Fig. 4). Both are calculated by a number of multipliers covering the number of repetitive tasks per shift, force exertion, posture, recovery, and the additional multiplier.

Fig. 4. Technical actions in OCRA

4. Results

All assessment tools have been analyzed with regard to the quantification and measurement of their input parameters. The current prototypes of the assessment tools contain only those parameters which are measurable within the INTERACT prototype's functionality. There is still a number of additional parameters which have to be put in manually, since they are not assigned to the process or the geometry yet. Some of these additional parameters are the coupling between hand and object during load handling, temperatures, or vibration. The workflow for the development and implementation of the tools has been the same for all three methods: method analysis and preparation, GUI draft, program flow chart, implementation, and validation through a test scenario.

4.1. EAWS

Apart from the extra points, EAWS has been transferred to a fully automated assessment tool. The body postures are assessed in every frame of the simulation. The loads are retrieved from the masses which are assigned to the handled geometry, while forces are assigned to tasks via sensor data in the workshop. The results are displayed through the INTERACT GUI (see Fig. 5). On the right, the overall score is displayed, with the distribution of points into the several assessment modules: posture, action forces, load handling, and extra points. The EAWS result is ranked in the three categories green (0-25 pts.), yellow (25-50 pts.), and red (>50 pts.), which indicate low risk, intermediate risk, or an urgent need for adaptation of the working conditions, respectively. In the left part of the GUI, several detailed representations of the individual modules (posture, forces, loads) can be displayed according to the requirements of the user.

Fig. 5. GUI of the EAWS module
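The traffic-light ranking is a simple threshold mapping. A minimal sketch follows; the function name is illustrative, and since the paper's bands overlap at 25 and 50 points, assigning the boundary values to the lower category is an assumption of this sketch:

```python
def eaws_risk_category(score: float) -> str:
    """Map an overall EAWS score to the traffic-light rating shown in the
    INTERACT GUI; thresholds are taken from the paper."""
    if score <= 25:
        return "green"   # low risk
    if score <= 50:
        return "yellow"  # intermediate risk
    return "red"         # urgent need for adaptation of the working conditions
```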
4.2. NIOSH lifting index

The NIOSH lifting index can be processed almost fully automatically, apart from the coupling multiplier between hands and objects. In the long term, this parameter can be assigned directly to the geometry as meta-information. With this further improvement, the NIOSH lifting index will be available as an automatic assessment tool. It has to be mentioned that the NIOSH lifting index shows several weaknesses as a holistic assessment tool, since it only assesses lifting and lowering tasks and comes with a number of restrictions. For example, a switch of hands, sitting down, tool handling, and other tasks are not allowed to be assessed.

4.3. OCRA

The OCRA method is suitable for an automatic assessment in principle, but there are several challenges coming with it. Not every technical action is irrevocably defined, which makes it difficult to determine them explicitly. For the identification of 'putting in/pulling out' it is necessary to be able to differentiate them from a simple 'moving'. For the technical action 'start-up' the software has to know whether a tool is manual or automatic and whether it requires the pressing of a start button or not. There are concepts for solving these problems, since most of the required information can be assigned either to objects or to processes in the future, but the current INTERACT prototype does not allow all of the required features to be implemented. Nevertheless, there is a tool ready for a semi-automatic OCRA assessment, which requires some manual input (see Fig. 6).

Fig. 6. GUI draft for the OCRA assessment tool

5. Conclusion and discussion

With the automatic assessment based on the three process-oriented ergonomic assessment tools EAWS, NIOSH lifting index, and OCRA, INTERACT makes a big contribution to promoting the work with digital human models for the ergonomic evaluation of processes in manufacturing. While all methods show the ability to be used automatically in a virtual environment, there are still problems to solve. Some parameters which are required by the methods are not part of the current virtual representations of products and processes. Properties like surface conditions, temperatures, or vibrations are not assigned to virtual objects yet. The INTERACT project strengthens the idea that the focused goals of higher efficiency, objectivity, and validity in ergonomic assessment can be achieved with digital human modelling in the near future.

References

[1] Schaub K, Caragnano G, Britzke B, Bruder R. The European Assembly Worksheet. Theoretical Issues in Ergonomics Science. 2012. DOI: 10.1080/1463922X.2012.678283
[2] ISO 11228-1:2007
[3] ISO 11228-3:2007
work_ajlebeewlbelvegajsgyamggoq ---- Toward Sustainable Growth: Lessons Learned Through the Victorian Women Writers Project

Borgo, Mary Elizabeth. 2017. "Toward Sustainable Growth: Lessons Learned Through the Victorian Women Writers Project." Digital Studies/Le champ numérique 7(1): 4, pp. 1–8, DOI: https://doi.org/10.16995/dscn.276

Mary Elizabeth Borgo, Department of English, Indiana University, US (meborgo@umail.iu.edu)

Abstract: This case study offers strategies for TEI-based projects with limited funding. By focusing on the needs of our volunteers, the Victorian Women Writers Project has developed truly collaborative relationships with the project's partners. Contributions to the project's resources have grown out of digital humanities survey courses, literature classes, and independent work. The paper concludes with a brief sketch of our efforts to support continued work by rethinking our social media outreach and our online presence.

Keywords: TEI encoding; feminist DH; sustainability

By age 20, Juliana Horatia Ewing had published her first children's story in the Monthly Packet, "A Bit of Green" (1895). It features a selfish child who learns Christian charity by visiting with his father's patients, and exhibits all of the hallmarks of a typical Victorian children's story. While this sentimental tale was unremarkable in its time, this short piece launched Ewing's extraordinary career. As the founder and editor of Aunt Judy's Magazine, Ewing became one of the most dynamic and influential children's authors of her time.
Her most enduring story, "The Brownies," even inspired a new division of the Girl Scouts. Ewing's work, among other rare and often out-of-print texts, has found a new audience through the Victorian Women Writers Project. Since its founding in 1995, the archive has supported feminist literary studies through its innovative approach to preserving nineteenth-century texts. By working alongside groundbreaking projects like Orlando and the Women Writers Project among many others, over 200 texts have been encoded according to TEI-P5 guidelines. We continue to add more texts, critical introductions, scholarly annotations, and biographies with each passing year. But if the authors in our archive are any indication of our future, the next 20 years will be even more spectacular. In order to ensure the project's sustained growth, the VWWP has been developing new types of partnerships. While there is no "one size fits all" solution to developing sustainable projects, the following case study offers a broad spectrum of approaches for encoding initiatives that rely heavily on the work of unpaid contributors. By assessing their needs, we have become better prepared to create and support mutually beneficial partnerships. This learning process has shed light on logistical difficulties inherent in collaborative encoding projects, ultimately inspiring a more student-centered approach.

Our first step toward sustainable growth was to identify potential contributors who had some familiarity with coding and with nineteenth-century texts. Since its inception, our project has been the result of close partnerships between faculty, students, and librarians at Indiana University. Perry Willett, then Head of Library Electronic Text Resource Service (LETRS), founded the project in 1995 after being approached by an undergraduate, Felix Jung, who requested additional resources to study Victorian poetry, a genre dominated by women. Through close collaboration with Donald Gray from the English department, the founders identified, encoded, and launched new digital editions of rare materials authored by women that had been largely overlooked in subscription-based services. After lying fallow for a few years, the project was revived in 2007 by Angela Courtney, IU's English Literature Librarian, and Michelle Dalmau, then Digital Projects Librarian. Their outreach efforts ultimately resulted in one of the first Digital Humanities courses taught at Indiana University in the fall of 2010. Co-teacher Joss Marsh, a Victorianist, and Adrianne Wadewitz, then a graduate student at IU, transformed the VWWP into a powerful pedagogical tool. Encoding texts for the project as part of course objectives gave students the opportunity to practice traditional editorial skills alongside emergent methodologies in the digital humanities (for more information about the project's founding and development, see Courtney et al. 2015). As a student in this course, I saw first-hand how digital preservation projects can lead to exponential professional growth, particularly at a graduate student level. Learning how to code through the VWWP gave me the advanced TEI skills needed for digital preservation projects. This experience laid the groundwork for building my own digital projects and contributing to others. By incorporating digital resource-building into my writing process, I have created publicly accessible versions of my dissertation research.
This aspect of my work has made me a more competitive candidate for travel funding and research grants. When I assumed the role of managing editor of the VWWP in the spring of 2011, I did not yet know how formative digital humanities would be for my own approach to nineteenth-century literature, but I was (and still am) passionate about helping undergraduate and graduate students professionalize through their work with the VWWP.

Since students have been a key facet of the project's growth, we then looked for resources which would help us to expand our partnerships with students at a graduate level. Our research included identifying relevant models for classroom engagement. Many successful projects deliberately target the classroom as the primary site of contributions. The Victorian Web and the Map of Early Modern London, for example, include entries written as part of daily class objectives. Graduate-level digital humanities courses taught at IU since the fall of 2010 include the VWWP, the Swinburne Project, and the Chymistry of Isaac Newton as part of a more general survey of DH projects. The courses taught in the fall of 2014 and 2015 used Scalar to preserve the classes' work. Yet, the first class was a bit of an outlier in its focus on editorship and on TEI. By nature, digital humanities survey courses have little room for extended TEI-encoding projects. Since most students enroll in these courses without prior knowledge of XML encoding and TEI guidelines, it is difficult to devote a significant portion of the class to technical training. Learning how to encode seemed to be the biggest logistical challenge for graduate volunteers. When coupled with the fact that most graduate students are also juggling teaching responsibilities and dissertations, devoting time to learning a coding language seems like a daunting task. Until there are institutional changes to dissertation criteria, it's difficult to convince graduate students to engage with digitization projects as an extension of their research because this kind of work is not needed to graduate. IU has taken steps toward changing this perception by modifying the language requirement of the Ph.D. to include code. Positioning TEI as a language prepares Ph.D. candidates like myself to engage with a broader range of critical work, much in the same way that one would grapple with criticism in German or French. As a language, TEI also shapes the way that an encoder interacts with the texts. In my own work, looking for place names has made me more attuned to the role of space in shaping narrative. Encoding creates an experience of close-reading a text that both prepares the text for digital publication and generates new interpretations of nineteenth-century material.
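To make concrete what "looking for place names" involves once a text is encoded, here is a hypothetical sketch (not the VWWP's actual tooling, and the file name is invented) of how the placeName elements produced during TEI encoding can later be queried for spatial analysis:

```python
from collections import Counter
from lxml import etree

TEI_NS = {"tei": "http://www.tei-c.org/ns/1.0"}

def place_name_counts(path: str) -> Counter:
    """Count occurrences of <placeName> in a TEI P5 document, a first step
    toward studying the role of space in shaping a narrative."""
    tree = etree.parse(path)
    names = (el.xpath("string()").strip()
             for el in tree.iterfind(".//tei:placeName", namespaces=TEI_NS))
    return Counter(name for name in names if name)

# e.g. place_name_counts("ewing_brownies.xml").most_common(10)
```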
In order to better support work that combined editing with encoding, we had to cater encoding tasks to fit the requirements and time constraints of the classroom. This was a particularly daunting undertaking since many of the books in our current workflow span over 200 pages. With the help of teaching workshops offered through the Women Writers Project (Northeastern University) and the Digital Humanities Summer Institute (University of Victoria), we developed different strategies for sharing the work of encoding. In some cases, encoders complete only a portion of the text; while this is well-suited for short-term projects, it's challenging to maintain a level of continuity between each part (and among all of the texts in the repository). Since our encoders have found it easier to work with a whole text, we are gravitating toward adding shorter texts into our digitization workflow and toward dividing encoding tasks into phases. Having several encoders make multiple passes through a text increases chances for peer review and thus reduces the number of errors in the encoding.

As we worked on strategies to market encoding tasks to graduate students, we also considered expanding contributions to the project that did not require encoding. While this move does not help us expand our collection of TEI-encoded texts, it allows us to develop partnerships with undergraduate students and to increase our outreach efforts. Much to our delight, we were able to partner with Chris Hokanson at Judson College in the spring of 2012 in order to add supplemental scholarly material to the archive. As part of an undergraduate course on Victorian women's writing, Hokanson asked students to write brief scholarly biographies for authors in the collection. These submissions were then edited and encoded by the project's managers.

The greatest challenge that we face during the next phase of the project's development is not a logistical problem but an ethical one. Since the VWWP is, and will continue to be, an open-access resource, we lack the revenue generated by subscriptions. To further complicate matters, encoding a 300-page Victorian novel or writing a scholarly introduction to an obscure tract on suffrage requires a significant amount of time, energy, and expertise. We are morally obligated to compensate our contributors for their time, especially since their work requires advanced technical skills and knowledge of the subject material, but we are unable to financially reimburse the project's partners and thus must rely on the good-will of contributors. The citizen science model provides one way to address this issue. By simplifying tasks, projects like Science Gossip and Ancient Lives broaden the range of potential contributors. Because many hands make light work, labor-intensive projects like transcription can be accomplished in a fraction of the time. More importantly, these projects reward volunteer efforts by positioning contributors as shareholders in the final product. Clearly articulating the goals of the project gives citizen-scientists a better sense of how these small-scale tasks contribute to our understanding of history. Citizen-science projects have helped us to re-evaluate our classroom model. As Emily Murphy and Shannon Smith have argued, teacher-apprentice models lend student projects focused structure, but they risk reinforcing traditional hierarchies rather than giving students opportunities to join the DH community (2015). The VWWP encourages its students to become what Murphy and Smith describe as the
Performing both encoding and editorial tasks has allowed junior scholars to actively participate in conversations about encoding best-practices and archive-building. Though our most dynamic periods of growth have stemmed from close partnerships with faculty, the opportunities to teach TEI encoding through the VWWP ’s texts are too few at IU to sustain the project’s continued growth. In light of limited course offerings, we have explored options that extend beyond the classroom model. Contributors working independently of a class have allowed us to extend our pool of contributors beyond IU. These long-distance partners have revealed the need for more streamlined project guidelines and for continued support in the form of regular meetings to maintain momentum. For our particular project, contributors must find their work professionally and intellectually rewarding. Locating and digitizing texts which intersect with our contributor’s research interests attracts a broader spectrum of students. One of our most recent collaborators, Rachel Philbrick (Brown University), has been encoding Victorian classical scholarship as an extension of her dissertation research on ancient Greek literature. Since most of the graduate student encoders will be entering the job market soon, they are concerned that their contribution won’t “count” as a publication. We have been working to create a more robust editorial review in order to add weight to their work with the project. Furthermore, we are developing surveys to track how the website is being used so that we can build stronger partnerships with those actively using the collection. We’ve also discussed at length how we can preserve the ownership and self-direction integral to the “citizen-scholar” model in non-encoding based tasks, Borgo: Toward Sustainable Growth 7 particularly at the undergraduate level. These strategies stem from undergraduate student-driven research projects. By offering students the option to work with the VWWP as part of professional writing courses, we’ve been working with undergraduates from marketing, business, and events management to create outreach events and internships. Thanks to Rachel Sharp, Evan Garthus, and Katelyn Kass, we will be hosting the 21st birthday party for the VWWP in Spring 2017. Research performed by two other groups have shown that students are looking for social media marketing experience. In response to this need, we will be offering a social media internship where students tweet, develop blog posts, and design marketing campaigns. Increasing our social media presence will help us to reach potential collaborators and identify projects with similar thematic foci. By identifying our contributors’ needs and finding models for sustainable growth, the VWWP has been developing new methods to expand TEI-based projects with limited funding. Catering project tasks to fulfill the professional and pedagogical objectives of our contributors has created partnerships which benefit volunteers and the project. As we move forward, we will continue to explore ways to support collaboration through coursework, through independent efforts, and through our online presence. In the years to come, we hope to attract an even more diverse range of contributors in order to foreground underrepresented voices in Victorian studies and digital scholarship. Competing Interests Mary is the Managing Editor of the Victorian Women Writers Project. There are no other competing interests. 
References

Courtney, Angela, Arianne Hartsell-Grundy, et al. 2015. "Second Time Around; or, The Long Life of the Victorian Women Writers Project." In Digital Humanities in the Library: Challenges and Opportunities for Subject Specialists, 263–75. Chicago: ACRL.

Ewing, Juliana Horatia. 1895. "A Bit of Green." In Melchior's Dream, 118–33. London: Society for Promoting Christian Knowledge.

Murphy, Emily, and Shannon Smith. 2015. "'Productive Failure' for Undergraduates: How to Cultivate Undergraduate Belonging and Citizenry in the 'Digital Humanities.'" Digital Pedagogy Institute – Improving the Student Experience. University of Toronto Scarborough, August 20.
work_alynjplhunbqnmppz32wm5fojm ---- What Ever Happened to Project Bamboo?

Quinn Dombrowski, Research IT, UC Berkeley, Berkeley, CA 94720, USA

Literary and Linguistic Computing, Vol. 29, No. 3, 2014, pp. 326–339. doi:10.1093/llc/fqu026

Abstract

This paper charts the origins, trajectory, development, challenges, and conclusion of Project Bamboo, a humanities cyberinfrastructure initiative funded by the Andrew W. Mellon Foundation between 2008 and 2012. Bamboo aimed to enhance arts and humanities research through the development of infrastructure and support for shared technology services. Its planning phase brought together scholars, librarians, and IT staff from a wide range of institutions, in order to gain insight into the scholarly practices Bamboo would support, and to build a community of future developers and users for Bamboo's technical deliverables. From its inception, Bamboo struggled to define itself clearly and in a way that resonated with scholars, librarians, and IT staff alike. The early emphasis on a service-oriented architecture approach to supporting humanities research failed to connect with scholars, and the scope of Bamboo's ambitions expanded to include scholarly networking, sharing ideas and solutions, and demonstrating how digital tools and methodologies can be applied to research questions. Funding constraints for Bamboo's implementation phase led to the near-elimination of these community-oriented aspects of the project, but the lack of a shared vision that could supersede the individual interests of partner institutions resulted in a scope around which it was difficult to articulate a clear narrative. When Project Bamboo ended in 2012, it had failed to realize its most ambitious goals; this article explores the reasons for this, including technical approaches, communication difficulties, and challenges common to projects that bring together teams from different professional communities.

1 Introduction

Project Bamboo was a humanities cyberinfrastructure initiative funded by the Andrew W. Mellon Foundation between 2008 and 2012, in order to enhance arts and humanities research through the development of infrastructure and support for shared technology services. In 2008, the Mellon Foundation funded a joint proposal for UC Berkeley and the University of Chicago to conduct a planning process that would gather feedback from scholars, librarians, and IT staff from a wide range of institutions, and build a community of future developers and users for Bamboo's technical deliverables.
Where project staff anticipated 200 attendees representing 75 institutions, over 600 ultimately participated, representing more than 115 institutions.1

This article charts the origins, trajectory, development, challenges, and conclusion of Project Bamboo, from its initial funding through the months immediately following its conclusion. The article is an expansion of the author's presentation at Digital Humanities 2013, with the goal of providing background and context for further discussion within the digital humanities community about lessons that can be learned from this project.

Material for this article has been drawn from a number of sources, most prominently the public Bamboo wikis,2 supplemented by the author's own memory, that of colleagues, and email records.3 While this article largely deals with the facts of Project Bamboo, a layer of interpretation is inevitable, particularly as pertains to the factors contributing to the project's failure to realize its most ambitious goals. The conclusions drawn are the author's own, and neither a product of consensus among the participants nor an official statement on behalf of Project Bamboo, the University of Chicago, UC Berkeley, or the Mellon Foundation.

2 Origins

In the mid-2000s, discussions about cyberinfrastructure emerged in higher education IT circles, including EDUCAUSE and the Coalition for Networked Infrastructure. Future Bamboo project co-director Chad Kainz, then the senior director of Academic Technologies within the University of Chicago's central IT unit, saw a role for cyberinfrastructure, and what would come to be known as cloud computing, in addressing the following issues he had encountered while supporting digital humanities projects: (1) at least two-thirds of the time spent on typical humanities technology projects was spent on developing the technology rather than focusing on the scholarship, (2) many of the projects centered on either 'yet another database' or 'yet another website', and (3) the technologies that were ultimately created for the projects in question had been developed before, but for different contexts, thus 'reinventing the wheel' (Kainz, 2010).

At the 2006 EDUCAUSE Seminar on Academic Computing, Kainz discussed support for digital humanities with Chris Mackie, at that time an Associate Program Officer for the Research in Information Technology (RIT) program at the Mellon Foundation. For Mackie, the issues that Kainz identified also led to frustrations for funding agencies: foundation funds were being directed toward the development of software that would likely not be reused and the creation and presentation of data that could spread no further than a single Web site or database, rather than substantively furthering humanities scholarship. Mackie encouraged Kainz to partner with David Greenbaum, the UC Berkeley Director of Data Services and future Bamboo co-director, to initiate a Mellon-funded project that would address these issues. Based on feedback from Mackie, Kainz and Greenbaum revised an initial technology development proposal into a community-driven technology planning project.
3 Bamboo planning project proposal

The Bamboo Planning Project proposal identified five key communities whose participation was seen as crucial for the project's success: humanities researchers, computer science researchers, information scientists, librarians, and campus technologists.4 Anticipating—if understating—the root of many of the challenges that would arise in the workshops, the proposal noted that '[e]ach community has distinctive practices, lingo, assumptions, and concerns; and clearly there is much diversity within each community as well' (Project Bamboo, 2008, p. 6). The proposal drew extensively on information and examples shared by 50 representatives of these five communities at UC Berkeley who attended an all-day focus group at the Townsend Center for the Humanities in November 2007. Perspectives from University of Chicago faculty and staff also contributed to the view of the then-current landscape of digital humanities depicted in the proposal. While both UC Berkeley and the University of Chicago are leading research institutions with strong programs in the humanities and a number of longstanding digital humanities projects (e.g. ARTFL at the University of Chicago, and the Sino-Tibetan Etymological Dictionary and Thesaurus at UC Berkeley), these projects were more the exception than the norm, and faculty members at these institutions were not highly involved in the leadership of large digital humanities
The Bamboo planning proposal charted a direct path from the expression of scholarly practices6 within and across disciplines (in the first workshop) to systematizing those practices into defined schol- arly workflows that could be used ‘to derive com- monalities and unique requirements related to practices, functions, barriers, needs, and existing and potential transformations at the disciplinary level’, to developing ‘a community-endorsed tech- nology services roadmap for scholarship’, along with organizational, staffing, and partnership models to support those services. It anticipated that ‘arts and humanities scholars [would] begin to shape technology options by questioning impacts of potential technological choices, clarifying misin- terpreted goals and ultimately co-determining a roadmap of goals to pursue, tools to provide, platforms on which to run, and architecture to use’ (Project Bamboo, 2008, p. 24). SOA would play an increasingly prominent role as the work- shops progressed.7 Between the workshops, participants would pro- pose pilot projects that would be undertaken by Bamboo program staff. These pilot projects would ‘be based on industry-accepted practice and open standards for a services-oriented architecture’ and would ‘present . . . a tangible expression of how ser- vices can function . . . facilitate understanding and critique . . . our process, as well as clarify our seman- tics and goals’ (Project Bamboo, 2008, p. 28). According to the plan, by the end of the Bamboo Planning Project, the initial group of 200 partici- pants from 75 institutions would be narrowed down to 30 participants from the 15 institutions that would move ahead with implementing a robust, scalable web services framework and a set of services that aligned with scholarly practice in the humanities, as defined by participating scholars. In reality, this plan changed dramatically when faced with the interests and priorities of actual humanities scholars. 4 Bamboo planning workshops One of the hallmark traits of the Bamboo planning workshops was their flexibility—on more than one occasion, plans and agendas that had been painstak- ingly prepared over weeks were discarded and com- pletely rewritten after a frustrating morning session. This began with the first iteration of workshop 1 (held in Berkeley, 28–30 April 2008). After high- level presentations on Bamboo, its approach, and its methodology, participants were asked to name abstracted scholarly practices (as verbþdirect object), provide a description, identify applicable domains, cluster those practices, and then repeat the process for emerging scholarly practices, while scribes filled in an Excel spreadsheet template with different tabs for each exercise. Faculty participants were particularly turned off by the technical jargon in the presentations (including ‘services’, as com- monly understood by IT staff), and the program staff’s pushing for immediately abstracting Q. Dombrowski 328 Literary and Linguistic Computing, Vol. 29, No. 3, 2014 . `` '' service-oriented architecture ( ) - , Service-oriented architecture . ... .. … ... 28---30, `` '' verbþdirect object ‘scholarly practices’ instead of facilitating a conversation about what scholars do. The spreadsheet was emblematic of the disconnect between the plan for workshop 1 and what scholars believed was needed, as it was unable to capture the narrative of their discussions. 
By the second day of the workshop, the exercises took on a less rigidly structured form, and this informed the process used with greater success in the subsequent three iterations of workshop 1.8 At the time, the incident at the first workshop 1 was largely interpreted as a tactical misstep, rather than the beginnings of a challenge to the entire premise and planned approach of Project Bamboo. After the completion of the workshop 1 series (28 April–16 July 2008), work continued as defined in the proposal: program staff aggregated the notes taken during the workshop 1 meetings, and distilled from that material a set of ‘themes of scholarly prac- tice’9 to present at workshop 2 (15–18 October 2008). Program staff also prepared and presented an introduction to SOA in the context of Bamboo, intended to link the themes of scholarly practice to the planning for future technical development that would be the focus of subsequent workshops. This approach to workshop 2 backfired. While developing the themes of scholarly practice, pro- gram staff had created accounts for over 400 work- shop 1 participants on the project wiki, anticipating that they would actively contribute to the process of theme distillation. The minimal uptake (six con- tributors, each making a few edits) was interpreted as a consequence of humanists being unaccustomed to using a wiki for scholarly discussions, com- pounded by the unintuitive interface of the Confluence wiki platform. In person, however, it quickly became clear that what scholars found unin- tuitive was the program staff’s approach of present- ing their livelihood back to them as a set of ‘scholarly practices’. Already frustrated by the seem- ingly purposeless decontextualization and misrepre- sentation of scholarship in the humanities, many workshop 2 attendees were not disposed to attempt to make sense of the technical language and the ‘wedding cake’ diagram used to present the SOA component of the project. In heated Q&A sessions, some participants went so far as to challenge the legitimacy of a cyberinfrastructure initiative for the humanities led by IT staff rather than by humanists themselves. During workshop 2, it became clear that ‘com- munity design’ could not simply mean that the community would deliberate the details of a web services framework. The community had spoken and made it clear that continuing to emphasize SOA would alienate the very members of the com- munity Bamboo was intended to benefit most: the scholars themselves. While a web services frame- work would continue to play an important role in the project, it was represented in only one or two of the six working groups 10 established at workshop 2. The other groups focused on topics drawn from the themes of scholarly practice, with the exception of ‘Stories’ (later renamed ‘Scholarly Narratives’), a last-minute addition to address concerns about the decontextualization inherent in the process of iden- tifying themes of scholarly practice. Participants were allowed to choose the working group in which they would participate, but the program staff strove to balance group membership, so that IT staff were not the only participants in Shared Services, librarians were not the only participants in the Tools & Content Partners, etc. 
Professional homogeneity within working groups would have made the discussions easier, but mixing up the membership was seen as a productive step toward developing a single community that bridged professional divides, with a shared vision informed by a diverse range of perspectives.

After workshop 2, working groups focused on specific needs, opportunities, and challenges for Bamboo in relation to their working group topic. Working group findings were presented and discussed at workshop 3,11 held 12–14 January 2009, along with a straw proposal outline12 and straw consortial model.13 The straw proposal outline introduced the idea that the Bamboo Implementation Project would be a 7–10 year endeavor that would need to be split into two phases. The straw proposal outline did not attempt to prioritize the foci of the different working groups, treating them all as part of the first phase (2010–2012). The resulting highly ambitious scope drew criticism from workshop attendees, who also noted the lack of specifics about what exactly Bamboo would do, and the lack of defined criteria for success.14

At workshop 4 (16–18 April 2009), the Bamboo staff presented a more detailed articulation of a 'Bamboo Program Document',15 which outlined the 7–10 year vision and defined the activities to be carried out in the first development phase. The major activities for Bamboo were divided into three areas, with the first two major areas slated for implementation in the first phase:16

(1) The Forum
    (a) Scholarly Network
    (b) Scholarly Narratives
    (c) Recipes (workflows)
    (d) Tools and Content Guide
    (e) Other Educational and Curricular Materials
    (f) Bamboo Community Environment(s)
(2) The Cloud
    (a) Services Atlas
    (b) Bamboo Exchange
    (c) Shared Services Lifecycle
    (d) Tool and Application Alignment Partnerships
    (e) Content Interoperability Partnerships
(3) Bamboo Labs
    (a) Diversity, Innovation, and Labs
    (b) Ecosystem of Projects and Initiatives
    (c) Structure (Explore, Plan, and Build)
    (d) Liaisons
    (e) Governance

While the workshop discussion draft of the program document had already benefited from two rounds of asynchronous feedback from participants, concerns remained about the lack of specificity in each of these areas.17 However, this did not hinder participants from expressing their enthusiasm for the areas of work proposed for the first phase of development. Grouped by institution, participants voted on each sub-area of the 'Forum' and the 'Cloud', to indicate interest (none/low/medium/high/potential leadership).18 Every topic except Tools and Content Guide had at least one potential leader, and Content Interoperability (CI) Partnerships, Services Atlas, and Scholarly Network all received a significant number of 'high' votes.

Workshop 5 (17–19 June 2009) featured presentations of demonstrator projects19 and discussions of the draft Bamboo Implementation Proposal20 intended to be submitted to the Mellon Foundation that fall. The proposal, as discussed at the workshop, had the following major areas of work:21

(1) Scholarly Networking—comprising the earlier Scholarly Networking and Bamboo Exchange from the program document.
(2) Bamboo Atlas—comprising Scholarly Narratives, Recipes (workflow), Tool and Content Guide, Educational and Curricular Materials, and Services Atlas from the program document.22
(3) Bamboo Services Platform—the major area of technical development for the project, comprising Tool and Application Alignment Partnerships, CI Partnerships, and Shared Services Lifecycle from the program document.

At workshop 5, the participants (comprising 43% arts and humanities faculty, 41% technologists, and 12% 'content partners', primarily librarians and archivists) were asked to vote (yes/no/abstain) on these areas of work. Participants overwhelmingly voted yes on all three,23 while a handful of abstainers continued to voice strong concerns about scope,24 particularly with regards to the Bamboo Atlas.

5 Bamboo implementation proposal

During the summer and fall of 2009, the Bamboo program staff engaged in an iterative feedback process with Chris Mackie from the Mellon Foundation on the proposal that developed out of workshop 5. The program staff intended to submit the proposal to the Mellon Foundation by the end of 2009, for consideration at the Mellon Board meeting in March 2010, with work beginning shortly thereafter. Instead, an organizational restructuring at the Mellon Foundation in December 2009 brought Bamboo proposal development to a halt. In this restructuring, the Mellon Foundation merged the RIT program that funded Bamboo into the
Even as the project’s scope contracted through the elimination of almost all of the community-oriented aspects, it expanded in other ways. Two new areas of work that had previ- ously received minimal attention were ‘work spaces’—virtual research environments intended to provide basic content management capabilities and/ or access to the tools on the services platform—and planning and design work for Corpora Space, ‘applications that will allow scholars to work on dispersed digital corpora using a broad range of powerful research tools and services’ (Project Bamboo, 2010, p. 11). Corpora Space was to be built on top of the Bamboo infrastructure during a subsequent technical development phase. In the Bamboo implementation proposal, UC Berkeley alone served as managing partner, with nine other universities contributing to the project: Australian National University, Indiana University, Northwestern, Tufts, University of Chicago, Univer- sity of Illinois—Urbana-Champaign, University of Maryland, Oxford, and University of Wisconsin— Madison. The University of Chicago PI for the Bamboo Planning Proposal, vice president and CIO Greg Jackson, left that institution in August 2009, followed by Chad Kainz, Bamboo Planning Project co-director, a year later. None of the Chi- cago-based staff who were actively involved in the management of the planning process reprised those roles in the implementation phase. In addition, UC Berkeley hired a new project manager, and had to develop new relationships with staff at the Univer- sities of Wisconsin and Maryland who took on areas of the project that Chicago had previously managed. These staffing changes led to a loss of the project’s organizational memory, which had particularly negative consequences for the message and tone of the project’s communication with scholarly communities. 6 Bamboo technology project It remains difficult to articulate succinctly what Project Bamboo was, without either resorting to barely informative generalities (‘humanities cyberin- frastructure, particularly for working with textual corpora’) or a list of the areas of work. The project struggled to identify a coherent vision that neatly encapsulated all the work being done in the name of Bamboo, or to clearly describe what future state the work would collectively realize. The lack of a shared vision was compounded by the staffing model for the different areas. Most institutions focused on one area or subarea, giving them little exposure to the work going on elsewhere in the pro- ject. Unlike the planning project working groups, What Ever Happened to Project Bamboo? Literary and Linguistic Computing, Vol. 29, No. 3, 2014 331 . six s `` '' `` '' `` '' -- -- -- -- a year later . - where membership represented a mix of scholars, technologists, and librarians, the different areas of the Bamboo technology project were each staffed by the ‘usual suspects’—technologists focusing on shared services and work spaces, librarians focusing on interoperability, and scholars focusing on Corpora Space. This arrangement helped lead to a sense of mutual mistrust among the different groups26—not atypical in project development,27 but corrosive nonetheless. Effective communication with scholarly and pro- fessional communities was never one of Project Bamboo’s greatest strengths. Even during the plan- ning project, most activity took place on a public wiki whose complex organization was a barrier to access. 
The news feed on the project Web site had always been updated sporadically, but the complete lack of updates to the public Web site between August 2010 and April 2011—a period including the first 6 months of the 18 month technology project—fueled confusion and doubt about what, if anything, Bamboo was doing. Once periodic com- munication resumed in April 2011 with the launch of a new rebranded Web site, the lack of a clear shared vision became more apparent, as did the challenges of having such a widely distributed pro- ject team; some areas of the project received much more visibility than others. Outside observers’ com- bined uncertainty and lack of agreement about what Bamboo was doing were detrimental to the project’s reputation, to the point where it became a source of concern for the project staff and Mellon Foundation alike. Nonetheless, a considerable amount of technical development and planning work took place under the auspices of the Bamboo Implementation Project between 2010 and 2012. Major accomplishments included the following: � Development of identity and access management (IAM) services,28 which also made possible ac- count linking (e.g. of a user’s university and Google accounts). � Development of a CI hub29 that normalized texts using the Bamboo Book Model.30 � Development of utility and scholarly services,31 and their deployment along with IAM services on a centrally hosted Bamboo Services Platform.32 � Investigation of HUBzero, Alfresco ECM, and the OpenSocial API as platforms for ‘work spaces’ or research environments for scholars33 that could be integrated with the Bamboo Services Platform. � Partnering with the long-running Digital Research Tools (DiRT) wiki to develop Bamboo DiRT (http://dirt.projectbamboo.org), which would serve as Bamboo’s ‘Shared Tools and Services Information Registry’. � The Corpora Space design process, where huma- nities scholars and tool developers conceptua- lized a set of applications that would allow scholars to work on dispersed digital corpora using a broad range of powerful research tools and services.34 7 The end of Project Bamboo Between December 2011 and December 2012, the UC Berkeley Bamboo program staff drafted two nearly complete proposals for a second development phase. The first, written in partnership with teams at the University of Wisconsin and the University of Maryland, directly followed from the Corpora Space planning process. The proposal was abandoned in June 2012, after it became clear that insufficient re- sources would be available. When the Mellon Foundation’s technical review of Bamboo empha- sized Bamboo’s place as an infrastructure project (rather than an application development project), Berkeley started over on a new proposal in that spirit. The new version, developed with a team from Tufts, focused on extending the infrastructure and demonstrating its utility through a ‘Classical philology reference implementation’. On 13 December 2012, days before the anticipated final submission, the Mellon Foundation declined to move ahead with inviting the Bamboo proposal, citing the project’s track record of failing to define itself or achieve adoption for its code, the fact that it had not retained its partners, as well as dissatisfac- tion with the proposal itself. The Mellon Foundation requested that the team bring the pro- ject to a close, with an eye toward making the pro- ject’s legacy visible to and usable by others. Q. Dombrowski 332 Literary and Linguistic Computing, Vol. 29, No. 
3, 2014 `` '' , w six , , - project's , Content Interoperability ( ) , - `` '' DiRT ( ) http://dirt.projectbamboo.org `` '' - `` '' 13, s Between January and March 2013, the remaining Bamboo staff worked with partners to develop and publish a documentation wiki that would serve as a sort of ‘reliquary’ for the project, alongside the code repository, issue tracker, the archived Web site, email lists, and social media accounts. Respecting the Mellon Foundation’s preferences, the Bamboo staff never publicly announced that Bamboo was over. Word simply spread informally and un- evenly35 beyond the notification of project partners, until the day when the Web site was replaced by the reliquary. 8 Bamboo’s afterlife Some of the components of Bamboo are still in use in other contexts. 8.1 Perseids The Perseids project at the Perseus Digital Library (http://www.perseus.tufts.edu/hopper/) integrates a variety of open-source tools and services to provide a platform for collaborative editing and annotation of classical texts and related objects. An instance of the Bamboo Services Platform is deployed as part of Perseids to provide access to the Tufts Morphology and Annotation Services, and the supporting Cache and Notification Services developed at Berkeley. Under new funding from the Mellon Foundation, Perseids developers will be exploring approaches, including those offered by Bamboo IAM compo- nents, for enabling the platform to better support cross-project and cross-institution collaboration. In addition, the Perseus Digital Library is currently exploring the viability of the Bamboo IAM infra- structure to support a centralized user model for the Perseus ecosystem of distributed applications and services. 8.2 CIFER Designs and technologies for account linking (part of Bamboo’s IAM work) have become the acknowl- edged basis of several items on the development roadmap for Community Identity Framework for Education and Research (CIFER, http://www.cifer project.org/), a collaborative effort across a large number of research institutions and consortia to provide an ‘agile, comprehensive, federation- and cloud-ready IAM solution suite’. 8.3 DiRT directory In October 2013, the Mellon Foundation funded a proposal for additional work on Bamboo DiRT, which would be rebranded as the DiRT directory. This new project included the development of an API that will facilitate data sharing with other digital humanities directories and community sites, includ- ing DHCommons (http://dhcommons.org) and the Commons-In-A-Box (http://commonsinabox.org/) platform, which powers sites such as the MLA Commons (http://commons.mla.org/). The DiRT directory continues to thrive as a community- driven project. 9 Conclusion Project Bamboo began with the ambitious dream of advancing arts and humanities research through the development of shared technology services. Conscious of the challenges for humanities cyberin- frastructure identified in the 2006 Our Cultural Commonwealth report (Unsworth et al., 2006) (e.g. ephemerality, copyright, and conservative academic culture), the Bamboo program staff identified those issues as out-of-scope for Bamboo after workshop 1,36 but they continued to impact the project none- theless (e.g. copyright as the fundamental motivat- ing force behind IAM work). 
Prior work on social science infrastructure development suggests that Bamboo's mode of engagement (bringing together people from the scholarly, technology, and library communities after Bamboo already had a conceptual and technical trajectory, while nonetheless expecting 'participatory design') would be a source of tension. Indeed, the wide range of responses to the initial technology-oriented proposal put Bamboo in a bind. Technologists and some librarians tended to see it as important and necessary, while many scholars felt that their needs lay elsewhere entirely. Changing scholars' minds would not be quick; as noted in Ribes and Baker (2007), 'conceptual innovation is an extended process: one cannot simply make claims about the importance of . . . [e.g. cyberinfrastructure] and expect immediate meaningful community uptake'. Accommodating the interests of all three groups would necessarily mean a broader scope, but additional supporters could bring with them additional resources to make such a scope possible. It also seemed more promising than the alternative of creating a new group of like-minded technologists and librarians who would move forward with an SOA-focused development effort without focusing on scholarly outreach and adoption. In retrospect, doing so might have led the project to greater technical success, but it is arguable whether taking such an approach from the start was even a real option, given Bamboo's public commitment to a 'community design process'.

From the early planning workshops to the Mellon Foundation's rejection of the project's final proposal attempt, Bamboo was dogged by its reluctance and/or inability to define itself concretely. In the early days, avoiding a concrete definition was motivated by a desire for the project to remain flexible and responsive to its community. The tendency toward generality persisted long after it had ceased being adaptive, even after it became a source of criticism. An infrastructure project like Bamboo could be expected to name the tools and corpora it would integrate as a way to be more concrete, but it became apparent that very few of the tools in use by digital humanists at that time were being refactored to fit the model Bamboo was architected to support (i.e. scholarly web services running on professionally managed servers). If 'true infrastructures only begin to form when locally constructed, centrally controlled systems are linked into networks and internetworks governed by distributed control and coordination processes' (Edwards et al., 2007), the shortage of locally constructed systems with wide scholarly uptake that were technically compatible with Bamboo was problematic [37].

The work done in the Bamboo technology project was pitched as laying the infrastructure for top-to-bottom support for working with textual corpora. Bamboo would support a complete scholarly workflow, from accessing and ingesting texts from repositories, to analyzing and curating them using scholarly web services, all within an environment that facilitated collaboration. This vision was complicated by the decision to include integration with three different research environment systems, each with a distinct approach and feature set.
This choice was partly pragmatic (allowing partners to focus on whatever platform their institution had already invested in [38]) and partly in keeping with Bamboo's philosophy (the infrastructure was intended to be flexible, not tied to any one user-facing platform).

Flexibility and scalability were part of the early value proposition for Bamboo, and they remained influential considerations in the architecture and development of the infrastructure. However, the infrastructure was architected in such a way that it was difficult to complete and release stand-alone components that could be tested and used while other parts were incomplete. As a result, it was nearly impossible to create demonstrator projects that scholars or digital humanities developers could try out and that potential funders could evaluate. Demonstrator projects could have effectively and concretely shown that Bamboo was producing something useful, or provided an opportunity for feedback at a stage where it could have been incorporated productively. The technical team and the scholarly team had very different perspectives on what was needed, which led to frustration and communication failures on both sides. Consequently, the technical team relied on hypothetical scholarly use cases. Given the emphasis placed on the importance of communication between technical and nontechnical communities in the literature on cyberinfrastructure development (e.g. Freeman, 2007), addressing this communication breakdown should have been a higher priority. The extensive development time required for infrastructure components, without opportunities to confirm that the components successfully fulfilled real needs, might have proven even more problematic had Bamboo continued.

The resources allocated to Bamboo were significantly smaller than the amounts provided to similarly scoped infrastructure projects in the sciences. Bamboo's struggle to produce value within these constraints was made more challenging by a failure to differentiate between needs essential to the humanities and those unique to the humanities. It is crucial in the long run for scholars to be able to work with texts in access-restricted repositories, but the prerequisite IAM infrastructure represents a common need across all universities. Seeing that existing consortia dedicated to working on this problem would not have a solution ready in time for Bamboo to adopt, it might have been wiser for Bamboo to redefine its initial scope to include only free-access textual repositories, allowing it to demonstrate success by sidestepping the encumbrance of copyright identified by Our Cultural Commonwealth. While Bamboo's IAM work did make significant technical contributions, it came at the cost of diverting limited resources from other areas of the project, and became a 'reverse salient' (Edwards et al., 2007) for the entire Bamboo infrastructure.

Deferring a decision on Bamboo's sustainability plan and operational model until the second phase of development was consequential on multiple fronts. From a technical angle, it risked path dependency problems: the best technology choices for a centrally run enterprise-level platform may have made it considerably harder for individual universities to run the platform under a different model.
From the social perspective, postponing decisions about what 'membership' would mean, how much it would cost, and what it would provide made it difficult for institutions to assess whether they would be 'winners' or 'losers' (Edwards et al., 2007) if Bamboo succeeded. While Bamboo program staff saw Bamboo as freeing up local staff to provide more hands-on consulting about the application of scholarly tools (rather than spending time configuring and managing locally run tools and environments), some groups were concerned that university administration might see those staff as redundant in the face of Bamboo, and lay them off rather than transition them to new kinds of faculty support. Particularly for the liberal arts colleges that had participated in the planning project, there was no way to engage with Bamboo to increase one's chances of ending up a 'winner', other than joining an occasional invite-only 'community' conference call. Given the expansive scope of Bamboo's other deliverables, it was unrealistic for Bamboo program staff to have additionally taken on the work of establishing a sustainability plan during the first phase of technical development. Still, deferring or constraining the scope of some of the technical work (e.g. reducing the number of work space platforms) in order to redirect resources toward determining a viable operational and membership model before the second phase of development might have made more institutions willing to invest in Bamboo.

Perhaps the greatest impediment to Bamboo's success was the lack of a shared vision among project leaders, development teams, and communications staff. In the beginning, Bamboo had multi-university, cross-professional teams whose members faced challenges in communication and culture but helped one another understand Bamboo's goals in more nuanced ways. During the development phase, teams were formed on the basis of profession and institution, each one working according to its own status quo, with little connection to a bigger picture. The Bamboo planning project asked participants 'what's in it for you?', an important consideration often overlooked in consortial efforts. But without a shared vision to counterbalance the pull of self-interest, a complex, multi-faceted project like Bamboo becomes little more than a funding umbrella for individual initiatives. As the likelihood of those initiatives intersecting in a coherent way decreases, project messaging becomes muddled, and the resulting decrease in public confidence and comprehension can jeopardize a project's continued existence.

Brett Bobley, director and CIO of the Office of Digital Humanities at the National Endowment for the Humanities, offered his own interpretation of and eulogy for Bamboo at Digital Humanities 2013, which may serve as a fitting conclusion here. He suggested that, if nothing else, Bamboo brought together scholars, librarians, and technologists at a crucial moment for the emergence of digital humanities. The conversations that ensued may not have been what the Bamboo program staff expected, but they led to relationships, ideas, and plans that have blossomed in the years that followed (e.g. DiRT and the TAPAS project), even as Bamboo itself struggled to find a path forward.

References

Dombrowski, Q. and Denbo, S. (2013). TEI and Project Bamboo.
Journal of the Text Encoding Initiative, 5. http://jtei.revues.org/787 (accessed 12 November 2013).

Edwards, P., Jackson, S., Bowker, G., and Knobel, C. (2007). Understanding infrastructure: dynamics, tensions, and design. Report from 'History & Theory of Infrastructure: Lessons for New Scientific Cyberinfrastructures', Designing Cyberinfrastructure for Collaboration and Innovation. http://cyberinfrastructure.groups.si.umich.edu//UnderstandingInfrastructure_FinalReport25jan07.pdf (accessed 30 April 2014).

Freeman, P. (2007). Is 'designing' cyberinfrastructure - or, even, defining it - possible? Designing Cyberinfrastructure for Collaboration and Innovation. http://cyberinfrastructure.groups.si.umich.edu//OECD-Freeman-V2-2.pdf (accessed 30 April 2014).

Kainz, C. (2010). The engine that started Project Bamboo. Friday Sushi. http://fridaysushi.com/2010/01/30/the-engine-that-started-project-bamboo (accessed 12 November 2013).

Project Bamboo. (2008). Bamboo Planning Project: an arts and humanities community planning project to develop shared technology services for research. Grant proposal to the Andrew W. Mellon Foundation. http://dx.doi.org/10.7928/H6J10129 (accessed 12 November 2013).

Project Bamboo. (2010). Bamboo technology proposal (Public). Grant proposal to the Andrew W. Mellon Foundation. http://dx.doi.org/10.7928/H6D798B1 (accessed 12 November 2013).

Ribes, D. and Baker, K. (2007). Modes of Social Science Engagement in Community Infrastructure Design. In Steinfield, C., Pentland, B. T., Ackerman, M., and Contractor, N. (eds), Communities and Technologies 2007. London: Springer, pp. 107-30.

Terras, M. (2008). Bamboozle. Melissa Terras' Blog. http://melissaterras.blogspot.com/2008/05/bambooozle.html (accessed 12 November 2013).

Unsworth, J., Courant, P., Fraser, S. et al. (2006). Our Cultural Commonwealth: The Report of the American Council of Learned Societies Commission on Cyberinfrastructure for Humanities and Social Sciences. American Council of Learned Societies. http://www.acls.org/cyberinfrastructure/cyber.htm (accessed 30 April 2014).

Notes

1 Despite later impressions to the contrary, early participation in Bamboo was open to any interested college or university (http://web.archive.org/web/20080706131357/http://projectbamboo.org/colleges-universities), museum or library (http://web.archive.org/web/20080706131442/http://projectbamboo.org/museums-libraries), or organization, society, or agency (http://web.archive.org/web/20080706131346/http://projectbamboo.org/organizations-societies-agencies) that could pay for their own travel and lodging. The university- and library-oriented calls for participation mentioned the possibility of 'limited travel support' that could be arranged on a case-by-case basis; in practice, Bamboo covered lodging for participating teams during the nights of the workshops.

2 As of November 2013, archived versions of the Bamboo Planning Project wiki (http://dx.doi.org/10.7928/H6RN35SK) and Bamboo Technology Project wiki (http://dx.doi.org/10.7928/H6MW2F28) are hosted at UC Berkeley.

3 Project Bamboo was one of the first initiatives the author was involved in when employed by the Academic Technologies group of central IT at the University of Chicago, shortly after leaving a Ph.D. program in the humanities and while concurrently pursuing an MLIS degree.
The author was a member of Bamboo’s core program staff throughout the planning process; while she was minimally engaged in the early stages of Bamboo’s implementation phase, by 2011 she was involved in both development and planning, and in 2012 she again joined the program staff at UC Berkeley, where she is still employed. 4 Later prose would reduce this number to three by col- lapsing the distinction between information scientists and librarians and eliminating computer science re- searchers. The latter group was barely represented in the attendees of workshop 1, let alone subsequent workshops. 5 One representative example, from a 2008 blog post entitled ‘Bamboozle’ (which also exemplifies the unfor- tunate wordplay on the project’s name that persisted throughout its duration): . . .an interesting proposal to sort out What Needs To Be Done to aid scholars in using computa- tional power and tools in their research. But there is very little evidence that they have done their homework to what efforts have gone into this before, and no mention of the digital huma- nities community/communities (such as Alliance of Digital Humanities Organizations (ADHO); Q. Dombrowski 336 Literary and Linguistic Computing, Vol. 29, No. 3, 2014 , among others http://jtei.revues.org/787 http://fridaysushi.com/2010/01/30/the-engine-that-started-project-bamboo http://fridaysushi.com/2010/01/30/the-engine-that-started-project-bamboo http://cyberinfrastructure.groups.si.umich.edu//UnderstandingInfrastructure_FinalReport25jan07.pdf http://cyberinfrastructure.groups.si.umich.edu//UnderstandingInfrastructure_FinalReport25jan07.pdf http://cyberinfrastructure.groups.si.umich.edu//UnderstandingInfrastructure_FinalReport25jan07.pdf http://cyberinfrastructure.groups.si.umich.edu//OECD-Freeman-V2-2.pdf http://cyberinfrastructure.groups.si.umich.edu//OECD-Freeman-V2-2.pdf http://dx.doi.org/10.7928/H6J10129 http://dx.doi.org/10.7928/H6J10129 http://dx.doi.org/10.7928/H6D798B1 http://melissaterras.blogspot.com/2008/05/bambooozle.html http://melissaterras.blogspot.com/2008/05/bambooozle.html http://www.acls.org/cyberinfrastructure/cyber.htm http://www.acls.org/cyberinfrastructure/cyber.htm http://web.archive.org/web/20080706131357/http://projectbamboo.org/colleges-universities http://web.archive.org/web/20080706131357/http://projectbamboo.org/colleges-universities http://web.archive.org/web/20080706131357/http://projectbamboo.org/colleges-universities http://web.archive.org/web/20080706131442/http://projectbamboo.org/museums-libraries http://web.archive.org/web/20080706131442/http://projectbamboo.org/museums-libraries http://web.archive.org/web/20080706131442/http://projectbamboo.org/museums-libraries http://web.archive.org/web/20080706131346/http://projectbamboo.org/organizations-societies-agencies http://web.archive.org/web/20080706131346/http://projectbamboo.org/organizations-societies-agencies `` '' http://dx.doi.org/10.7928/H6RN35SK http://dx.doi.org/10.7928/H6RN35SK http://dx.doi.org/10.7928/H6MW2F28 `` '' they've Association for Literary and Linguistic Computing (ALLC); Association for Computers and the Humanities (ACH); Society for Digital Humanities/Société pour l’étude des médias inter- actifs (SDH/SEMI); Text Encoding Initiative (TEI)) and the hundreds of scholars already tread- ing this path or trying to deal with the concerns raised in the proposal (Terras, 2008). 
6 Scholarly practice as defined by Bamboo: 'For example, authoring might be considered a scholarly practice that is comprised of many component tasks; these tasks may include a literature review, documenting citations, acquiring peer review, etc.' (Project Bamboo, 2008, p. 27).

7 The stated goal of workshop 2 was to ratify the findings of a report on scholarly practice written on the basis of feedback from the first workshop, and to 'aggregate the initial list of component tasks required to complete these practices along with desired automation capabilities' (Project Bamboo, 2008, p. 29). As a requirement for attending the second workshop, each institution had to send 'at least one arts and humanities scholar and one enterprise-level technologist with, if possible, either serious interest in or experience with Services-Oriented Architecture (SOA)' (Project Bamboo, 2008, p. 28). In workshop 3, 'a professional SOA consultant will train participants to leverage our task lists by converting them to services. We will then attempt to describe scholarly practices as a sequence of identified service capabilities (in comparison, at the end of the previous workshop scholarly practices were described as a set of component tasks)' (Project Bamboo, 2008, p. 30). In workshop 4, participants would 'assign some type of initial grouping of scholarly practices, and prioritization as to the order in which services should be developed' (Project Bamboo, 2008, p. 31), and begin discussing organizational issues for a Bamboo consortium and requirements for being a partner institution in the next phase; these topics would also serve as the focus for the fifth and final workshop.

8 At workshops 1b (Chicago, 15-17 May), 1c (Paris, 9-10 June), and 1d (Princeton, 14-16 July), there were six exercises:

(1) Initial impressions: What do you hope Bamboo will accomplish? What questions do you have regarding Bamboo? We are gathering together representatives from a range of backgrounds (scholars, libraries, IT staff, presses, and funding agencies) around the theme of how technology can better serve arts and humanities research. Based on what you have heard at the table and read from the proposal, what one or two questions, observations, and hopes would your table like to share with the group?

(2) Exploring scholarly practice: As a researcher, librarian, IT professional, computer scientist, etc., during a really good day, term, research cycle, etc., what productive things do you do in relation to humanities research?

(3) Common and uncommon: What are common themes that have emerged from your exploration of scholarly practices? Based on your discussion of scholarly practices, what are two themes that piqued the curiosity of those at your table, or are uncommon? What makes these themes common and uncommon?

(4) Unpacking a commonality: What discrete practices are involved in this theme? What outstanding issues need to be addressed in regard to this theme?

(5) Unpacking the uncommon: For whom/which disciplines or areas of study is this theme helpful? What discrete practices are involved in this theme? What outstanding issues need to be addressed in regard to this theme?

(6) Identify future scholarly practices/magic wand: When you look at new hires or up-and-coming graduate students, what practices do they use that are different from yours? If you had a magic wand, what would make your day, term, research cycle, etc. more productive in relation to research?
9 See http://dx.doi.org/10.7928/H6H41PBV for a list of the themes that were identified.

10 Education (professional development of faculty and staff around digital tools and methodologies for teaching and research), Institutional Support (identifying service models and articulating the scope and value proposition of Bamboo), Scholarly Networking (evaluating existing social networking and Virtual Research Environment platforms for potential adoption by Bamboo), Shared Services (comprising much of the original SOA vision), and Tools & Content Partners (identifying models and standards for tool and content discovery and integration). See http://dx.doi.org/10.7928/H6CC0XM4 for more information about working groups, and links to the wiki pages of individual working groups.

11 The agenda and notes for workshop 3 are available at http://dx.doi.org/10.7928/H67P8W9K.

12 Slides from the implementation proposal presentation and notes on the discussion that followed are available at http://dx.doi.org/10.7928/H63X84K7.

13 Slides from the consortial model presentation and notes on the discussion that followed are available at http://dx.doi.org/10.7928/H6057CVT.

14 These criticisms emerged in the discussion of the proposal: 'Focused on value proposition; really needs to start saying what it is. Need to be more specific concrete things on the table. Lots of things involving text processing. For this to have clearly perceived value—need to start saying what those things are. Also some consensus that just from social perspective begins to be important to go back home after receiving funding to go to these things, "here's what we're going to do"' (Table 10); 'Finiteness of resources, and realities of what have to be accomplished. Have to tell stories about people who could put resources in. Need more finite sense of what is involved. A little concerned that we haven't had that focusing-in phase.' (Table 12); 'Need to iterate - if Bamboo is ambitious, will fail over and over. Will succeed only if there's a sustainability model that will allow for tweaking and redesigning' (Table 13). See http://dx.doi.org/10.7928/H63X84K7.

15 All released versions of the Bamboo Program Document are available at http://dx.doi.org/10.7928/H6VD6WCJ.

16 For full descriptions of each of these areas, see http://dx.doi.org/10.7928/H6QN64N6.

17 Notes are available on the discussions about the Forum (http://dx.doi.org/10.7928/H6KW5CXG), Cloud (http://dx.doi.org/10.7928/H6G44N6G), and Labs (http://dx.doi.org/10.7928/H6BG2KW2).

18 See http://dx.doi.org/10.7928/H66Q1V5R for full results and discussion notes.

19 Notes on these presentations are available at http://dx.doi.org/10.7928/H62Z13FD. A larger list of demonstrators is available in the Demonstrator Report: http://dx.doi.org/10.7928/H6Z60KZ1. Dombrowski and Denbo (2013) includes a discussion of some of the challenges that the 'NYX/Barlach bibliography' project encountered when attempting to demonstrate a service for processing TEI.

20 All versions of the draft implementation proposal are available at http://dx.doi.org/10.7928/H6TD9V75. Version 0.5 was discussed at workshop 5.
21 A more thorough description of the areas of work in version 0.5 of the draft Bamboo Implementation Proposal can be found at http://dx.doi.org/10.7928/H6PN93HT. There was originally a fourth area of work, 'Bamboo Community', a repackaging of 'Bamboo community environments' from the program document. Participants largely agreed that this should be treated not as an area of work but as a component of the larger section on community and governance. As a result, this section was not put up for a vote.

22 In response to feedback from workshop 5, the Scholarly Networking area of work was merged with the Bamboo Atlas, and this combined entity was renamed the 'Bamboo Commons'.

23 See http://dx.doi.org/10.7928/H6JW8BS3 for full results.

24 'Direction of Bamboo Atlas is fine, but I have big reservations about the scope, both as it was described in original document and fear discussions haven't narrowed scope at all'; '[W]hen you're reading texts or doing markup, when you find a place that doesn't make sense, it's a place of interest but also a place where if you slice/dice differently, problem goes away. Atlas is a confusing chunk—what's in it, what does it do, trying to tease it out, etc. Not clear exactly what the atlas does; pieces of it that one has associated with it are useful. Not trying to eliminate what it's doing. But might make it cleaner to take pieces of Atlas (esp. ones that have to do with Bamboo users) and move to scholarly networking, and rename the whole thing.' See http://dx.doi.org/10.7928/H6JW8BS3.

25 This was reported publicly in the Chronicle of Higher Education: http://chronicle.com/blogs/wiredcampus/in-potential-blow-to-open-source-software-mellon-foundation-closes-grant-program/19519. On 7 January, the following message was posted to the 'News' section of the Project Bamboo Web site: 'On 5 January 2010, the Chronicle of Higher Education published on its blog an article regarding recent changes at the Mellon Foundation and in particular, the closure of the Research in Information Technology (RIT) program. Although the planning project had been supported by RIT, the changes have had a minimal impact on Bamboo. At the end of December, both the University of California, Berkeley, and the University of Chicago were contacted by the Foundation, and Bamboo was smoothly migrated into the Scholarly Communications program. In short, the transition has gone well, and we look forward to working with Scholarly Communications into the future.' (http://web.archive.org/web/20101231171544/http://projectbamboo.org/news?page=2)

26 This frequently manifested itself in the concern that the scholars would be unable to design sufficiently scalable applications, and that the technologists
would spend inordinate amounts of resources on systems with minimal scholarly utility. These concerns were never raised through official channels, but had a real presence in informal conversations among members of each professional group.

27 This topic often arose over the course of the planning project workshops. Some examples: 'sees huge gulf between librarians/faculty and technologists; so here is an opportunity to communicate with each other' (Ex 1, 1b-B); 'hope bamboo moves beyond the usual conversation between humanities scholars and digital technology, i.e. "What do you want?", "What can you do?" Also troubled by formula of service, that digital technology folk and librarians are there just to "service" the humanities faculty; should be a partnership of equals, both have research goals they want to pursue' (Ex 1, 1b-D); 'Libraries, Publishing and Faculty are not talking. IT in the background. Efficiency and Effectiveness are not entirely a humanities priority.' (Ex 1, 1b-E); 'Humanities and IT people have different definitions of Effectiveness v Efficiency? Humanities has "productive inefficiency"' (Ex 1, 1b-E). See http://quinndombrowski.com/projects/project-bamboo/data/building-partnerships-between-it-professionals-and-humanists for more quotes from the planning project workshops that refer to this phenomenon.

28 For further information about Bamboo's IAM work, see http://dx.doi.org/10.7928/H6F769GD.

29 For more information about the architecture and implementation of the CI hub, see http://dx.doi.org/10.7928/H69G5JRP.

30 See http://dx.doi.org/10.7928/H65Q4T1C for a description of the Bamboo Book Model, including its implementation through a CMIS binding. The Bamboo Book Model is also discussed in Dombrowski and Denbo (2013).

31 See http://dx.doi.org/10.7928/H61Z4291 for a list of service APIs that were developed by Bamboo.
32 By proxying access through the Bamboo Services Platform, remotely running scholarly services could take advantage of IAM and utility services (e.g. result set caching and notification) hosted on the Platform. See http://dx.doi.org/10.7928/H6X63JTN for more about the architecture, development, and invocation of centrally hosted Bamboo services.

33 See http://dx.doi.org/10.7928/H6SF2T3B for details about the type and extent of integration accomplished for each platform.

34 See http://dx.doi.org/10.7928/H6NP22C0 for information about the design process.

35 During this transition period, the author received an email from a Bamboo planning project participant inquiring after upcoming opportunities for his liberal arts institution to become more involved. Even a few months after the Project Bamboo Web site was replaced, at Digital Humanities 2013, the author fielded multiple questions about the status of Bamboo.

36 An 'Advocacy' working group was discussed at workshop 2 (http://dx.doi.org/10.7928/H6RF5RZJ), but participants were concerned that it failed to make a clear distinction between the self-promotion necessary for Bamboo's adoption and advocacy with regard to larger issues facing digital humanities, such as those laid out in Our Cultural Commonwealth. Ultimately, a working group was not formed around this topic after workshop 2; the key issues for Bamboo in this area were reframed as 'principles for leadership', and explicitly put on hold (http://dx.doi.org/10.7928/H6MS3QNJ).

37 The Bamboo program staff members were aware that, in 2008, a good deal of scholarly functionality was only available as desktop software (e.g. Juxta) or as systems that required complex installation (e.g. PhiloLogic). They anticipated that software development in digital humanities would evolve toward a web services model, following trends in enterprise software development. Some tools have moved in this direction: Juxta released a web service in 2012 (http://www.juxtasoftware.org/on-the-juxta-beta-release-and-taking-collation-online/), and PhiloLogic 4 includes web services (http://dx.doi.org/10.7928/H6H12ZX4). However, as of 2014, scholarly tools are still not expected to be delivered as web services, and a great deal of work is done using stand-alone web applications such as Voyant Tools (http://voyant-tools.org/), or locally run packages such as MALLET (http://mallet.cs.umass.edu/).

38 The modest duration of these institutional commitments came into conflict with the longer development, deployment, and support timelines for a large cyberinfrastructure initiative. While the level of Bamboo infrastructure integration for HUBzero came closest to achieving the vision of the 'work space', by 2012 the University of Wisconsin-Madison was moving away from supporting HUBzero. Work was underway to port the integration code to Drupal (which had been selected as the 'work space' platform for the second phase of technical development) when Bamboo was shut down.
Killer Applications in Digital Humanities

Patrick Juola
Duquesne University
Pittsburgh, PA 15282
UNITED STATES OF AMERICA
juola@mathcs.duq.edu

August 31, 2006

Abstract

The emerging discipline of "digital humanities" has been plagued by a perceived neglect on the part of the broader humanities community. The community as a whole tends not to be aware of the tools developed by DH practitioners (as documented by the recent surveys by Siemens et al.), and tends not to take seriously many of the results of scholarship obtained by DH methods and tools. This paper argues for a focus on deliverable results in the form of useful solutions to common problems that humanities scholars share, instead of simply new representations. The question to address is what needs the humanities community has that can be dealt with using DH tools and techniques, or equivalently what incentive humanists have to take up and to use new methods. This can be treated in some respects like the computational quest for the "killer application": a need of the user group that can be filled, and by filling it, create an acceptance of that tool and the supporting methods/results. Some definitions and examples are provided both to illustrate the idea and to support why this is necessary. The apparent alternative is the status quo, where digital research tools are brilliantly developed, only to languish in neglect and disuse.

1 Introduction

"The emerging discipline of digital humanities". . . . Arguably, "digital humanities" has been emerging for decades, without ever having fully emerged. One of the flagship journals of the field, Computers and the Humanities, has published nearly forty volumes without having established the field as a mainstream subdiscipline. The implications of this are profound: tenure-track opportunities for DH specialists are rare, publications are not widely read or valued, and, perhaps most seriously in the long run, the advances made are not used by mainstream scholars.

This paper analyzes some of the patterns of neglect, the ways in which mainstream humanities scholarship fails to value and participate in the digital humanities community. It further suggests one way to increase the profile of this research: by focusing on the identification and development of "killer" applications (apps), computer applications that solve significant problems in the humanities in general.

2 Patterns of Neglect

2.1 Patterns of participation

A major indicator of the neglect of digital humanities as a humanities discipline is the lack of participation, particularly by influential or high-impact scholars. As an example, the flagship (or at least, longest running) journal in the field of "humanities computing" is Computers and the Humanities, which has been published since the 1960s. Despite this, the impact of this journal has been minimal. The Journal Citation Reports database suggests that for 2005, the impact factor of this journal (defined by the JCR help pages (http://jcrweb.com/www/help/hjcrgls2.htm, accessed June 15, 2006) as "the number of current citations to articles published in the two previous years divided by the total number of articles published in the two previous years") is a relatively low 0.196. (This is actually a substantial improvement from 2002's impact factor of 0.078.) In terms of averages from 2002-4, CHum was the 6494th most cited journal out of a sample of 8011, scoring in only the 20th percentile.
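Stated as a formula (the notation here is mine, not JCR's): let A_{y-1} and A_{y-2} be the sets of articles the journal published in the two previous years, and let c_y(S) be the number of year-y citations to the articles in a set S. The quoted definition is then

```latex
\mathrm{IF}_{y} = \frac{c_{y}\left(A_{y-1} \cup A_{y-2}\right)}{\lvert A_{y-1} \rvert + \lvert A_{y-2} \rvert}
```

so CHum's 2005 figure of 0.196 amounts to roughly one citation for every five articles published in 2003-4.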
By contrast, the most influential journal in the field of "computer applications," Bioinformatics, scores above 3.00; Computational Linguistics scores at 0.65; the Journal of Forensic Science at 0.75. Neither Literary and Linguistic Computing, Text Technology, nor the Journal of Quantitative Linguistics even made the sample. In other words, scholars tend not to read, or at least cite, work published under the heading of humanities computing.

Do they even participate? In six years of publication (1999-2004; volumes 33-38), CHum published 101 articles, with 205 different authorial affiliations (including duplicates) listed. Who are these authors, and do they represent high-profile and influential scholars? The unfortunate answer is that they do not appear to. Of the 205 affiliations, only 5 are from "Ivy League" universities, the single most prestigious and influential group of US universities. Similarly, of the 205 affiliations, only sixteen are from universities recognized by US News and World Report [USNews, 2006] as having one of the top 25 departments in any of the disciplines of English, history, or sociology. Only two affiliations are among the top ten in those disciplines. While it is of course unreasonable to expect any group of American universities to dominate a group of international scholars, the conspicuous and almost total absence of faculty and students from top-notch US schools is still important. Nor is this absence confined to US scholars; only one affiliation from the top 5 Canadian doctoral universities (according to the 2005 Maclean's ranking) appears. (Geoff Rockwell has pointed out that the Maclean's rankings do not necessarily identify the "best" research universities in Canada, and that a better list of elite research universities would be the so-called "Group of 10" or G-10 schools. Even with this list, only three papers (two from Alberta, one from McMaster) appear.) Australian elite universities (the Go8) are slightly better represented: three affiliations from Melbourne, one from Sydney. Only in Europe is there broad participation from recognized elite universities such as the LERU. The English-speaking LERU universities (UCL, Cambridge, Oxford, and Edinburgh) are all represented, as are the universities of Amsterdam, Leuven, Paris, and Utrecht, despite the language barrier. However, students and faculty from Harvard, Yale, Berkeley, Toronto, McGill, and Adelaide (in many cases, the current and future leaders of the fields) are conspicuously absent.

Perhaps the real heavyweights are simply publishing their DH work elsewhere, but are still a part of the community? A study of the 118 abstracts accepted to the 2005 ACH/ALLC conference (Victoria) shows that only 7 included affiliations from universities in the "top 10" of the USNews ranking. Only two came from universities in the "top 5" of the Maclean's ranking, and only 6 from Ivies. (Four of those six were from the well-established specialist DH program at Brown, a program unique among Ivies.)
A similar analysis shows low participation among the 151 abstracts at the 2006 DH conference (Paris). The current and future leaders seem not to participate in the community, either.

School                        Papers (2005)              Papers (2006)
USNews Top 10                 7                          4
Harvard                       -                          -
Cal-Berkeley                  1                          1
Yale                          -                          -
Princeton                     1                          -
Stanford                      1                          2
Cornell                       -                          -
Chicago                       -                          -
Columbia                      1                          -
Johns Hopkins                 -                          -
UCLA                          -                          -
Penn                          -                          -
Michigan-Ann Arbor            2                          -
Wisconsin-Madison             -                          -
UNC-Chapel Hill               1                          1
Maclean's top 5               2                          3
McGill                        -                          -
Toronto                       1 (3 authors)              1
Western                       -                          1
UBC                           1                          1
Queen's                       -                          -
Ivies not otherwise listed    4                          6
Brown                         4 (one paper, 2 authors)   6
Dartmouth                     -                          -

Table 1: Universities included for analysis of 2005 ACH/ALLC and 2006 DH proceedings

2.2 Tools and awareness

People who do not participate in a field cannot be expected to be aware of the developments it creates, an expectation sadly supported by recent survey data. In particular, [Siemens et al., 2004, Toms and O'Brien, 2006] reported on a survey of "the current needs of humanists" and announced that, while over 80% of survey respondents use e-text and over half use text analysis tools, they are not even aware of "commonly available tools such as TACT, WordCruncher and Concordancer." The tools of which they are aware seem to be primarily common Microsoft products such as Word and Access.

This lack of awareness is further supported by [Martin, 2005] (emphasis mine):

    Some scholars see interface as the primary concern; [electronic] resources are not designed to do the kind of search they want. Others see selection as a problem; the materials that databases choose to select are too narrow to be of use to scholars outside of that field or are too broad and produce too many results. Still others question the legitimacy of the source itself. How can an electronic copy be as good as seeing the original in a library? Other, more electronically oriented scholars see the great value of accessibility of these resources, but are unaware of the added potential for research and teaching. The most common concern, however, is that scholars believe they would use these resources if they knew they existed. Many are unaware that their library subscribes to resources or that universities are sponsoring this kind of research.

Similarly, [Warwick, 2004a] describes the issues involved with the Oxford University Humanities Computing Unit (HCU). Despite its status as an "internationally renowned centre of excellence in humanities computing,"

    [P]ersonal experience shows that it was extremely hard to convince traditional scholars in Oxford of the value of humanities computing research. This is partly because so few Oxford academics were involved in any of the work the HCU carried out, and had little knowledge of, or respect for, humanities computing research. Had there been a stronger lobby of interested academics who had a vested interest in keeping the centre going because they had projects associated with it, perhaps the HCU could have become a valued part of the humanities division. That it did not demonstrates the consequences of a lack of respect for digital scholarship amongst the mainstream.

3 Killer Apps and Great Problems

One possible reason for this apparent neglect is a mismatch of expectations between the expected needs of the audience (market) for the tools and the community's actual needs. A recent paper [Gibson, 2005] on the development of an electronic scholarly edition of Clotel may illustrate this. The edition itself is a technical masterpiece, offering, among other things, the ability to compare passages among the various editions and even to track word-by-word changes. However, it is not clear who among Clotel scholars will be interested in using this capacity or this edition; many scholars are happy with their print copies and the capacities print grants (such as scribbling in the margins or reading on a park bench). Furthermore, the nature of the Clotel edition does not lend itself well either to application to other areas or to further extension.
The knowledge gained in the process of annotating Clotel does not appear to generalize to the annotation of other works (certainly, no general consensus has emerged about "best practices" in the development of a digital edition, and the various proposals appear to be largely incompatible and even incomparable). The Clotel edition is essentially a service offered to the broader research community in the hope that it will be used, and runs a great risk of becoming simply yet another tool developed by the DH specialists to be ignored. Quoting further from [Martin, 2005]:

    [Some scholars] feel there is no incentive within the university system for scholars to use these kinds of new resources.

Let alone, one might add, to create them. This paper argues that for a certain class of resources, there should be no need for an incentive to get scholars to use them. Digital humanities specialists should be in a unique position both to identify the needs of mainstream humanities scholars and to suggest computational solutions that the mainstream scholars will be glad to accept.

3.1 Definition

The wider question to address, then, is what needs the humanities community has that can be dealt with using DH tools and techniques, or equivalently what incentive humanists have to take up and to use new methods. This can be treated in some respects like the computational quest for the "killer application": a need of the user group that can be filled, and by filling it, create an acceptance of that tool and the supporting methods/results. Digital humanities needs a "killer application."

"Killer application" is a term borrowed from the discipline of computer science. In its strictest form, it refers to an application program so useful that users are willing to buy the hardware it runs on just to have that program. One of the earliest examples of such an application was the spreadsheet, as typified by VisiCalc and Lotus 1-2-3. Having a spreadsheet made business decision-making so much easier (and more accurate and profitable) that businesses were willing to buy the computers (Apple IIs or IBM PCs, respectively) just to run spreadsheets. Gamers by the thousands have bought Xbox gaming consoles just to run Halo. A killer application is one that will make you buy, not just the product itself, but also invest in the necessary infrastructure to make the product useful.

For digital humanities, this term should be interpreted in a somewhat broader sense. Any intellectual product (a computer program, an abstract tool, a theory, an analytic framework) can and should be evaluated in terms of the "affordances" [Gibson, 2005, Ruecker and Devereux, 2004] it creates. In this framework, an "affordance" is simply "an opportunity for action" [Ruecker and Devereux, 2004]; spreadsheets, for instance, create opportunities to make business decisions quickly on the basis of incomplete or hypothesized data, while Halo creates the opportunity for playing a particular game. Ruecker provides a framework for comparing different tools in terms of their "affordance strength," essentially the value offered by the affordances of a specific tool. In this broader context, a "killer app" is any intellectual construct that creates sufficient affordance strength to justify the effort and cost of accepting, not just the construct itself, but the supporting intellectual infrastructure.
It is a solution sufficiently interesting to, by itself, retrospectively justify looking at the problem it solves: a Great Problem that can both empower and inspire.

Three properties appear to characterize such "killer apps". First, the problem itself must be real, in the sense that other humanists (or the public at large) should be interested in the fruits of its solution. For example, the organizers of a recent NSF summit on "Digital Tools for the Humanities" identified several examples of the kinds of major shifts introduced by information technology in various areas. In their words,

    When information technology was first applied [to inventory-based businesses], it was used to track merchandise automatically, rather than manually. At that time, the merchandise was stored in the same warehouses, shipped in the same way, depending upon the same relations among producers and retailers as before [...]. Today, a revolution has taken place. There is a whole new concept of just-in-time inventory delivery. Some companies have eliminated warehouses altogether, and the inventory can be found at any instant in the trucks, planes, trains, and ships delivering sufficient inventory to re-supply the consumer or vendor — just in time. The result of this is a new, tightly interdependent relationship between suppliers and consumers, greatly reduced capital investment in "idle" merchandise, and dramatically more responsive service to the final consumer.

A killer application in scholarship should be capable of effecting similar change in the way that practicing scholars do their work. Only if the problem is real can an application solving it be a killer. The Clotel edition described above appears to fail under this property precisely because only specialists in Clotel (or in 19th-century or African-American literature) are likely to be interested in the results; a specialist in the Canterbury Tales will not find her work materially affected.

Second, the problem must get buy-in from the humanities computing community itself, in that humanities computing specialists will be motivated to do the actual work. The easiest and probably cheapest way to do this is for the process of solution itself to be interesting to the participating scholars. For example, the compiling of a detailed and subcategorized bibliography of all references to a given body of work would be of immense interest to most scholars; rather than having to pore through dozens of issues of thousands of journals, they could simply look up their field of interest. (This is, in fact, very close to the service that Thomson Scientific provides with the Social Science Citation Index, or that Penn State provides with CiteSeer.) The problem is that though the product is valuable, the process of compiling it is dull, dreary, and unrewarding. There is little room for creativity, insight, and personal expression in such a bibliography. Most scholars would not be willing to devote substantial effort (perhaps several years of full-time work) to a project with such minimal reward. (By contrast, the development of a process to automatically create such a bibliography could be interesting and creative work.) The process of solving interesting problems will almost automatically generate papers and publications, draw others into the process of solving it, and create opportunities for discussion and debate.
We can again compare this to the publishing opportunities for a bibliography: is "my bibliography is now 50% complete" a publishable result?

Third, the problem itself must be such that even a partial solution or an incremental improvement will be useful and/or interesting. Any problem that meets the two criteria above is unlikely to submit to immediate solution (otherwise someone would probably already have solved it). Similarly, any such problem is likely to be sufficiently difficult that solving it fully would be a major undertaking, beyond the resources that any single individual or group could likely muster. On the other hand, being able to develop, deploy, and use a partial solution will help advance the field in many ways. The partial solution, by assumption, is itself useful. Beyond that, researchers and users have an incentive to develop and deploy improvements. Finally, the possibility of supporting and funding incremental improvements makes the project more likely to attract funding, and enhances the status of the field as a whole.

3.2 Some historical examples

To more fully understand this idea of a killer app, we should first consider the history of scholarly work, and imagine the life of a scholar c. 1950. He (probably) spends much of his life in the library, reading paper copies of journal articles and primary sources to which he (or his library) has access, taking detailed notes by hand on index cards, and laboriously writing drafts in longhand which he will revise before finally typing (or giving to a secretary to type). His new ideas are sent to conferences and journals, eventually to find their way into the libraries of other scholars worldwide over a period of months or years. Collaboration outside of his university is nearly unheard-of, in part because the process of exchanging documents is so difficult.

Compare that with the modern scholar, who can use a photocopier or scanner to copy documents of interest and write annotations directly on those copies. She can use a word processor (possibly on a portable computer) both to take research notes and to extend those notes into articles; she has no need to write complete drafts, can easily rearrange or incorporate large blocks of text, and can take advantage of the computer to handle "routine" tasks such as spelling correction, footnote numbering, bibliography formatting, and even pagination. She can directly incorporate the journal's formatting requirements into her work (so that the publisher can legitimately ask for "camera-ready" manuscripts as a final draft), eliminating or reducing the need both for typists and typesetters. She can access documents from the comfort of her own office or study via an electronic network, and use advanced search technology to find and study documents that her library does not itself hold. She can similarly distribute her own documents through that same network and make them available to be found by other researchers. Her entire work-cycle has been significantly changed (for the better, one hopes) by the availability of these computational resources.

We thus have several historical candidates for what we are calling "killer apps": xerographic reproduction and scanning, portable computing (both arguably hardware instead of software), word processing and desktop publishing (including subsystems such as bibliographic packages and spelling checkers), networked communication such as email and the Web, and search technology such as Google.
These have all clearly solved significant issues in the way humanities research is generally performed (i.e. met the first criterion). In Ruecker's terms, they have all created "affordances" of the sort that no modern scholar would choose to forego. The amount of research work (journals, papers, patents, presentations, and books) devoted to these topics suggests that researchers themselves are interested in solving the problems and improving the technologies, in many cases incrementally (e.g., "how can a search engine be tuned to find documents written in Thai?"). Of course, for many of these applications, the window of opportunity has closed, or at least narrowed. A group of academics is unlikely to have the resources to build and deploy a product competing with Microsoft and/or Google. On the other hand, the very fact that humanities scholars are something of a niche market may open the door to incremental killer apps based upon (or built as extensions to) mainstream software: applications focused specifically on the needs of practicing scholars. The next section presents a partial list of some candidates that may yield killer applications in the foreseeable future. Some of these candidates are taken from my own work, some from the writings of others.

3.3 Potential current killer apps

3.3.1 Back of the Book Index Generation

Almost every nonfiction book author has been faced with the problem of indexing. For many, this will be among the most tedious, most difficult, and least rewarding parts of writing the book. The alternative is to hire a professional indexer (perhaps a member of an organization such as the American Society of Indexers, www.asindexing.org) and pay a substantial fee, which simply shifts the uncomfortable burden to someone else, but does not substantially reduce it.

A good index provides much more than the mere ability to find information in a text. The Clive Pyne book indexing company (http://www.cpynebookindexing.com/what_makes_a_good_index.htm, accessed 5/31/2006) lists some aspects of what a good index provides. According to them, "a good index:

• provides immediate access to the important terms, concepts and names scattered throughout the book, quickly and efficiently;
• discriminates between useful information on a subject, and a passing mention;
• has headings which are concise, accurate and unambiguous reflecting the contents and terminology used in the text;
• has sufficient cross-references to connect related terms;
• anticipates how readers will search for information;
• reveals the inter-relationships of topics, concepts and names so that the reader need not read the whole index to find what they are looking for;
• provides terminology which might not be used in the text, but is the reference point that the reader will use for searching through the index;
• can make the difference between a book and a very good book"

A traditional back-of-the-book (BotB) index is a substantial intellectual accomplishment in its own right. In many ways, it is an encapsulated and stylized summary of the intellectual structure of the book itself. "A good index is an objective guide to the text, a link between the author's ideas and the reader. It should be a road map that leads readers to every relevant idea without frustrating detours and dead ends." (Kim Smith, http://www.smithindexing.com/whyprof.html, accessed 5/31/2006.) And it is specifically not just a concordance or a list of terms appearing in the document. It is thus surprising that a tedious task of such importance has not yet been computerized.
This is especially surprising given the effectiveness of search engines such as Google at "indexing" the unimaginably large volume of information on the Web. However, the tasks are subtly different; a Google search is not expected to show knowledge of the structure of the documents or the relationships among the search terms. As a simple example, a phrasal search on Google (May 31, 2006) for "a good index" found, as expected, several articles on back-of-the-book indexing. It also found several articles on financial indexing and index funds, and a scholarly paper on glycemic control as measured ("indexed") by plasma glucose concentrations. A good text index would be expected to identify these three subcategories, to group references appropriately, and to offer them to the reader proactively as three separate subheadings. A good text index is not simply a search engine on paper, but an intellectual précis of the structure of the text.

This is therefore an obvious candidate for a killer application. Every humanities scholar needs such a tool. Indeed, since chemistry texts need indexing as badly as history texts do, scholars outside of the humanities also need it. Unfortunately, not only does it not (yet) exist, but it isn't even clear at this writing what properties such a tool would have. Thus there is room for fundamental research into the attributes of indices as a genre of text, as well as into the fundamental processes of compiling and evaluating indices and their expression in terms of algorithms and computation. I have presented elsewhere [Juola, 2005, Lukon and Juola, 2006] a possible framework to build a tool for the automatic generation of such indices. Without going into technical detail, the framework identifies several important (and interesting) cognitive/intellectual tasks that can be independently solved in an incremental fashion. Furthermore, this entire problem clearly admits of an incremental solution, because a less-than-perfect index, while clearly improvable, is still better than no index at all, and any time saved by automating the more tedious parts of indexing will still be a net gain to the indexer. Thus all three components of the definition of killer app given above are present, suggesting that the development of such an indexing tool would be beneficial both inside and outside the digital humanities community.

3.3.2 Annotation tools

As discussed above, one barrier to the use of E-texts and digital editions is the current practices of scholars with regard to annotation. Even when documents are available electronically, many researchers (myself included) will often choose to print them and study them on paper. Paper permits one not only to mark text up and to make changes, but also to make free-form annotations in the margins, to attach PostIt notes in a rainbow of colors, and to share commentary with a group of colleagues. Annotation is a crucial step in recording a reader's encounter with a text, in developing an interpretation, and in sharing that interpretation with others.
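To make concrete what a digital annotation tool would have to record, here is a minimal sketch of an annotation structure. The class design and field names are illustrative assumptions of the present edition, not drawn from any of the projects discussed; they loosely echo the anchor-plus-body pattern later standardised in the W3C Web Annotation model.

from dataclasses import dataclass, field
from typing import List, Optional

@dataclass
class Target:
    """Anchors an annotation to a span of a digital document."""
    document_uri: str            # the annotated resource (text, image, film clip...)
    start: Optional[int] = None  # character offset, frame, or timestamp start
    end: Optional[int] = None    # ...and end; None means the whole document

@dataclass
class Annotation:
    """One reader's free-form response to a target: a marginal note,
    a 'doodle', an attached media file, or a colour-coded highlight."""
    target: Target
    creator: str
    body_text: str = ""                             # textual commentary
    media: List[str] = field(default_factory=list)  # URIs of .wav, animation, etc.
    tags: List[str] = field(default_factory=list)
    shared_with: List[str] = field(default_factory=list)  # colleagues, groups

# A reviewer's comment anchored to characters 1040-1102 of an article,
# in place of the cumbersome "page 7, line 12" convention:
note = Annotation(
    target=Target("http://example.org/journal/article17.pdf", 1040, 1102),
    creator="reviewer-2",
    body_text="Claim needs a citation.",
    tags=["citation-needed"],
)
print(note.body_text, "on", note.target.document_uri)

The point of the sketch is that the anchor (target) and the response (body) are separable: the same anchoring mechanism can support text, sound, or video bodies, which is what would take such a tool beyond the capacities of print.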
The recent IATH Summit on Digital Tools for the Humanities [IATH Summit, 2006] identified this process of annotation and interpretation as a key process underlying humanistic scholarship, and specifically discussed the possible development of a tool for digital annotation, a "highlighter's tool," that would provide the same capacities of annotation of digital documents, including multimedia documents, that print provides. The flexibility of digital media means, in fact, that one should be able to go beyond the capacities of print — for example, instead of doodling a simple drawing in the margin of a paper, one might be able to "doodle" a Flash animation or a .wav sound file.

Discussants identified at least nine separate research projects and communities that would benefit from such a tool. Examples include "a scholar currently writing a book on Anglo-American relations, who is studying propaganda films produced by the US and UK governments and needs to compare these with text documents from on-line archives, coordinate different film clips, etc."; "an add-on tool for readers (or reviewers) of journal articles," especially of electronic journal systems (the current system of identifying comments by page and line number, for example, is cumbersome for both reviewers and authors); and "an endangered language documentation project that deals with language variation and language contact," where multilingual, multialphabet, and multimedia resources must be coordinated among a broad base of scholars. Such a tool has the potential to change the annotation process as much as the word processor has changed the writing and publication process.

Can community buy-in be achieved? There is certainly room for research and for incremental improvements, both in defining the standards and capacities of the annotations and in expanding those capacities to meet new requirements as they evolve. For example, early versions of such a project would probably not be capable of handling all forms of multimedia data; a research-quality prototype might simply handle PDF files and sound, but not video. It is not clear that the community support is available for building early, simple versions. Although "a straw poll showed that half of [the discussants] wanted to build this kind of tool, and all wanted to use it" [IATH Summit, 2006], responding to a straw poll is one thing and devoting time and resources is another altogether; it is not clear that any software development on this project has yet happened. However, given the long-term potential uses and research outcomes from this kind of project, it clearly has the potential to be a killer application.

3.3.3 Resource exploration

Another issue raised at the summit is that of resource discovery and exploration. The huge amount of information on the Web is, of course, a tremendous resource for all of scholarship, and companies such as Google (especially with new projects such as Google Images and Google Scholar) are excellent at finding and providing access. On the other hand, "such commercial tools are shaped and defined by the dictates of the commercial market, rather than the more complex needs of scholars" [IATH Summit, 2006]. This raises issues about access to more complex data, such as textual markup, metadata, and data hidden behind gateways and search interfaces. Even where such data is available, it is rarely compatible from one database to another, and it is hard to pose questions that take advantage of the markup.
In the words of the summit report:

What kinds of tools would foster the discovery and exploration of digital resources in the humanities? More specifically, how can we easily locate documents (in multiple formats and multiple media), find specific information and patterns in across [sic] large numbers of scholarly disciplines and social networks? These tasks are made more difficult by the current state of resources and tools in the humanities. For example, many materials are not freely available to be crawled through or discovered because they are in databases that are not indexed by conventional search engines or because they are behind subscription-based gates. In addition, the most commonly used interfaces for search and discovery are difficult to build upon. And, the current pattern of saving search results (e.g., bookmarks) and annotations (e.g., local databases such as EndNote) on local hard drives inhibits a shared scholarly infrastructure of exploration, discovery, and collaboration.

Again, this has the potential to effect significant change in the day-to-day working life of a scholar, by making collaborative exploration and discovery much more practical and rewarding, possibly changing the culture by creating a new "scholarly gift economy in which no one is a spectator and everyone can readily share the fruits of their discovery efforts." "Research in the sciences has long recognized team efforts. . . . A similar emphasis on collaborative research and writing has not yet made its way into the thinking of humanists." But, of course, what kind of discovery tools would be needed? What kind of search questions should be supported? How can existing resources such as lexicons and ontologies be incorporated into the framework? How can it take advantage of (instead of competing with) existing commercial search utilities? These questions illustrate many of the possible research avenues that could be explored in the development of such an application. Jockers' idea of "macro lit-o-nomics (macro-economics for literature)" [Jockers, 2005] is one approach that has been suggested for developing useful analysis from large datasets; Ruecker and Devereux [Ruecker and Devereux, 2004] and their "Just-in-Time" text analysis is another. In both projects, the researchers showed that interesting conclusions could be drawn by analyzing the large-scale results of automatically-discovered resources and looking at macro-scale patterns of language and thought.

3.3.4 Automatic essay grading

The image of a bleary-eyed teacher, bent over a collection of essays far past her bedtime, is a traditional one. Writing is a traditional and important part of the educational process, but most instructors find the grading of essays to be time-consuming, tedious, and unrewarding. This applies regardless of the subject; essays on Shakespeare are not significantly more fun to grade than essays on the history of colonialism. The essay grading problem is one reason that multiple choice tests are so popular in large classes. We thus have another potential "killer app," an application to handle the chore of grading essays without interfering with the educational process.

Several approaches to automatic essay grading have been tried, with reasonable but not overwhelming success. At a low enough level, essay grading can be done successfully just by looking at aspects of spelling, grammar, and punctuation, or at stylistic continuity [Page, 1994].
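A crude illustration of this low-level, surface-feature approach might look like the sketch below. The feature set and weights are invented placeholders for the purposes of this edition, not Page's actual model; the point is that such a scorer measures fluency proxies while remaining entirely blind to factual accuracy or argument.

import re

def surface_score(essay: str) -> float:
    """Score an essay on surface features only: length, average sentence
    length, and vocabulary variety. The weights are arbitrary illustrations."""
    words = re.findall(r"[A-Za-z']+", essay)
    sentences = [s for s in re.split(r"[.!?]+", essay) if s.strip()]
    if not words or not sentences:
        return 0.0
    n_words = len(words)
    avg_sentence_len = n_words / len(sentences)
    type_token_ratio = len({w.lower() for w in words}) / n_words
    # Arbitrary weighted combination, capped to a 0-100 scale.
    score = 0.02 * n_words + 2.0 * avg_sentence_len + 50.0 * type_token_ratio
    return min(score, 100.0)

print(surface_score("The heart pumps blood. It has four chambers."))

An essay consisting of well-formed nonsense would score as highly as a correct one, which is exactly the limitation that semantic approaches such as the one described next attempt to address.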
Foltz [Foltz et al., 1999] has also shown good results by comparing semantic coherence (as measured, via Latent Semantic Analysis, from word co-occurrences) with that of essays of known quality:

LSA's performance produced reliabilities within the range of their comparable inter-rater reliabilities and within the generally accepted guidelines for minimum reliability coefficients. For example, in a set of 188 essays written on the functioning of the human heart, the average correlation between two graders was 0.83, while the correlation of LSA's scores with the graders was 0.80. . . . In a more recent study, the holistic method was used to grade two additional questions from the GMAT standardized test. The performance was compared against two trained ETS graders. For one question, a set of 695 opinion essays, the correlation between the two graders was 0.86, while LSA's correlation with the ETS grades was also 0.86. For the second question, a set of 668 analysis of argument essays, the correlation between the two graders was 0.87, while LSA's correlation to the ETS grades was 0.86. Thus, LSA was able to perform near the same reliability levels as the trained ETS graders.

Beyond simply reducing the workload of the teacher, this tool has many other uses. It can be used, for example, as a method of evaluating a teacher for consistency in grading, or for ensuring that several different graders for the same class use the same standards. More usefully, perhaps, it can be used as a teaching adjunct, by allowing students to submit rough drafts of their essays to the computer and re-write until they (and the computer) are satisfied. This will also encourage the introduction of writing into the curriculum in areas outside of traditional literature classes, and especially into areas where the faculty themselves may not be comfortable with the mechanics of teaching composition. Research into automatic essay grading is an active area among text categorization scholars and computer scientists for the reasons cited above [Valenti et al., 2003].

From a philosophical point of view, though, it is not clear that this approach to essay grading should be acceptable. A general-purpose essay grader can do a good job of evaluating syntax and spelling, and even (presumably) grade "semantic coherence" by counting whether an acceptable percentage of the words are close enough together in the abstract space of ideas. What such a grader cannot do is evaluate factual accuracy or provide discipline-specific information. Furthermore, the assumption that there is a single grade that can be assigned to an essay, irrespective of context and course focus, is questionable. Here is an area where a problem has already been identified, applications have been and continue to be developed, uptake by a larger community is more or less guaranteed, but the input of humanities specialists is crucially needed to improve the service quality provided.

4 Discussion

The list of problems in the preceding section is not meant to be either exclusive or exhaustive, but merely to illustrate the sort of problems for which killer apps can be designed and deployed. Similarly, the role for humanities specialists to play will vary from project to project – in some cases, humanists will need to play an advisory role to keep a juggernaut from going out of control (as might be needed with the automatic grading), while in others, they will need to create and nurture a software project from scratch.
The list, however, shares enough to illustrate both the underlying concept and its significance. In other words, we have an answer to the question "what?" — what do I mean by a "killer application," what does it mean for the field of digital humanities, and, as I hope I have argued, what can we do to address the perennial problem of neglect by the mainstream.

An equally important question, of course, is "how?" Fortunately, there appears to be a window opening, a window of increased attention and available research opportunities in the digital humanities. The IATH summit cited above [IATH Summit, 2006] is one example, but there are many others. Recent conferences such as the first Text Analysis Developers Alliance (TADA), in Hamilton (2005), the Digital Tools Summit for Linguistics in East Lansing (2006), the E-MELD Workshops (various locations, 2000–6), the Cyberinfrastructure for Humanities, Arts, and Social Sciences workshop at UCSD (2006), and the recent establishment of the Working Group on Community Resources for Authorship Attribution (New Brunswick, NJ; 2006) illustrate that digital scholarship is being taken more seriously. The appointment of Ray Siemens in 2004 as the Canada Research Chair in Humanities Computing is another important milestone, marking perhaps the first recognition by a national government of the significance of Humanities Computing as an acknowledged discipline.

Perhaps most important in the long run is the availability of funding to support DH initiatives. Many of the workshops and conferences described above were partially funded by competitively awarded research grants from national agencies such as the National Science Foundation. The Canadian Foundation for Innovation has been another major source of funding for DH initiatives. But perhaps the most significant development is the new (2006) Digital Humanities Initiative at the (United States) National Endowment for the Humanities. From the website[4]:

NEH has launched a new digital humanities initiative aimed at supporting projects that utilize or study the impact of digital technology. Digital technologies offer humanists new methods of conducting research, conceptualizing relationships, and presenting scholarship. NEH is interested in fostering the growth of digital humanities and lending support to a wide variety of projects, including those that deploy digital technologies and methods to enhance our understanding of a topic or issue; those that study the impact of digital technology on the humanities – exploring the ways in which it changes how we read, write, think, and learn; and those that digitize important materials, thereby increasing the public's ability to search and access humanities information.

[4] http://www.neh.gov/grants/digitalhumanities.html, accessed 6/18/2006.

The list of potentially supported projects is large:
• apply for a digital humanities fellowship (coming soon!)
• create digital humanities tools for analyzing and manipulating humanities data (Reference Materials Grants, Research and Development Grants)
• develop standards and best practices for digital humanities (Research and Development Grants)
• create, search, and maintain digital archives (Reference Materials Grants)
• create a digital or online version of a scholarly edition (Scholarly Editions Grants)
• work with a colleague on a digital humanities project (Collaborative Research Grants)
• enhance my institution's ability to use new technologies in research, education, preservation, and public programming in the humanities (Challenge Grant)
• study the history and impact of digital technology (Fellowships, Faculty Research Awards, Summer Stipends)
• develop digitized resources for teaching the humanities (Grants for Teaching and Learning Resources)

Most importantly, this represents an agency-wide initiative, and thus illustrates the changing relationship between the traditional humanities and digital scholarship at the very highest levels.

Of course, just as windows can open, they can close. To ensure continued access to this kind of support, the supported research needs to be successful. This paper has deliberately set the bar high for "success," arguing that digital products can and should result in substantial uptake and effect significant changes in, as NEH put it, "how we read, write, think, and learn." The possible problems discussed earlier are an attempt to show that we can effect such changes. But the most important question, of course, is "should we?"

"Why?" Why should scholars in the digital humanities try to develop this software and make these changes? The first obvious answer is simply one of self-interest as a discipline. Solving high-profile problems is one way of attracting the attention of mainstream scholars and thereby getting professional advancement. Warwick [Warwick, 2004b] illustrates this in her analysis of the citations of computational methods, and the impact of a single high-profile example. Of all articles studied, the only ones that cited computational methods did so in the context of Don Foster's controversial attribution of "A Funeral Elegy" to Shakespeare.

The Funeral Elegy controversy provides a case study of circumstances in which the use of computational techniques was noticed and adopted by mainstream scholars. The paper argues that a complex mixture of a canonical author (Shakespeare) and a star scholar (Foster) brought the issue to prominence. . . . The Funeral Elegy debate shows that if the right tools for textual analysis are available, and the need for, and use of, them is explained, some mainstream scholars may adopt them. Despite the current emphasis on historical and cultural criticism, scholars will surely return in time to detailed analysis of the literary text. Therefore researchers who use computational methods must publish their results in literary journals as well as those for humanities computing specialists. We must also realize that the culture of academic disciplines is relatively slow to change, and must engage with those who use traditional methods. Only when all these factors are understood and are working in concert, may computational analysis techniques truly be more widely adopted.

Implicit in this, of course, is the need for scholars to find results that are publishable in mainstream literary journals as well as to do the work resulting in publication, the two main criteria of killer apps.
On a less selfish note, the development of killer applications will improve the overall state of scholarship as a whole, without regard to disciplinary boundaries. While change for its own sake may not necessarily be good, solutions to genuine problems usually are. Creating the index to a large document is not fun — it requires days or weeks of painstaking, detailed labor that few enjoy. The inability to find or access needed resources is not a good thing. By eliminating artificial or unnecessary restrictions on scholarly activity, scholars are freed to do what they really want to do — to read, to write, to analyze, to produce knowledge, and to distribute it.

Furthermore, the development of such tools will in and of itself generate knowledge, knowledge that can be used not only to generate and enhance new tools but to help understand and interpret the humanities more generally. Software developers must be long-term partners with the scholars they serve, but digital scholars must also be long-term partners, not only with the software developers, but with the rest of the discipline and its emerging needs. In many cases, the digital scholars are uniquely placed to identify and to describe the emerging needs of the discipline as a whole. With a foot in two camps, the digital scholars will be able to speak to the developers about what is needed, and to the traditional scholars about what is available as well as what is under development.

5 Conclusion

Predicting the future is always difficult, and predicting the effects of a newly-opened window is even more so. But recent developments suggest that digital humanities, as a field, may be at the threshold of a new series of significant developments that can change the face of humanities scholarship and allow the "emerging discipline of humanities computing" finally to emerge. For the past forty years, humanities computing has more or less languished in the background of traditional scholarship. Scholars lack incentive to participate in (or even to learn about) the results of humanities computing. This paper argues that DH specialists are placed to create their own incentives by developing applications with sufficient scope to materially change the way humanities scholarship is done. I have suggested four possible examples of such applications, knowing well that many more are out there. I believe that by actively seeking out and solving such Great Problems – by developing such killer apps – scholarship in general and digital humanities in particular will be well served.

References

[Foltz et al., 1999] Foltz, P. W., Laham, D., and Landauer, T. K. (1999). Automated essay scoring: Applications to educational technology. In Proceedings of EdMedia '99.
[Gibson, 2005] Gibson, M. (2005). Clotel: An electronic scholarly edition. In Proceedings of ACH/ALLC 2005, Victoria, BC, Canada. University of Victoria.
[IATH Summit, 2006] IATH Summit (2006). Summit on digital tools for the humanities: Report on summit accomplishments.
[Jockers, 2005] Jockers, M. (2005). XML aware tools — catools. Presentation at Text Analysis Developers Alliance, McMaster University, Hamilton, ON.
[Juola, 2005] Juola, P. (2005). Towards an automatic index generation tool. In Proceedings of ACH/ALLC 2005, Victoria, BC, Canada. University of Victoria.
[Lukon and Juola, 2006] Lukon, S. and Juola, P. (2006). A context-sensitive computer-aided index generator. In Proceedings of DH 2006, Paris. Sorbonne.
[Martin, 2005] Martin, S. (2005). Reaching out: What do scholars want from electronic resources? In Proceedings of ACH/ALLC 2005, Victoria, BC, Canada. University of Victoria.
[Page, 1994] Page, E. B. (1994). Computer grading of student prose using modern concepts and software. Journal of Experimental Education, 62:127–142.
[Ruecker and Devereux, 2004] Ruecker, S. and Devereux, Z. (2004). Scraping Google and Blogstreet for Just-in-Time text analysis. Presented at CaSTA-04, The Face of Text, McMaster University, Hamilton, ON.
[Siemens et al., 2004] Siemens, R., Toms, E., Sinclair, S., Rockwell, G., and Siemens, L. (2004). The humanities scholar in the twenty-first century: How research is done and what support is needed. In Proceedings of ALLC/ACH 2004, Gothenburg. U. Gothenburg.
[Toms and O'Brien, 2006] Toms, E. G. and O'Brien, H. L. (2006). Understanding the information and communication technology needs of the e-humanist. Journal of Documentation, (accepted/forthcoming).
[USNews, 2006] USNews (2006). U.S. News and World Report: America's best graduate schools (social sciences and humanities).
[Valenti et al., 2003] Valenti, S., Neri, F., and Cucchiarelli, A. (2003). An overview of current research on automated essay grading. Journal of Information Technology Education, 2:319–330.
[Warwick, 2004a] Warwick, C. (2004a). No such thing as humanities computing? An analytical history of digital resource creation and computing in the humanities. In Proceedings of ALLC/ACH 2004, Gothenburg. U. Gothenburg.
[Warwick, 2004b] Warwick, C. (2004b). Whose funeral? A case study of computational methods and reasons for their use or neglect in English studies. Presented at CaSTA-04, The Face of Text, McMaster University, Hamilton, ON.
Pre-print of: Dunn, S., & Hedges, M. (2013). Crowd-sourcing as a Component of Humanities Research Infrastructures. International Journal of Humanities and Arts Computing, 7(1), 147-169. https://doi.org/10.3366/ijhac.2013.0086
Crowd-sourcing as a Component of Humanities Research Infrastructures

Stuart Dunn, Mark Hedges
Centre for e-Research, Department of Digital Humanities, King's College London, 26-29 Drury Lane, London, UK
mark.hedges@kcl.ac.uk, stuart.dunn@kcl.ac.uk

Abstract: Crowd-sourcing, the process of leveraging public participation in or contribution to a project or activity, is relatively new to academic research, but is becoming increasingly important as the Web transforms collaboration and communication and blurs the boundaries between the academic and non-academic worlds. At the same time, digital research methods are entering the mainstream of humanities research, and there are a number of initiatives addressing the conceptualisation and construction of research infrastructures for the humanities. This paper examines the place of crowd-sourcing activities within such initiatives, presenting a framework for describing and analysing academic humanities crowd-sourcing, and using this framework of 'primitives' as a basis for exploring potential relationships between crowd-sourcing and humanities research infrastructures.

Keywords: crowd-sourcing, research infrastructures, citizen science, scholarly primitives, typology.

Introduction

Crowd-sourcing,[1] the process of leveraging public participation in or contribution to a project or activity, is relatively new to academic research, and even more so to the humanities. However, at a time when the Web is transforming the way in which people collaborate and communicate, and is blurring boundaries between the spaces inhabited by the academic and non-academic worlds, it has never been more important to examine the role that public communities are beginning to play in academic humanities research. At the same time, digital research methods are starting to enter the mainstream of humanities research, and there are a number of initiatives addressing the conceptualisation and construction of research infrastructures that would support a shift from ad hoc projects and centres to an environment that is more integrated and sustainable.
Such an environment will inevitably be distributed, integrating knowledge, services and people in a loosely-coupled, collaborative 'digital social marketplace'.[2] The question naturally arises as to where crowd-sourcing activities fit within this framework. More specifically, what contributions can public participants, and the communities to which they belong, make to a humanities research infrastructure, and conversely how can these participants and communities, and the academic researchers who make use of the knowledge and effort that they contribute, benefit from such participation? To begin to address these questions is one of the aims of this paper.

The paper is organised as follows: we begin by describing the context in which the work was carried out, and the methodology used. We then review a number of existing terminologies and typologies for crowd-sourcing and related concepts, and follow this with an analysis of the main motivations for engaging with crowd-sourcing, from both the volunteer's and the academic's points of view. Finally, we build upon this by presenting the outline of a framework for describing and analysing academic humanities crowd-sourcing projects, and use this framework of 'primitives' as a basis for exploring the potential relationships between various forms of crowd-sourcing activity and humanities research infrastructures.

Background and Methodology

The research described in this paper was mostly carried out as part of the Crowd-sourcing Scoping Study project (Ref. AH/J01155X/1), which ran for nine months from February-November 2012, and was funded by the Arts and Humanities Research Council as part of its Connected Communities programme. The study's methodology had four main components:
• a literature review covering academic humanities research that has incorporated crowd-sourcing, research into crowd-sourcing as a method, and less formal outputs such as blogs and project websites.
• two workshops facilitating discussion between, respectively, humanities academics who have used crowd-sourcing, and contributors to crowd-sourcing projects;
• an online survey of contributors to crowd-sourcing projects, exploring their backgrounds, histories of participating in such projects, and motivations for doing so;
• interviews with academics and contributors.

The study does not claim to be comprehensive: there are bound to be important projects, publications, individuals and activities that have been omitted, and there is a strong UK and Anglophone focus on the activities studied. In particular, while the survey was widely publicised, it was self-selecting and makes no claim to being statistically representative; it functioned rather as a means of gathering qualitative information about contributors' backgrounds and motivations.

Crowd-sourcing and related concepts

The term crowd-sourcing was coined in a Wired article by Jeff Howe,[3] in which he draws a parallel between reducing labour costs by outsourcing to cheaper countries, and utilising 'the productive potential of millions of plugged-in enthusiasts'. In an academic context, the term has developed from an economic focus to an information focus, in which this productive potential is used to achieve research aims. However, the term is problematic and requires further analysis. It is first necessary to distinguish crowd-sourcing from some related concepts.
It is broader and less easy to define than 'citizen science', which is commonly understood to refer to activities whereby members of the public undertake well-defined and (individually) small-scale tasks as part of larger-scale scientific projects.[4] Another related concept is the 'Wisdom of Crowds',[5] which holds that large-scale collective decision-making can be superior to that of individuals, even experts. Although academic crowd-sourcing can be about decision-making, the decisions involved are rarely as neatly packageable as those implied in the world of business, where the 'good' or 'bad' nature of a decision can be evaluated on the basis of profitability.[6] Such collective decision-making also lacks the elements of collaboration around activities conceived and directed for a common purpose that characterise crowd-sourcing as commonly understood.

Another important distinction is that between crowd-sourcing and 'social engagement'.[7] According to Holley, social engagement involves 'giving the public the ability to communicate with us and each other', and is 'usually undertaken by individuals for themselves and their own purposes', whereas crowd-sourcing 'uses social engagement techniques to help a group of people achieve a shared, usually significant, and large goal by working collaboratively together as a group'. Holley also notes that crowd-sourcing is likely to involve more effort, and implies a level of commitment and participation that goes beyond casual interest, whereas social engagement is an extension of the kinds of online activities – Tweeting, commenting – that millions do on a daily basis anyway.

In one way, this aligns crowd-sourcing with 'citizen science'. Indeed, Wiggins and Crowston develop this theme by highlighting a distinction between citizen science and community science, and stating as a key ingredient of the former that it is not self-organising and 'does not represent peer production ... because the power structure of these projects is usually hierarchical'.[8] A fundamental aspect of citizen science is thus that the goal is defined by a particular person or group (almost always as part of a professional academic undertaking), and the participants (recruited through an open call) provide some significant effort towards achieving that goal. However, the different intellectual traditions of the sciences and the humanities embrace, and are embraced by, different kinds of non-academic community. Indeed, as Trevor Owens has noted, most successful crowd-sourcing activities in the humanities and cultural sectors are not really about crowds at all, in the sense of 'large anonymous masses of people', but are about 'participation from interested and engaged members of the public'.[9] While a crowd-sourcing project may have the capacity for involving large numbers of people, in many cases only a few contributors end up being actively engaged, and these contribute a large percentage of the work. While there may be a centralised recruitment process, at this level the body of contributors is self-organising and self-selecting.

A number of attempts have been made to identify the key characteristics, or to formulate a typology, of crowd-sourcing and related activities.
Estellés-Arolas and González-Ladrón-de-Guevara identify eight characteristics, distilled from 32 distinct definitions identified in the literature: the crowd; the task at hand; the recompense obtained; the crowdsourcer or initiator of the crowdsourcing activity; what is obtained by the crowdsourcing process; the type of process; the call to participate; and the medium.[10] This extremely processual definition is comprehensive in identifying stages that map easily to business processes. For the humanities, the 'type of process' is both more significant and more problematic, given the great diversity of processes in the creation of humanities research material.

A more task-oriented approach is taken by Wiggins and Crowston,[11] who construct a typology for 'citizen science' activities, identifying five areas of application: Action, Conservation, Investigation, Virtual, and Education. The factors that lead to an activity being assigned to a category are multivariate, and the identification of the categories was based on whether there is an occurrence in a category or not, rather than frequency of those occurrences. The coverage is therefore extremely broad; 'Action', for example, covers self-organising citizen groups that use web technologies to achieve a common purpose, often to do with campaigns on local issues. Moreover, the use of the word 'science' (at least in the usual Anglophone sense) confines the activities reviewed (in terms of both the methods and the content) to a particular epistemic bracket, which inevitably excludes some aspects of humanities research.

One widely-quoted set of definitions for citizen science projects was presented by Bonney et al.[12] This divided the field into three broad categories: contributory projects, in which members of the public, via an open call, contribute along lines that are tightly defined and directed by scientists; collaborative projects, which have a central design but to which members of the public contribute data, and may also help to refine project design, analyze data, or disseminate findings; and co-created projects, which are designed by scientists and members of the public working together and for which at least some of the public participants are actively involved in the scientific process. This approach shares important characteristics with the 'task type' described below, in that it is rooted in the complexity of the task, and the amount of initiative and independent analysis required to make a contribution.

The Galleries, Libraries, Archives and Museums (hereafter GLAM) sectors have in particular seen efforts to develop crowd-sourcing typologies. One such typology has been proposed by Mia Ridge in a blog post,[13] and includes the following categories: Tagging, Debunking (i.e. correcting/reviewing content), Recording a personal story, Linking, Stating preferences, Categorizing, and Creative responses. Again, these categories imply a processual approach, concerning the type of task being carried out, and are potentially extensible across different types of online and physical content and collections. Another typology from the GLAM domain was developed by Oomen and Aroyo.[14]
Their categories include Correction and Transcription, defined as inviting users to correct and/or transcribe outputs of digitisation processes (a category that Ridge's 'Debunking' partially, but not entirely, covers); Contextualisation, or adding contextual knowledge to objects, by constructing narratives or creating User Generated Content (UGC) with contextual data; Complementing Collections, which is the active pursuit of additional objects to be included in a collection; Classification, defined as the gathering of descriptive metadata related to objects in a collection (Ridge's 'Tagging' is a subset of this); Co-curation, which is using inspiration/expertise of non-professional curators to create (Web) exhibits (somewhat analogous to the co-created projects of Bonney et al., but more task-oriented); and Crowdfunding, or the collective cooperation of people who pool their money and other resources together to support efforts initiated by others.[15] Ridge explicitly rejects crowdfunding as a component of crowd-sourcing.[16]

These typologies from the GLAM world perhaps represent best the different crowd-sourcing activities examined by the study, although such lists of categories do not reflect fully the complexity of the situations encountered. Instead, we propose a typology that is orientated along four distinct, although interdependent, facets, as described in 'Crowd-sourcing and research infrastructures' below.

Motivations

Motivations of participants

Overview

Most studies have concluded that crowd-sourcing contributors typically do not have a single motivation; our own survey indicated overwhelmingly (79%) that the contributors who responded have both personal and altruistic motivations. However in many cases it is possible to identify a dominant motivating factor, which is almost always concerned directly with the activity's subject area. In an analysis of 207 forum posts and interview responses for example, the Galaxy Zoo project found that the top motivations were an interest in astronomy (39%), a desire to contribute (13%) and a concern with the vastness of the universe (11%).[17] A study of volunteers for the Florida Fish and Wildlife Conservation Commission's Nesting Beach Survey found that concern for turtle conservation was the overwhelming motivating factor.[18] Moreover, studies of the motivations of the contributors to academic crowd-sourcing projects have emphasised personal interest in the subject area concerned, and the opportunities provided to exercise that interest and to engage with people who share it, without material benefit. Such interest is usually concerned with the outcome, but it can also be in the process, or some combination of both. For example, in her 2009 assessment of volunteers to the TROVE project, Holley notes that 'a large proportion was family history researchers', who were highly motivated and had 'a sense of responsibility towards other genealogists to help not only themselves but other people where possible'.[19] In general, it may be said that research into crowd-sourcing motivations suggests a clear primary, although not exclusive, focus on the subject or activity area, and that motivations can be personal or altruistic, and extrinsic or intrinsic.
Rewards

For the most part, crowd-sourcing projects do not reward their contributors directly in material or professional terms, and conversely contributors to crowd-sourcing projects are not subject to discipline (in either sense) or sanction in the way that members of conventionally-configured research projects are. Indeed, it is clear that the motivations of participants in academic crowd-sourcing tend to be intrinsic to the activity. However, we may regard more indirect benefits as constituting a form of reward: the fulfilment of an interest in the subject; personal gains such as skills, experience or knowledge; some form of status; or a feeling of gratification.

In our survey, contributors mentioned a number of skills gained, including general IT competencies, such as editing wikis and using Skype for distributed collaboration, as well as specialised skills such as TEI encoding. Many contributors gained domain knowledge, for example through the opportunity to edit historical documents (ships' histories) resulting from participation in the Old Weather project. This project showed that the domain interests of the participants can differ from those of the project team, which in this case is solely interested in those parts of the documents being transcribed that relate to climate history,[20] whereas several contributors became interested in the histories of individual ships, and in addressing niches of history that had been hitherto unexplored. Participants can also pick up a basic grounding in research methods of collation, synthesis and analysis in the area of interest to them.

Less concrete benefits also function as rewards. It was frequently noted that some form of 'feedback loop', through which a participant is informed that their contributions were correct and valuable, is a very important motivating factor for engaging with crowd-sourcing projects, and conversely that a lack of feedback can be very frustrating and discouraging to the participant. Feedback also plays a key role in building a sense of community, and making participants feel that they have a stake in the project. For complex tasks, feedback may also be a necessary part of improving volunteers' work practices, as in Transcribe Bentham.[21] This feedback can be immediate and specific to an individual contribution – for example, participants in the British Library's Georeferencer project (BLG)[22] could see the results of their work immediately – or it can be deferred and cumulative, for example by means of rankings. Contributors may receive various 'social' rewards, for example through rankings, increased standing in the crowd-sourcing community, or (in the case of Galaxy Zoo) being credited and named in publications. Similarly, contributors may be subjected to social sanctions, such as banning (e.g. removal of pages or blocking of accounts on Wikipedia), which can adversely affect their reputation and enjoyment, and may even in rare cases reflect on their professional standing.

As well as simple feedback interactions between the project and an individual user, the ability to interact with other participants, for example via a project forum, is an extremely important motivation. Such project-based social networks are used both for 'exchanging chit-chat' and for discussing and sharing information on the practical and technical issues raised, and can foster a sense of community among the participants that can extend beyond the immediate activities of the project itself.
A good example of this is the Old Weather forum,[23] which contains exchanges among participants that are indicative of a high degree of collaborative, communal working in addressing problems that arise during the process. The importance of forums was also noted by participants in Transcribe Bentham and British Library Georeferencer.

Gamification

Some approaches have emphasised the importance of tasks being enjoyable, and have focused on the development of games for crowd-sourcing of different kinds. Prestopnik and Crowston discuss the role of games, and in particular possible approaches to creating an application for crowd-sourced natural history taxonomy classification using design science.[24] The Bodiam Castle project provides an example of the potential for games in the context of archaeological analysis of buildings, although this had a greater emphasis on visualisation than on competition.[25] However, Prestopnik and Crowston also note that 'gamification' can act as a disincentive to contributors who have expert knowledge or deep interest in the subject.[26] Gamification can also be a barrier for users who simply want to engage with the assets or processes in question, and can trivialise the process of acquiring or processing data.[27] In their analysis of The Bird Network project, in which participants gathered data about the use of bird-boxes by birds and shared it with the scientific team, Brossard et al. note that participants' interest in ornithology was likely to overshadow awareness of scientific process,[28] and thus stymie efforts by the Lab to contribute to scientific awareness and education.[29]

Competition

Although very few participants in our survey admitted to being motivated by competition with each other, among those who attended our workshop, competition featured strongly as a factor, although this should be qualified by the fact that those present tended to be 'super contributors', who are likely to feel more competitive than those in the 'long tail' of the crowd. For many projects it is possible to track individual participants' contributions and to acquire statistics on contributions, and in such cases projects can establish 'leader boards' indicating which participants have made the biggest contributions (in whatever terms the project is using). For example, the British Library's Georeferencer project displayed the handles of the users who processed the most maps, and the 'winner' was invited to meet the Library's head of cartography. The Old Weather project also encouraged competition by assigning roles to contributors based on the number of pages transcribed. However, in order for competition to be a significant motivating factor, the tasks and their outcomes must be sufficiently quantifiable to allow mutual comparison; matters can become complex when tasks are not comparable directly. For example, in BLG some maps were more complex than others, and the team felt that this affected the meaningfulness of comparing the effort needed to georeference them. Where more creative or interpretive outputs are being created, this lack of commensurability is a still greater issue, and there may even be conflicts between outputs; simple rankings seem inappropriate to such scenarios. In any case, the encouragement of competition should not be at the cost of alienating potential participants who are not by nature competitive, nor of favouring speed and volume at the expense of quality and care.
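One way of mitigating the commensurability problem just described is to weight contributions by task difficulty rather than counting raw totals. The sketch below is a minimal illustration of that idea only; the weighting scheme and names are assumptions of this edition, not a description of how BLG or Old Weather actually computed their rankings.

from collections import defaultdict

def leaderboard(contributions):
    """Rank volunteers by difficulty-weighted contribution totals.

    `contributions` is a list of (volunteer, task_difficulty) pairs, where
    difficulty might reflect, e.g., the complexity of a map being
    georeferenced. Raw counts reward speed and volume; weighting gives
    credit for harder tasks.
    """
    totals = defaultdict(float)
    for volunteer, difficulty in contributions:
        totals[volunteer] += difficulty
    return sorted(totals.items(), key=lambda item: item[1], reverse=True)

log = [("anna", 1.0), ("anna", 3.5), ("ben", 1.0), ("ben", 1.0), ("ben", 1.0)]
for rank, (name, score) in enumerate(leaderboard(log), start=1):
    print(rank, name, round(score, 1))

Here "anna" outranks "ben" despite completing fewer items, because one of her tasks was substantially harder; even so, such a scheme still captures only quantity and difficulty, not quality of work.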
Indeed, competition can be defined not just in this quantitative sense; volunteers may compete to produce more high-quality work, although in the absence of metrics this can amount to competing only against oneself. Note also that competition is not incompatible with a sense of common purpose; for example, Old Weather participants often 'feel like part of the ship' on which they are working.

Motivations of academics

At least part of the success of Galaxy Zoo and other Zooniverse projects is that they catered to clear and present academic needs. In the case of Galaxy Zoo itself, the assets – photographs of galaxies – were far too numerous to be examined individually by any research team, and the task – the classification of those galaxies – was not one that could be performed by computer software, although for the most part could be carried out by a person without specialist expertise.[30] Quite simply, this is work that could not have been carried out without large-scale public engagement and participation.

Most cases where humanities academics have engaged with crowd-sourcing have been driven by specific research questions or the need for a particular resource. For example, the Transcribe Bentham project was motivated by the fact that 40,000 folios of Bentham's work were untranscribed, and thus these valuable primary sources were inaccessible to people researching eighteenth or nineteenth century thought.[31] BLG was motivated by the desire to make its map collections more searchable and thus more exploitable. In Old Weather, researchers were motivated by the desire to be able to use information contained within the assets to explore historic weather patterns, although these motivations may not necessarily be shared by the participants.[32] Although the research motivations are various, the key characteristic leading the project to use crowd-sourcing is that each involves tasks that a computer could not carry out, and that a research team could do only with prohibitively large resources. Note, however, that during the initial six-month testing period of the project, the rate of volunteer transcription compared unfavourably with that of professional researchers,[33] possibly due to the complexity of the material and the difficulty of Bentham's handwriting. There was also an extremely high moderation overhead, with significant staff time needed to validate the outputs and provide feedback to the contributors. Since then, the volunteer transcription rate has improved significantly, so there is potential for avoiding significant costs in the future.[34] However, this example can serve as a warning against assumptions that crowd-sourcing provides free labour.

Other researchers, particularly those in the GLAM sector, see crowd-sourcing as a means of filling gaps in the coverage of their collections,[35] as it can be an effective way of obtaining information about assets (or the assets themselves) to which only certain members of the public have access, for example through personal or family connections. However, in order to be usable for academic purposes, a degree of curation is required, and this may involve expert input. It is clear that public engagement and community building is frequently an unintentional by-product of crowd-sourcing projects. In some cases it is seen as an explicit motivation, with the aim of encouraging public engagement with scholarly archives and research, and thus increasing the broader impact of academic research activities.[36]
Crowd-sourcing and research infrastructures

A conceptual framework for crowd-sourcing

One of the outcomes of our study is a typology for crowd-sourcing in the humanities, which brings together the earlier work cited in Section 2 with the experiences and processes uncovered during the study. It does not seek to provide an alternative set of categories specifically for the humanities, in competition with those considered above. Rather, we propose a model for describing and understanding crowd-sourcing projects in the humanities by analysing them in terms of four key facets – asset type, process type, task type, and output type – and of the relationships between them, and in particular by observing how the applicable categories in one facet are dependent on those in other facets. The four facets and their interactions may be summarised as follows:

• A process is composed of tasks through which an output is produced by operating on an asset. It is conditioned by the kind of asset involved, and by the questions that are of interest to project stakeholders (both organisers and volunteers) and can be answered, or at least addressed, using information contained in the asset.
• An asset refers to the content that is, in some way, transformed as a result of processing by a crowd-sourcing activity.
• A task is an activity that a project participant undertakes in order to create, process or modify an asset (usually a digital asset). Tasks can differ significantly as regards the extent to which they require initiative and/or independent analysis on the part of the participant, and the difficulty with which they can be quantified or documented. The task types were identified with the aim of categorising this complexity, and are listed below in approximately increasing order.
• The output is what is produced as the result of applying a process to an asset. Outputs can be tangible and/or measurable, but we make allowance also for intangible outcomes, such as awareness or knowledge.

The study identified a set of categories under each facet; these are based for the most part on an examination of existing crowd-sourcing practice, so it is to be expected that the lists will be extended and/or challenged by future work. Detailed descriptions of each category may be found in the report by Dunn and Hedges;[37] in the rest of this paper, we examine the framework specifically in relation to humanities research infrastructures.

From crowd-sourcing primitives to research infrastructures

Rather than attempting to map the elements of this crowd-sourcing framework to specific infrastructures or infrastructural components, we note instead that it may be thought of as a framework of 'primitives', in a sense analogous to that of 'scholarly primitives'. Scholarly primitives may be defined as 'basic functions common to scholarly activity across disciplines',[38] and they provide a conceptual framework for classifying scholarly activities. Given the diversity of humanities research, it is not surprising that there are various sets of candidates – in addition to Palmer et al. there are, for example, Unsworth,[39] Benardou et al.[40] and Anderson et al.[41] – and such a structure has in particular been used as a framework for conceptualising and developing infrastructure for supporting humanities research.[42]
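As a way of suggesting how such a framework of primitives might be operationalised in an infrastructure context, the sketch below models a crowd-sourcing activity as a combination of the four facets. The class design and the example values are illustrative assumptions of this edition only; they abbreviate, rather than reproduce, the study's typology.

from dataclasses import dataclass
from typing import List

@dataclass
class Process:
    name: str            # e.g. "transcribing", "collaborative tagging"
    tasks: List[str]     # the task types the process is composed of

@dataclass
class CrowdsourcingActivity:
    """Describes an activity in terms of the four facets: a process,
    composed of tasks, produces an output by operating on an asset."""
    asset_type: str      # what is transformed, e.g. "ships' log-books"
    process: Process
    output_type: str     # e.g. "transcribed text", "enhanced text"

# Old Weather, described (approximately) in these terms:
old_weather = CrowdsourcingActivity(
    asset_type="digitised ships' log-books",
    process=Process(name="transcribing",
                    tasks=["reading handwriting", "entering observations"]),
    output_type="transcribed text",
)
print(old_weather.process.name, "->", old_weather.output_type)

A shared, machine-readable description of this kind is the sort of thing an infrastructure could use to register, discover and compose crowd-sourcing services, which is one reading of what it means to treat the facets as primitives.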
From crowd-sourcing primitives to research infrastructures

Rather than attempting to map the elements of this crowd-sourcing framework to specific infrastructures or infrastructural components, we note instead that it may be thought of as a framework of ‘primitives’, in a sense analogous to that of ‘scholarly primitives’. Scholarly primitives may be defined as ‘basic functions common to scholarly activity across disciplines’, 38 and they provide a conceptual framework for classifying scholarly activities. Given the diversity of humanities research, it is not surprising that there are various sets of candidates – in addition to Palmer et al. there are, for example, Unsworth, 39 Benardou et al. 40 and Anderson et al. 41 – and such a structure has in particular been used as a framework for conceptualising and developing infrastructure for supporting humanities research. 42 The process facet in particular may be regarded as providing a set of primitives in this sense, and the output type ‘composite digital collection with multiple meanings’ may in particular be regarded as a form of humanities ‘research object’, in the sense used by Bechhofer et al. 43 and Blanke and Hedges. 44 Of course, the categorisation into primitives described above is quite different to those in the works cited; this is only to be expected, as it represents the activities of quite different stakeholders, namely interested members of the public rather than professional scholars (although of course one person can play different roles in different circumstances). In particular, there is a greater emphasis on creating or enhancing digital assets in some way, rather than using these assets in research (although again these activities can overlap). For the remainder of this paper, we will look in more detail at each of the process types in turn, using specific examples examined by the study, with a view to seeing how crowd-sourcing can contribute effectively to humanities research infrastructures.

COLLABORATIVE TAGGING

Collaborative tagging may be regarded as crowd-sourcing the organisation of information assets by allowing users to attach tags to those assets. Tags can be based on existing controlled vocabularies, but are more usually derived from free text supplied by the users themselves. Such ‘folksonomies’ are distinguished from deliberately designed knowledge organisation systems by the fact that they are self-organising, evolving and growing as contributors add new terms. It is possible to extract more formal vocabularies from folksonomies. 45 Collaborative tagging can result in two concrete outcomes: it can make a corpus of information assets searchable using keywords applied by the user pool, and it can highlight assets that have particular significance, as evidenced by the number of repeat tags they are accorded by the pool. Research in this area has examined the patterns and information that can be extracted from folksonomies. Golder found that patterns generated by collaborative tagging are, on the whole, extremely stable, meaning that minority opinions can be preserved alongside more highly replicated, and therefore mainstream, concentrations of tags. 46 Other research has shown that user-assigned tags in museums may be quite different from vocabulary terms assigned by curators, and that relating tags to controlled vocabularies can be very problematic, 47 although it could be argued that this allows works to be addressed from a different perspective than that of the museum’s formal documentation. In any case, such approaches to knowledge organisation are likely to play a significant part in the organisation of humanities data in the future. An example is the BBC’s YourPaintings project, 48 developed in collaboration with the Public Catalogue Foundation, which has amassed a collection of photographs of all paintings in public ownership in the UK. The public is invited to apply tags to these, which both improves discovery and enables the creation of an aggregation of specialised knowledge. A more complex example is provided by the Prism project. 49 Collaborative tagging typically assumes that the assets being tagged are themselves stable and clearly identifiable as distinct objects; Prism, by contrast, allowed readers to highlight significant areas within a text and apply tags to them, and thus build up a collective interpretation of the text. Unlike many humanities crowd-sourcing activities, such as transcribing texts according to well-defined procedures, which have identifiable completions, interpretation can go on indefinitely, and there are no right or wrong answers.
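The two concrete outcomes described above, keyword search and salience through repeat tagging, can be illustrated in a few lines of code. This is a generic sketch of the technique, not the implementation of YourPaintings or any other project, and the tag data is invented:

```python
from collections import Counter, defaultdict

# Invented (asset_id, tag) pairs of the kind a tagging project collects.
tags = [
    ("painting-17", "ship"), ("painting-17", "storm"), ("painting-17", "ship"),
    ("painting-42", "portrait"), ("painting-42", "dog"), ("painting-17", "ship"),
]

# Outcome 1: an inverted index makes the corpus keyword-searchable.
index = defaultdict(set)
for asset, tag in tags:
    index[tag].add(asset)
print(index["ship"])            # {'painting-17'}

# Outcome 2: repeat tags signal assets of particular significance to the pool.
salience = Counter(tags)
print(salience.most_common(1))  # [(('painting-17', 'ship'), 3)]
```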
LINKING

Linking covers the identification and documentation of relationships (usually typed) between individual assets. Most commonly, this takes the form of linking via semantic tags, where the tags describe binary relationships, in which case it is analogous to collaborative tagging. In principle, this could also include the identification of n-ary relationships.

TRANSCRIBING

Transcribing is currently one of the most prominent areas of humanities crowd-sourcing, as it can be used to address a fundamental problem with digitisation, namely the difficulty of rendering handwriting into machine-readable form using current technology. Typically, such transcription requires the human eye and, in many cases, human interpretation. In terms of our typology, the output of a transcribing process will typically be transcribed text. Two projects have contributed significantly to this prominence: Old Weather (OW) and Transcribe Bentham (TB). OW involved the transcription of ships’ log-books held by The National Archives, in order to obtain access to the weather observations they contain, information that is of major significance for climate research. 50 TB encouraged volunteers to transcribe and engage with unpublished manuscripts by the philosopher and reformer Jeremy Bentham, by rendering them into text marked up using TEI XML. 51 The collaborative model needed for successful crowd-sourced transcription depends on the complexity of the source material. Complex material, such as that in these two cases, requires a high level of support, whether from the project team or a participant’s peers. Simpler material is likely to require less support; for example, when transcribing the more structured data found in family records, 52 the information (text or integers) to be transcribed is presented to the user in small segments – e.g. names, dates, addresses – and transcription requires different cognitive processes that are less dependent on interaction with peers and experts. Note that this category includes marked-up transcriptions, e.g. using TEI XML, as well as simple transcription of characters. There will be a point, however, at which the addition of semantic mark-up goes beyond mere transcription and counts as a form of collaborative tagging or linking; in such cases the output will typically be enhanced text.

CORRECTING/MODIFYING CONTENT

While content is increasingly ‘born digital’, projects for digitising analogue material abound. Many mass-digitisation technologies, such as Optical Character Recognition (OCR) and speech recognition, can be error-prone, and any such enterprise needs to factor in quality control and error correction, which can make use of crowd-sourcing. The TROVE project, which produced OCR-ed scans of newspapers from the National Library of Australia, is an excellent example of this. 53 The volume of digitised material precluded the corrections being undertaken by the Library’s own staff, and using uncorrected text would have significantly reduced the benefits of digitisation, as search capability would have been very restricted. Another potential application in this category is the correction of automated transcriptions of recorded speech, as such transcription is currently highly error-prone, with error rates of 30% or more. 54
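One standard way of reconciling corrections from many volunteers is redundancy with a majority vote: collect several independent corrections of the same passage and accept the reading most of them agree on. This is a generic quality-control strategy, not TROVE’s documented workflow; the OCR line and the submitted corrections below are invented:

```python
from collections import Counter

def aggregate(corrections):
    """Pick the majority reading for one OCR line; return None on a tie."""
    counts = Counter(corrections)
    (best, n), *rest = counts.most_common()
    if rest and rest[0][1] == n:   # no clear majority: escalate to a moderator
        return None
    return best

# Three volunteers independently correct the same garbled line.
ocr_line = "Tlie ship arrived iu port"
submitted = [
    "The ship arrived in port",
    "The ship arrived in port",
    "The ships arrived in port",
]
print(aggregate(submitted))   # "The ship arrived in port"
```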
RECORDING AND CREATING CONTENT

Processes in this category frequently deal with ephemera and intangible cultural heritage. The latter covers any cultural manifestation that does not exist in tangible form; typically, crowd-sourcing is used to document such heritage through a set of processes and tasks, resulting in some form of tangible output. The importance of preserving intangible cultural heritage has been recognised by UNESCO, 55 and the ways in which it can be documented and curated by distributed communities are an important area for future research. Frequently this takes the form of a cultural institution soliciting memories from the communities it serves, for example the Tenbury Wells Regal Cinema’s Memory Reel project. 56 Such processes can incorporate a form of editorial control or post hoc digital curation, and their outputs can be edited into more formal publications. Another example is the Scottish Words and Place-names (SWAP) project, 57 which gathered words in Scots, determining which words were in current use and where/how they were used, with the ultimate aim of offering selected words for inclusion in the Scottish Language Dictionaries resource. 58 Candidate words were gathered via the project website as well as via social media – Facebook in particular was an important venue for developing conversations around the material – and words that the project felt were suitable were passed to lexicographers for further scrutiny. By ephemera, we understand cultural objects that are tangible, but are at risk of loss because of their transitory nature, for example home videos or personal photographs. 59 There are a number of projects addressing such assets, for example the Europeana 1914-1918 project, 60 which is collecting digitised personal artefacts relating to the First World War. The ubiquity of the Web, and of access to content creation and digitisation technologies, has led to the creation of non-professionally curated online archives. These have a clear role to play in enriching, augmenting and complementing collections held by memory institutions, and in developing curatorial narratives independent from those of library and archive professionals. 61 Processes in this category are also likely to have elements of the ‘social engagement’ model, in terms of Holley’s distinction. 62

COMMENTING, CRITICAL RESPONSES AND STATING PREFERENCES

Processes of this type are likely to count as crowd-sourcing only if there is some specific purpose around which people come together. One example of this is the Shakespeare’s Global Communities project, 63 which captured audience responses to the 2012 World Shakespeare Festival, with the aim of investigating how ‘social networking technologies reshape the ways in which diverse global communities connect with one another around a figure such as Shakespeare’. 64 The question provides a focus for the activity, which, although not itself producing an academic output, provides a dataset for addressing research questions on the modern reception of Shakespeare. Appropriately managed blogs can provide a platform for focused scholarly interactions of this type. For example, a review by Sonia Massai of King Lear on the Year of Shakespeare site attracted controversial responses, leading to an exchange about critical methods as well as content. 65 What differentiates such exchanges from amateur blogging is the scholarly focus and context provided by the project, and its proactive directing of content creation.
The project thus provides a tangible link between the crowd and the subject.

CATEGORISING

Categorising involves assigning assets to predefined categories; it differs from collaborative tagging in that the latter is unconstrained.

CATALOGUING

Cataloguing – or the creation of structured, descriptive metadata – is a more open-ended process than categorising, but is nevertheless constrained to following accepted metadata standards and approaches. It frequently includes categorising as a sub-activity, e.g. classification by Library of Congress subject headings. Cataloguing is a time- and resource-consuming process for many GLAM institutions, and crowd-sourcing has been explored as a means of addressing this. For example, the What’s the Score project at the Bodleian investigated a cost-effective approach to increasing access to music scores from its collections through a combination of rapid digitisation and crowd-sourced descriptive metadata. 66 Cataloguing is related to contextualising, as ordering, arraying and describing assets will also make explicit some of their context.

CONTEXTUALISING

Contextualising is typically a more broadly conceived activity than the related process types of cataloguing or linking, and it involves enriching an asset by adding to it, or associating with it, other relevant information or content.

GEOREFERENCING

Georeferencing is the process of establishing the location of unreferenced geographical information in terms of a modern coordinate system such as latitude and longitude. Georeferencing can be used to enrich geospatial assets – datasets or texts, including maps, gazetteers or travelogues, that refer to locations on the earth’s surface – that do not include such explicit information. A major example of crowd-sourcing activity in this area is the British Library Georeferencer project, which aimed to ‘geo-enable’ historical maps in its collections by asking participants to assign spatial coordinates to digitised map images, a task that would have been too labour-intensive for Library staff to undertake themselves. Once georeferenced, the digitised maps are searchable geographically due to the inclusion of latitude and longitude coordinates in the metadata. 67

MAPPING

Mapping (in the sense of this typology) refers to the process of creating a spatial representation of some information asset(s). This could involve the creation of map data from scratch, but could also be applied to the spatial mapping of concepts, as in a ‘mind map’. The precise sense will depend on the asset type to which mapping is being applied. There is an important distinction between maps and related geospatial assets created by expert organisations, such as the Ordnance Survey, and those created by community-based initiatives. The former may carry the authority of a governmental imprimatur and official endorsement. However, the recent emergence of crowd-sourced geospatial assets – a product of the recent global growth in the ownership of hand-held devices with the ability to record location using GPS 68 – has led to the emergence of resources such as OpenStreetMap, 69 which has in turn led to a discussion about the reliability of such resources. In general, it has been found that OpenStreetMap in particular is extremely reliable, 70 but that the specifications for such resources must be carefully defined. 71 The impact of OpenStreetMap on the cartographic community generally has been noted. 72 The importance of mapping as a means of conveying spatial significance means that this kind of asset is particularly open to different discourses, and possibly conflicting narratives. The digital realm, with its capacity to accommodate multiple, diverse contributions and interpretations, holds great potential for such material. 73
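The georeferencing process type described above lends itself to a concrete sketch. In essence, a participant supplies control points, pairs matching a pixel on the scanned map to a latitude/longitude, and an affine transform can then be fitted by least squares so that any pixel on the sheet becomes searchable geographically. This is an illustration of the general technique, not the Georeferencer project’s actual pipeline, and the control points and coordinates below are invented:

```python
import numpy as np

# Volunteer-supplied control points: (pixel_x, pixel_y) -> (lon, lat).
pixels = np.array([(120, 80), (900, 95), (150, 700), (880, 690)], dtype=float)
coords = np.array([(-3.21, 55.95), (-3.05, 55.95),
                   (-3.21, 55.90), (-3.06, 55.90)], dtype=float)

# Fit lon = a*x + b*y + c and lat = d*x + e*y + f by least squares.
A = np.column_stack([pixels, np.ones(len(pixels))])
params, *_ = np.linalg.lstsq(A, coords, rcond=None)

def pixel_to_lonlat(x, y):
    """Map any pixel on the scanned sheet to geographic coordinates."""
    return tuple(np.array([x, y, 1.0]) @ params)

print(pixel_to_lonlat(500, 400))  # an interpolated (lon, lat) pair
```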
TRANSLATING

This covers the translation of content from one language to another. In many cases, a crowd-sourced translation will require a strongly collaborative element if it is to be successful, given the semantic interdependencies that can occur between different parts of a text. However, in cases where a large text can be broken up naturally into smaller pieces, a more independent mode of work may be possible; an example is Suda On-Line, 74 which is translating the entries in a 10th-century Byzantine lexicon/encyclopaedia. A more modern, although non-academic, example is the phenomenon of ‘fansubbing’, where enthusiasts provide subtitles for television shows and other audiovisual material. 75

Conclusions

One of the main conclusions of our study is that research involving humanities crowd-sourcing can best be framed and understood through an analysis in terms of four fundamental facets – asset type, process type, task type, and output type – and of the relationships between them. Depending on the activity in question, and what it aims to do, some categories, or indeed some facets, will have primacy. Outputs might be original knowledge, or they might be more ephemeral and difficult to identify; however, considering the processes of both knowledge and resource creation as comprising these four facets gives a meaningful context to every piece of research, publication and activity we have uncovered in the course of this review. We hope the lessons and good practice we have identified here will, along with this typology, contribute to the development of new kinds of humanities crowd-sourcing in the future. Significantly, we have determined that most humanities scholars who have used crowd-sourcing as part of some research activity agree that it is not simply a form of ‘cheap labour’ for mass digitisation or resource enhancement; indeed, in a narrowly cost-benefit sense it does not always compare well with more conventional mechanisms of digitisation. In this sense, it has truly left its economic roots, as defined by Howe (2006), behind. The creativity, enthusiasm and alternative foci that communities outside the academy can bring to academic research constitute a resource that is now ripe for tapping, and the examples above illustrate the rich variety of forms that this tapping can take. We have noted the similarity between some aspects of our typology and the concept of the ‘scholarly primitive’, which has proved valuable in humanities e-research for providing a conceptual framework of fundamental building blocks for describing scholarly activities and modelling putative research infrastructures for the humanities. We have used this relationship to investigate how crowd-sourcing activities falling under various process types can contribute effectively to such research infrastructures.

Acknowledgements and additional information

A list of the projects investigated by the study, and a description of the survey (including the questions and a summary of the results), may be found in Appendices B and A respectively of Dunn and Hedges (2012).
The project website is at http://humanitiescrowds.org, and additional information (in ‘raw’ form) from the workshops organised as part of the study may be found at http://humanitiescrowds.org/wp-uploads/2012/09/workshop_report1.pdf. We are very grateful to all those who have shared their knowledge and experience with us during the study, and in particular those who agreed to be interviewed, or participated in the workshops, or provided feedback on the project report.

1 We follow the convention of hyphenating ‘crowd-sourcing’; other authors use ‘crowdsourcing’ or ‘crowd sourcing’. In quotations, we preserve the original form.
2 T. Blanke, M. Bryant, M. Hedges, A. Aschenbrenner and M. Priddy, ‘Preparing DARIAH’, 7th IEEE International Conference on e-Science, Stockholm, Sweden (2011), 158-165, http://dx.doi.org/10.1109/eScience.2011.30.
3 J. Howe, ‘The rise of crowdsourcing’, Wired, 14.06 (2006), http://www.wired.com/wired/archive/14.06/crowds.html.
4 J. Silvertown, ‘A new dawn for citizen science’, Trends in Ecology & Evolution, 24, No. 9 (2009), 467-71; D. P. Anderson, J. Cobb, E. Korpela, M. Lebofsky and D. Werthimer, ‘SETI@home: an experiment in public-resource computing’, Communications of the ACM, 45, Issue 11 (2002), 56-61.
5 J. Surowiecki, The Wisdom of Crowds: Why the Many Are Smarter than the Few (2004).
6 D. Brabham, ‘Crowdsourcing as a model for problem solving: an introduction and cases’, Convergence: The International Journal of Research into New Media Technologies, 14, Issue 1 (2008), 75-90.
7 R. Holley, ‘Crowdsourcing: how and why should libraries do it?’, D-Lib Magazine, 16, No. 3/4 (2010), http://www.dlib.org/dlib/march10/holley/03holley.html.
8 A. Wiggins and K. Crowston, ‘From conservation to crowdsourcing: a typology of citizen science’, System Sciences (HICSS), 44th Hawaii International Conference (2011), http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=5718708.
9 http://www.trevorowens.org/2012/05/the-crowd-andthe-library
10 E. Estellés-Arolas and F. González-Ladrón-de-Guevara, ‘Towards an integrated crowdsourcing definition’, Journal of Information Science, 38, No. 2 (2012), 189-200.
11 A. Wiggins and K. Crowston, ‘From conservation to crowdsourcing: a typology of citizen science’.
12 R. Bonney, H. Ballard, R. Jordan, E. McCallie, T. Phillips, J. Shirk and C. Wilderman, Public Participation in Scientific Research: Defining the Field and Assessing Its Potential for Informal Science Education, Center for Advancement of Informal Science Education, Washington, D.C. (2009), http://caise.insci.org/uploads/docs/PPSR%20report%20FINAL.pdf.
13 http://openobjects.blogspot.co.uk/2012/06/frequently-asked-questions-about.htm
14 J. Oomen and L. Aroyo, ‘Crowdsourcing in the cultural heritage domain: opportunities and challenges’, Proceedings of the 5th International Conference on Communities and Technologies (2011), 138-149, http://www.cs.vu.nl/~marieke/OomenAroyoCT2011.pdf.
15 A. Agrawal, C. Catalini and A. Goldfarb, ‘The geography of crowdfunding’, NET Institute Working Paper Series, 10-8 (2011), 1-57, http://ssrn.com/abstract=1692661.
16 http://openobjects.blogspot.co.uk/2012/06/frequently-asked-questions-about.htm
17 M. J. Raddick, G. Bracey, P. L. Gay, C. J. Lintott, P. Murray, K. Schawinski, A. S. Szalay and J. Vandenberg, ‘Galaxy Zoo: exploring the motivations of citizen science volunteers’, Astronomy Education Review, 9 (2010), http://aer.aas.org/resource/1/aerscz/v9/i1/p010103_s1.
18 B. M. Bradford and G. D. Israel, ‘Evaluating volunteer motivation for sea turtle conservation in Florida’, Agricultural Education (2004), 1-9.
19 R. Holley, Many Hands Make Light Work: Public Collaborative OCR Text Correction in Australian Historic Newspapers, National Library of Australia (2009), http://www.nla.gov.au/ndp/project_details/documents/ANDP_ManyHands.pdf.
20 http://crowds.cerch.kcl.ac.uk/wp-uploads/2012/04/Brohan.pdf
21 T. Causer, J. Tonra and V. Wallace, ‘Transcription maximized; expense minimized? crowdsourcing and editing The Collected Works of Jeremy Bentham’, Literary and Linguistic Computing, 27, Issue 2 (2012), 1-19. Similar conclusions were drawn by the authors of the current article, based on their interviews with staff and volunteers from the Old Weather project and the British Library’s Georeferencer project.
22 http://www.bl.uk/maps/
23 http://forum.oldweather.org
24 N. R. Prestopnik and K. Crowston, ‘Gaming for (citizen) science: exploring motivation and data quality in the context of crowdsourced science through the design and evaluation of a social-computational system’, Proceedings of the “Computing for Citizen Science” workshop at the 7th IEEE eScience Conference (2011), http://crowston.syr.edu/sites/crowston.syr.edu/files/gamingforcitizenscience_ver6.pdf.
25 http://crowds.cerch.kcl.ac.uk/wp-uploads/2012/04/Masinton.pdf
26 N. R. Prestopnik and K. Crowston, ‘Gaming for (citizen) science: exploring motivation and data quality in the context of crowdsourced science through the design and evaluation of a social-computational system’ (2011).
27 See http://blog.tommorris.org/post/3216687621/im-not-an-experience-seeking-user-im-a for a combative assertion of this position.
28 D. Brossard, B. Lewenstein and R. Bonney, ‘Scientific knowledge and attitude change: the impact of a citizen science project’, International Journal of Science Education, 27, Issue 9 (2005), 1029-1121.
29 D. J. Trumbull, R. Bonney, D. Bascom and A. Cabral, ‘Thinking scientifically during participation in a citizen-science project’, Science Education, 84, Issue 2 (1999), 265-275.
30 C. J. Lintott, K. Schawinski, A. Slosar, K. Land, S. Bamford, D. Thomas, M. J. Raddick, R. Nichol, A. Szalay, D. Andreescu, P. Murray and J. Vandenberg, ‘Galaxy Zoo: morphologies derived from visual inspection of galaxies from the Sloan Digital Sky Survey’, Monthly Notices of the Royal Astronomical Society, 389, Issue 3 (2008), 1179-1189.
31 http://humanitiescrowds.org/wp-uploads/2012/04/Causer.pdf
32 http://humanitiescrowds.org/wp-uploads/2012/04/Brohan.pdf
33 T. Causer, J. Tonra and V. Wallace, ‘Transcription maximized; expense minimized? crowdsourcing and editing The Collected Works of Jeremy Bentham’ (2012).
34 T. Causer and V. Wallace, ‘Building a volunteer community: results and findings from Transcribe Bentham’, Digital Humanities Quarterly, 6, No. 2 (2012), http://www.digitalhumanities.org/dhq/vol/6/2/000125/000125.html.
35 M. Terras, ‘Digital curiosities: resource creation via amateur digitisation’, Literary and Linguistic Computing, 25, No. 4 (2010), 425-438, doi:10.1093/llc/fqq019.
36 M. Moyle, J. Tonra and V. Wallace, ‘Manuscript transcription by crowdsourcing: Transcribe Bentham’, Liber Quarterly – The Journal of European Research Libraries, 20, Issue 3/4 (2011).
37 S. Dunn and M. Hedges, ‘Crowd-sourcing scoping study: engaging the crowd with humanities research’, Arts and Humanities Research Council report (2012), http://humanitiescrowds.org/wp-uploads/2012/12/Crowdsourcing-connected-communities.pdf.
38 C. L. Palmer, L. C. Teffeau and C. M. Pirmann, ‘Scholarly information practices in the online environment: themes from the literature and implications for library service development’ (2009).
39 J. Unsworth, ‘Scholarly primitives: what methods do humanities researchers have in common, and how might our tools reflect this’, ‘Humanities Computing, Formal Methods, Experimental Practice’ Symposium, King’s College London (2000), http://people.lis.illinois.edu/~unsworth/Kings.5-00/primitives.html.
40 A. Benardou, P. Constantopoulos, C. Dallas and D. Gavrilis, ‘Understanding the information requirements of arts and humanities scholarship’, International Journal of Digital Curation, 5, No. 1 (2010), 18-33.
41 S. Anderson, T. Blanke and S. Dunn, ‘Methodological commons: arts and humanities e-science fundamentals’, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, 368, No. 1925 (2010), 3779-3796.
42 T. Blanke and M. Hedges, ‘Scholarly primitives: building institutional infrastructure for humanities e-science’, Future Generation Computer Systems, 29, Issue 2 (2013), 654-661, http://dx.doi.org/10.1016/j.bbr.2011.03.031.
43 S. Bechhofer, I. Buchan, D. De Roure, P. Missier, J. Ainsworth, J. Bhagat, P. Couch, D. Cruickshank, M. Delderfield, I. Dunlop, M. Gamble, D. Michaelides, S. Owen, D. Newman, S. Sufi and C. Goble, Future Generation Computer Systems, 29, Issue 2 (2013), 599-611, http://dx.doi.org/10.1016/j.future.2011.08.004.
44 T. Blanke and M. Hedges, ‘Scholarly primitives: building institutional infrastructure for humanities e-science’ (2013).
45 H. Lin and J. Davis, ‘Computational and crowdsourcing methods for extracting ontological structure from folksonomy’, The Semantic Web: Research and Applications, Lecture Notes in Computer Science, 6089 (2010), 472-477, DOI:10.1007/978-3-642-13489-0_46.
46 S. Golder, ‘Usage patterns of collaborative tagging systems’, Journal of Information Science, 32, Issue 2 (2006), 198-208.
47 J. Trant, Tagging, Folksonomy, and Art Museums: Results of steve.museum’s Research (2009), http://conference.archimuse.com/blog/jtrant/stevemuseum_research_report_available_tagging_fo; J. Trant, D. Bearman and S. Chun, ‘The eye of the beholder: steve.museum and social tagging of museum collections’, Proceedings of the International Cultural Heritage Informatics Meeting (ICHIM07), Toronto, Canada (2007).
48 http://www.bbc.co.uk/arts/yourpaintings/
49 http://www.scholarslab.org/category/praxis-program/
50 P. Brohan, R. Allan, J. E. Freeman, A. M. Waple, D. Wheeler, C. Wilkinson and S. Woodruff, ‘Marine observations of old weather’, Bulletin of the American Meteorological Society, 90, Issue 2 (2009), 219-230.
51 T. Causer, J. Tonra and V. Wallace, ‘Transcription maximized; expense minimized? crowdsourcing and editing The Collected Works of Jeremy Bentham’ (2012).
52 For example, http://www.familysearch.org
53 R. Holley, Many Hands Make Light Work: Public Collaborative OCR Text Correction in Australian Historic Newspapers (2009).
54 M. Wald, ‘Crowdsourcing correction of speech recognition captioning errors’, Proceedings of the International Cross-Disciplinary Conference on Web Accessibility – W4A ’11 (2011), http://eprints.soton.ac.uk/272430/1/crowdsourcecaptioningw4allCRv2.pdf.
55 R. Kurin, ‘Safeguarding intangible cultural heritage in the 2003 UNESCO convention: a critical appraisal’, Museum International, 56, Issue 1-2 (2004), 66-77.
56 http://www.regaltenbury.org.uk/memory-reel/
57 C. Hough, E. Bramwell and D. Grieve, Scots Words and Place-Names Final Report, JISC (2011), http://www.jisc.ac.uk/media/documents/programmes/digitisation/swapfinalreport.pdf. See also http://swap.nesc.gla.ac.uk/.
58 http://www.scotsdictionaries.org.uk/
59 This usage differs from the standard usage of the term by museums.
60 http://www.europeana1914-1918.eu/en/contributor
61 M. Terras, ‘Digital curiosities: resource creation via amateur digitisation’ (2010).
62 R. Holley, ‘Crowdsourcing: how and why should libraries do it?’ (2010).
63 www.yearofshakespeare.com
64 http://humanitiescrowds.org/wp-uploads/2012/09/workshop_report1.pdf
65 http://bloggingshakespeare.com/year-of-shakespeare-king-lear-at-the-almeida
66 http://www.whats-the-score.org; http://scores.bodleian.ox.ac.uk
67 C. Fleet, K. C. Kowal and P. Pridal, ‘Georeferencer: crowdsourced georeferencing for map library collections’, D-Lib Magazine, 18, No. 11/12 (2012), http://www.dlib.org/dlib/november12/fleet/11fleet.html.
68 M. Goodchild, ‘Editorial: citizens as voluntary sensors: spatial data infrastructure in the world of Web 2.0’, International Journal of Spatial Data Infrastructures Research, 2 (2007), 24-32.
69 http://www.openstreetmap.org/
70 M. Haklay and P. Weber, ‘OpenStreetMap: user-generated street maps’, Pervasive Computing, IEEE, 7, Issue 7 (2008), 12-18; M. Haklay, ‘How good is volunteered geographical information? A comparative study of OpenStreetMap and Ordnance Survey datasets’, Environment and Planning B: Planning and Design, 37, Issue 4 (2010), 682-703.
71 C. Brando and B. Bucher, ‘Quality in user generated spatial content: a matter of specifications’, Proceedings of the 13th AGILE International Conference on Geographic Information Science, Guimarães, Portugal (2010), 1-8.
72 S. Chilton, ‘Crowdsourcing is radically changing the geodata landscape: case study of OpenStreetMap’, Proceedings of the 24th International Cartographic Conference, Santiago, Chile (2009), http://w.icaci.org/files/documents/ICC_proceedings/ICC2009/html/nonref/22_6.pdf.
73 C. Fink, ‘Mapping together: on collaborative implicit cartographies, their discourses and space construction’, Journal for Theoretical Cartography, 4 (2011), 1-14; M. Graham, ‘Neogeography and the palimpsests of place: Web 2.0 and the construction of a virtual earth’, Tijdschrift voor Economische en Sociale Geografie, 101, Issue 4 (2010), 422-436.
74 http://www.stoa.org/sol/
75 J. D. Cintas and P. M. Sanchez, ‘Fansubs: audiovisual translation in an amateur environment’, Journal of Specialised Translation, 6 (2006), 37-52.

Figure 1: Typology framework

Table 1: Process Types – Collaborative tagging; Linking; Correcting/modifying content; Transcribing; Recording and creating content; Commenting, critical responses and stating preferences; Categorising; Cataloguing; Contextualising; Mapping; Georeferencing; Translating.

Table 2: Asset Types – Geospatial; Text; Numerical or statistical information; Sound; Image; Video; Ephemera and intangible cultural heritage.

Table 3: Task Types – Mechanical; Configurational; Editorial; Synthetic; Investigative; Creative.

Table 4: Output Types – Original text; Transcribed text; Corrected text; Enhanced text; Transcribed music; Metadata; Structured data; Knowledge/awareness; Funding; Synthesis; Composite digital collection with multiple meanings.
Digital Humanities’ Shakespeare Problem

Laura Estill, Department of English, St. Francis Xavier University; P.O. Box 5000, Antigonish, NS B2G 2W5, Canada; lestill@stfx.ca

Received: 28 January 2019; Accepted: 23 February 2019; Published: 4 March 2019

Abstract: Digital humanities has a Shakespeare problem; or, to frame it more broadly, a canon problem. This essay begins by demonstrating why we need to consider Shakespeare’s position in the digital landscape, recognizing that Shakespeare’s prominence in digital sources stems from his cultural prominence. I describe the Shakespeare/not Shakespeare divide in digital humanities projects and then turn to digital editions to demonstrate how Shakespeare’s texts are treated differently from his contemporaries—and often isolated by virtue of being placed alone on their pedestal. In the final section, I explore the implications of Shakespeare’s popularity for digital humanities projects, some of which exist solely because of Shakespeare’s status. Shakespeare’s centrality to the canon of digital humanities reflects his reputation in wider spheres such as education and the arts. No digital project will offer a complete, unmediated view of the past, or, indeed, the present. Ultimately, each project implies an argument about the status of Shakespeare, and we—as Shakespeareans, early modernists, digital humanists, humanists, and scholars—must determine what arguments we find persuasive and what arguments we want to make with the new projects we design and implement.

Keywords: digital humanities; Shakespeare; early modern drama; literary canon; English literature; Renaissance

1. Introduction

Digital humanities has a Shakespeare problem; or, to frame it more broadly, a canon problem. Too many digital projects and sites focus on Shakespeare alone. Some sites highlight Shakespeare to the exclusion of other writers; other projects set their bounds at Shakespeare and “not Shakespeare”. Digital humanities’ Shakespeare problem both stems from and reifies Shakespeare’s centrality to the canon of English literature. While this problem is, indeed, a digital humanities problem, it is also a problem in the arts and humanities more generally. Shakespeare is one of the few writers regularly featured in single-author undergraduate courses (alongside, perhaps, Chaucer, Milton, and Austen, albeit to a lesser extent). Shakespeare’s works are so often produced on the twenty-first century stage that American Theatre excludes Shakespeare from their annual list of top-produced American plays in order to “make more room on our list for everyone and everything else” (Tran 2018). Digital humanities has often been heralded as the solution to the canonicity problem, but that is a great burden that it cannot bear alone. This essay begins by demonstrating why we need to consider Shakespeare’s position in the digital landscape, recognizing that Shakespeare’s prominence in digital sources stems from his cultural prominence. I describe the Shakespeare/not Shakespeare divide in digital humanities projects and then turn to digital editions to demonstrate how Shakespeare’s texts are treated differently from his contemporaries—and often isolated by virtue of being placed alone on their pedestal. In the final section, I explore the implications of Shakespeare’s popularity for digital humanities projects, some of which exist solely because of Shakespeare’s status.
Shakespeare’s centrality to the canon of digital humanities reflects his reputation in wider spheres such as education and the arts. No digital project will offer a complete, unmediated view of the past, or, indeed, the present. Ultimately, each digital humanities project presents an argument about the status of Shakespeare, and we—as Shakespeareans, early modernists, digital humanists, humanists, and scholars—must determine what arguments we find persuasive and what arguments we want to make with the new projects we design and implement. Although the definition of digital humanities (and perhaps even the definition of Shakespeare) is subject to disagreement, for this essay, I limit my scope to digital humanities resources for pedagogy and research. This excludes games such as Richard III Attacks! (P. 2015), online performances such as Such Tweet Sorrow (Silbert 2010), and social media hashtags like #ShakespeareSunday. Cultural studies often informs New Media Shakespeare scholarship to show Shakespeare’s continued prominence online (see O’Neill 2015 for an overview): consider recent issues of Shakespeare Quarterly (Rowe 2010) and Borrowers and Lenders (Calbi and O’Neill 2016) on this topic. Stephen O’Neill (2018), drawing on Douglas Lanier’s notion of “Shakespearean rhizomatics” (Lanier 2014), equates “Our contemporary Shakespeares” to “digital Shakespeares”, describing both as “fully rhizomatic in their extraordinary and seemingly endless flow of relations.” Christy Desmet suggests that we need to encounter all digital Shakespeares (both digital humanities and new media) through the lens of Ian Bogost’s “alien phenomenology” (Bogost 2012), considering “material objects and networks as models for posthuman relations” (Desmet 2017, p. 5). Although Digital Humanities and New Media are often paired, for the purpose of this essay it is useful to differentiate the two: new media endeavors that participate in or create digital culture versus digital humanities projects that announce themselves as contributing to our general and scholarly knowledge. This article focuses on digital humanities projects for two reasons: first, as one way of limiting the scope of the “seemingly endless flow of relations” in Digital Shakespeares, and second, because the majority of digital humanities projects exist primarily to educate rather than to entertain. Digital humanities projects provide the resources we use to study and teach the early modern period: digital editions, bibliographies, digitizations, catalogs, and more. Often, digital humanities projects are expanded from earlier print resources: consider, for instance, the online English Short Title Catalogue (British Library 2006) and its print antecedents, the short-title catalogs by Pollard and Redgrave (1926) and Donald Goddard Wing (1945). Nondigital scholarly resources frequently skew towards Shakespeare; even the library catalogs we use to access archival resources are not neutral and emphasize Shakespeare above his contemporaries (Estill 2019a).
Many digital humanities resources replicate this Shakespeare-centric focus, and, as such, misrepresent the materials they provide or offer a skewed perspective on early modern literature, theatre, and culture. Biased sources can only lead to biased scholarship; and while some professors will be able to see the biases of the sites they visit, many students will not. This is particularly problematic because, as Christie Carson and Peter Kirwan explain, “Students are some of the key ‘users’ of digital Shakespeare” (Carson and Kirwan 2014a, p. 244). It has been well documented that major digital literary studies projects often focus on canonical authors. There is excellent work on the biases of digital humanities projects, particularly in relation to the status of women writers (see, for instance, Wernimont and Flanders 2010; Mandell 2015; Bergenmar and Leppänen 2017) and the canon of American literature (Earhart 2012; Price 2009), yet comparatively few scholars have critiqued how digital humanities overrepresents perhaps the most canonical figure in all of English literature: Shakespeare. “Shakespeare and Digital Humanities” has been and continues to be a fruitful area of research, with special issues of Shakespeare (Galey and Siemens 2008), the Shakespearean International Yearbook (Hirsch and Craig 2014), RiDE: Research in Drama Education (Bell et al., forthcoming), and this issue of Humanities. The prevalence of digital humanities tools in Shakespeare teaching and research leads Carson and Kirwan to wonder, “are all Shakespeares digital now?” (Carson and Kirwan 2014a, p. 240). The questions less often asked are: when we focus on Shakespeare(s) in our digital projects, what is excluded by our Shakespeare-centrism? And how does that shape how we access and understand early modern drama? Digital Shakespeare studies often focuses on Shakespeare’s place in the digital world, without questioning why he is given such primacy or considering the ramifications of his continued canonization.
Another way digital humanities has been announced to recover noncanonical writers is by projects that digitize on a large scale. Julia Flanders (2009) explains: It is now easier, in some contexts, to digitize an entire library collection than to pick through and choose what should be included and what should not: in other words, storage is cheaper than decision-making. The result is that the rare, the lesser known, the overlooked, the neglected, and the downright excluded are now likely to make their way into digital library collections, even if only by accident. Indeed, it is the decision-making where Shakespeare too often gets pulled artificially to the fore: sometimes even in the foundational decisions about project scope. The next section of the essay explores how single authors are represented in small-scale digital resources versus large-scale digital resources, thinking about them in terms of labor, funding, and project scope. 2. The Shakespeare/Not Shakespeare Divide in Digital Humanities Resources There is a lopsidedness to early modern online resources: some, such as the English Short Title Catalog (ESTC; British Library 2006) and the Database of Early English Playbooks (DEEP; Lesser and Farmer 2007) deliver breadth of coverage that is, due to their large scope, necessarily shallow; others, such as The Shakespeare Quartos Archive (Bodleian Library 2009) or MIT’s Global Shakespeares (Donaldson 2009), provide deep coverage of a much narrower topic. Both approaches are needed to support different avenues of early modern scholarship, but, the latter, I contend, too often begins and ends with Shakespeare. The logistical reasons for these very different kinds of projects (broad coverage versus deep coverage) are readily apparent. The notion of “Shakespeare” offers a convenient scope and bounds for a given project. Many projects that include detailed metadata, extensive editorial annotation or encoding, expensive-to-create facsimiles, or streaming media center on the work of a single author. The Pulter Project (Knight and Wall 2018), for instance, is an example of a new project that focuses on a single author, and, indeed, a single manuscript, in order to offer a hypertext edition with multiple layers of editorial intervention, linked related texts, and comparative viewing options. The Digital Cavendish Project (Moore and Tootalian 2013) offers a range of ways to interact with Margaret Cavendish’s life and texts: site visitors can explore Margaret Cavendish’s social network, search the bibliography-in-progress of Cavendish scholarship, and make use of reference works such as a list of Cavendish’s printers and booksellers and a spreadsheet locating all known copies of Cavendish’s early publications. We can imagine extending these projects by adding another analysis section, another manuscript, or even another individual author. However, to extend these projects by any order of magnitude, by say, covering all seventeenth-century women writers or all previously unpublished Humanities 2019, 8, 45 4 of 16 manuscript poetry would be to undertake significant amounts of labor and would require both time and money. These single-author projects are the fruits of detailed scholarly attention: they are “boutique” digital projects. In their discussion of archival practices, Mark A. Greene and Dennis Meissner position “boutique digitization” at the far end of the continuum from “‘Googlization’ (ultra-mass digitization)” (Greene and Meissner 2005, p. 196). 
The former, boutique projects, require “extraordinary attention to the unique properties of each artifact” (Conway 2010, p. 76). While Greene, Meissner, and Conway focus on archival digitization projects, the continuum also applies to digital humanities projects, many of which include digitized elements alongside other interventions: transcriptions, editorial apparatus, bibliographic resources, and so forth. The Shakespeare Quartos Archive (Bodleian Library 2009) is an example of extraordinary attention to primary sources: the site’s goal is to “reproduce at least one copy of every edition of William Shakespeare’s plays printed in quarto before the theatres closed in 1642.” Where possible, however, they include digitizations of as many copies of each Shakespeare quartos as possible. Their prototype offers thirty-two quartos of Hamlet (from Q1–Q5), carefully digitized and painstakingly encoded.1 With their attention to primary sources, the Shakespeare Quartos Archive project argues that scholars must pay attention to copy-specific details. The Shakespeare Quartos Archive text encoding highlights different marginalia in each copy, the binding, and even the library ownership stamps.2 While the Shakespeare Quartos Archive can be used as an exemplar of a “boutique” project, it is not the labor of a single scholar. This project emerged from the collaboration of multiple major institutions, including, most notably, the Bodleian Library of the University of Oxford, the British Library, the University of Edinburgh Library, the Folger Shakespeare Library, the Huntington Library, and the National Library of Scotland. The project was made possible by major grant funding from the United States’s National Endowment for the Humanities (NEH) and the United Kingdom’s Joint Information Systems Committee (JISC). The well-supported Shakespeare Quartos Archive raises another reason for author-centric approaches, namely, existing funding models. As Jamie “Skye” Bianco (2012) explains, “digital humanities is directly linked to the institutional funding that privileges canonical literary and historiographic objects and narratives” (see also Price 2009). In her review, Desmet unpacks the project’s “rationale for a focus on Shakespeare’s quartos” (Desmet 2014, p. 143): the rarity and fragility of the material objects; their locations in libraries around the world; and the lack of Shakespearean manuscript texts. This rationale, while a compelling argument for why we need to digitize and encode all early modern play quartos, hardly touches on why Shakespeare is the focus of the project. We lack authorial manuscripts of many plays by many playwrights. The Shakespearean focus of the Shakespeare Quartos Archive is taken for granted. It is hard to imagine the Ford Quartos Archive receiving much enthusiasm from funders, despite the fact that John Ford’s plays are still edited, anthologized, taught, and performed today. There are many ongoing editorial projects focused on individual early modern playwrights, such as Oxford University Press’s The Complete Works of John Marston (Butler and Steggle, forthcoming); yet to imagine digitizing and encoding all known early printings of Marston’s work for a Marston Quartos Archives seems far-fetched, and the notion of turning to even less canonical playwright—say, the Glapthorne Quartos Archive—hardly bears thinking about. Shakespeare sells. Shakespeare’s name is itself a valuable commodity (Hodgdon 1998; McLuskie and Rumbold 2014; Olive 2015). 
Digital project 1 Just as Digital Humanities has a Shakespeare problem, Shakespeare studies has a Hamlet problem, although the prominence of Hamlet in Shakespeare studies, both digital and otherwise, is a topic for another essay. For evidence of Hamlet’s prominence, see Bernice W. Kliman et al.’s HamletWorks (Kliman et al. 2004) and Estill, Klyve, and Bridal (Estill et al. 2015). 2 The Shakespeare Quartos Archive uses the Text Encoding Initiative (TEI) for their XML (eXtensible Markup Language), which includes elements such as Belinda Barnet is Lecturer in Media and Communications at Swinburne University, Melbourne. Prior to her appointment at Swinburne she worked at Ericsson Australia, where she managed the development of 3G mobile content services and developed an obsession with technical evolution. Belinda did her PhD on the history of hypertext at the University of New South Wales, and has research interests in digital media, digital art, convergent journalism and the mobile internet. She has published widely on new media theory and culture. Authored for DHQ; migrated from original DHQauthor format This article describes the evolution of the design of Vannevar Bush's Memex, tracing its roots in Bush's earlier work with analog computing machines, and his understanding of the technique of associative memory. It argues that Memex was the product of a particular engineering culture, and that the machines that preceded Memex — the Differential Analyzer and the Selector in particular — helped engender this culture, and the discourse of analogue computing itself. Can we say that technical machines have their own genealogies, their own evolutionary dynamic? Since the early days of Darwinism, analogies have been drawn between biological evolution and the evolution of technical objects and systems. It is obvious that technologies change over time; we can see this in the fact that technologies come in generations; they adapt and adopt characteristics over time, Inventors learn by experience and experiment, and they learn by watching other machines work in the form of technical prototypes. They also copy and Can we say that technical machines have their own genealogies, their own evolutionary dynamic? It is my contention that we can, and I have argued elsewhere that in order to tell the story of a machine, one must trace the path of these transferrals, paying particular attention to technical In this article I will be telling the story of particular technical machine – Vannevar Bush’s Memex. Memex was an electro-mechanical device designed in the 1930’s to provide easy access to information stored associatively on microfilm. It is often hailed as the precursor to hypertext and the web. Linda C. Smith undertook a comprehensive citation context analysis of literary and scientific articles produced after the 1945 publication of Bush's article on the device, The social and cultural influence of Bush’s inventions are well known, and his political role in the development of the atomic bomb are also well known. What is not so well known is the way the Memex came about as a result of both Bush’s earlier work with analog computing machines, and his understanding of the Bush transferred technologies directly from the Analyzer and also the Selector into the design of Memex. I will trace this transfer in the first section. He also transferred an electro-mechanical model of human associative memory from the nascent science of cybernetics, which he was exposed to at MIT, into Memex. 
We will explore this in the second section. In both cases, we will be paying particular attention to the structure and architecture of the technologies concerned. The idea that technical artefacts evolve in this way, by the transfer of both technical innovations (for example, microfilm) and techniques (for example, association as a storage technique), was popularised by the French technology historian Bertrand Gille. I will be mobilising Gille’s theories here as I trace the evolution of the Memex design.

We will begin with Bush’s first analogue computer, the Differential Analyzer. The Differential Analyzer was a giant, electromechanical gear-and-shaft machine which was put to work during the war calculating artillery ranging tables and the profiles of radar antennas. In the late 1930s and early 1940s, it was among the most important computing machines in the world, and copies of it were built at research centres in the US and abroad. However, by the spring of 1950, the Analyzer was gathering dust in a storeroom — the project had died. Why did it fail? Why did the world’s most important analogue computer end up in a back room within five years? This story will itself be related to why Memex was never built; research into analogue computing technology in the interwar years, and the Analyzer in particular, contributed to the rise of digital computing: it demonstrated that machines could automate higher mathematics, and it trained the generation of engineers who would go on to build digital machines.

The decade between the Great War and the Depression was a bull market for engineering research. Of particular interest to the engineers was the Carson equation for transmission lines. This was a simple equation, but it required intensive mathematical integration to solve. So the equation was transferred to an electro-mechanical device: the Product Integraph. Many of the early analogue computers that followed Bush’s machines were designed to automate existing mathematical equations. This particular machine physically mirrored the equation itself. It incorporated the use of a mechanical integrator, a wheel-and-disc mechanism whose rotation traced the integral. A second version of this machine incorporated two wheel-and-disc integrators, and it was a great success. Bush observed the success of the machine, and particularly the later incorporation of the two wheel-and-disc integrators, and decided to make a larger one, with more integrators and a more general application than the Carson equation. By the fall of 1928, Bush had secured funds from MIT to build a new machine. He called it the Differential Analyzer, after an earlier device proposed by Lord Kelvin which might externalise the calculus and solve differential equations mechanically.

As Bertrand Gille observes, a large part of technical invention occurs by transfer, whereby the functioning of a structure is analogically transposed onto another structure, or the same structure is generalised outwards to new applications. In engineering science, there is an emphasis on working prototypes, on machines that demonstrably work. Watching the Analyzer work did more than just teach people about the calculus. It also taught people about what might be possible for mechanical calculation. Its technical processes remained the same. It was an analogue device, and it literally turned around a central analogy: the rotation of the wheel shall be the area under the graph (and thus the integral). The Analyzer directly mirrored the task at hand; there was a mathematical transparency to it which at once held observers captive and promoted, in its very workings, the analogue ethos: the machine as a mirror of its mathematical task.
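That central analogy can be mimicked numerically. In the physical machine, a small wheel rests on a rotating disc at a distance y(x) from the disc’s centre, so each small turn of the disc advances the wheel in proportion to y; the accumulated rotation is the integral. The toy simulation below is my own illustration of that behaviour, not a reconstruction of Bush’s engineering:

```python
import math

def wheel_and_disc(y, x0, x1, dx=1e-4):
    """Accumulate wheel rotation as the disc turns: rotation ~ integral of y dx.

    y      -- function giving the integrand (the wheel's offset from disc centre)
    x0, x1 -- range of the independent variable (the disc's rotation)
    """
    rotation = 0.0
    x = x0
    while x < x1:
        rotation += y(x) * dx   # each small disc turn advances the wheel by y*dx
        x += dx
    return rotation

# Integrating cos from 0 to pi/2 should turn the wheel by ~1.0, i.e. sin(pi/2).
print(wheel_and_disc(math.cos, 0.0, math.pi / 2))  # ~0.99997
```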
Many of the Analyzers built in the 1930s were built using military funds. The creation of the first Analyzer, and Bush’s growing reputation as a builder of calculating machines, brought him to the attention of the military. This paper has been arguing that the Analyzer helped engender a particular engineering culture, and the discourse of analogue computing itself. In 1935, the Navy came to Bush for advice on machines to crack coding devices like the new Japanese cipher machines. There were three new technologies emerging at the time which handled information: photoelectricity, microfilm and digital electronics. Bush transferred these three technologies to the new design. This decision was not pure genius on his part; they were perfect analogues for a popular conception of how the brain worked at the time. The scientific community at MIT were developing a pronounced interest in man-machine analogues, and although Claude Shannon had not yet published his information theory, it was already being formulated, and there was much discussion around MIT about how the brain might process information in the manner of an analogue machine. Bush thought and designed in terms of analogies between brain and machine, electricity and information. This was also the central research agenda of Norbert Wiener and Warren McCulloch, both at MIT, who were at the time laying the foundations of what would become cybernetics.

Bush called this machine the Comparator — it was to do the hard work of comparing text and letters for the humble human mind. Like the analytic machines before it and all other technical machines being built at the time, this was an analogue device; it directly mirrored the task at hand on a mechanical level. In this case, it directly mirrored the operations of a human clerk comparing strings of characters. But immediately, there were problems in its development. Technical objects often depart from their fabricating intention: sometimes because they are put to unforeseen uses, and sometimes because the technology itself imposes its own limits. By this time, Bush had also started work on the Memex design. He transferred much of the architecture from the Comparator, including photoelectrical components, an optical reader and microfilm. In tune with the times, Bush had developed a fascination for microfilm in particular as an information storage technology, and although it had failed to work properly in the Comparator, he wanted to try it again. It would appear as the central technology in the Rapid Selector and also in the Memex design. In the 1930s, many believed that microfilm would make information universally accessible and thus spark an intellectual revolution. Microfilm promised vast amounts of storage in a very small space.

Corporate funding was secured for the Selector by pitching it as a microfilm machine to modernise the library. Bush considered the Selector as a step towards the mechanised control of scientific information, which was of immediate concern to him as a scientist. According to him, the fate of the nation depended on the effective management of these ideas lest they be lost in a brewing data storm. Progress in information management was not only inevitable, it was essential. But as Burke writes, the technology of microfilm and the tape-scanners began to impose their technical limitations. The Selector’s scanning station was similar to that used in the Comparator. But in the Selector, the card containing the code of interest to the researcher would be stationary. Bush and others associated with the project found that the machine could not reliably make its selections at the speeds intended. Solutions were suggested (among them slowing down the machine, and checking abstracts before they were used). In the evolution of any machine, there will be internal limits generated by the behaviour of the technology itself; Gille calls these endogenous limits.

The Analyzer, meanwhile, was being used extensively during WWII for ballistic analysis and calculation. Wartime security prevented its public announcement until 1945, when it was hailed by the press as a great mechanical brain. What happened?
The reasons the Analyzer fell into disuse were quite different to the Selector's; its limits were not internal to the technology but exogenous. The war had "ushered in a variety of new computation tasks, in the field of large-volume data analysis and real-time operation, which were beyond the capacity of the Rockefeller instrument". I do not have the space here to trace the evolution of digital computing at this time in the US and the UK; excellent accounts have already been written elsewhere. It is important to understand, however, that Bush was not a part of this revolution. He had not been trained in digital computation or information theory, and knew little about the emerging field of digital computing. As he later admitted: "The trend had turned in the direction of digital machines, a whole new generation had taken hold. If I mixed with it, I could not possibly catch up with new techniques, and I did not intend to look foolish."

He was immersed in a different technical system: analogue machines interpreted mathematics in terms of mechanical rotations, treated storage and memory as a physical "holding" of information, and drew their answers as curves. They directly mirrored the operations of the calculus. Warren Weaver expressed his regret over the passing of analogue machines and the Analyzer in a letter to the director of MIT's Center of Analysis: "It seems rather a pity not to have around such a place as MIT a really impressive Analogue computer; for there is a vividness and directness of meaning of the electrical and mechanical processes involved ... which can hardly fail, I would think, to have a very considerable educational value." (Weaver, cited in Owens 1991, 5) Neither man had expected that computer science would so quickly overtake Bush's project (Weaver and Caldwell, cited in Owens 1991).

The passing away of analogue computing was the passing away of an ethos: machines as mirrors of mathematical tasks. But Bush and Memex remained in the analogue era; in all versions of the Memex essay, his goal remained the same: he sought to develop a machine that mirrored and recorded the patterns of the human brain. Technological evolution moves faster than our ability to adjust to its changes; more precisely, it moves faster than the discourses and habits of thought that grow up around particular machines.

We now turn to Bush's fascination with, and exposure to, new models of human associative memory gaining currency in his time. Bush thought and designed his machines in terms of biological-mechanical analogues; he sought a symbiosis between "natural" human thought and his thinking machines. "There is another revolution under way," he wrote, "and it is far more important and significant than [the industrial revolution]. It might be called the mental revolution."

As Nyce and Kahn observe, in all versions of the Memex essay (1939, 1945, 1967), Bush begins his thesis by explaining the dire problem we face in confronting the great mass of the human record, criticising the way information was then organised: "Our ineptitude at getting at the record is largely caused by the artificiality of systems of indexing. When data of any sort are placed in storage, they are filed alphabetically or numerically, and information is found (when it is) by tracing it down from subclass to subclass. It can only be found in one place, unless duplicates are used; one has to have rules as to which path will locate it, and the rules are cumbersome. Having found one item, moreover, one has to emerge from the system and re-enter on a new path. The human mind does not work that way. It operates by association. With one item in grasp, it snaps instantly to the next that is suggested by the association of thoughts, in accordance with some intricate web of trails carried by the cells of the brain." (Bush 1939, 1945, 1967)

These paragraphs were important enough that they appeared verbatim in all versions of the Memex essay — 1939, 1945 and 1967. Across the versions, the machine grew more "intelligent" and the other machines (the Cyclops Camera, the Vocoder) disappeared; these paragraphs, however, remain a constant. Given this fact, Nelson's assertion that the major concern of the essay was to point out the artificiality of systems of indexing, and to propose the associative mechanism as a solution for it, should not be "ignored" by commentators. Association was more "natural" than other forms of indexing — more human. This is why it was revolutionary.

Which is interesting, because Bush's model of mental association was itself technological. The mind "snapped" between allied items, an unconscious movement directed by the trails themselves, trails of brain or of machine; Bush marvelled at the "speed of action" in the retrieval process from neuron to neuron: "my brain runs rapidly — so rapidly I do not fully recognize that the process is going on." Bush's model of human associative memory was an electro-mechanical one — a model that was being keenly developed by Claude Shannon, Warren McCulloch and Walter Pitts at MIT, and would result in the McCulloch-Pitts neuron: the neuron as a "mechanical switching" element (a term omitted from the later versions). Bush himself held that "a great deal of our brain cell activity is closely parallel to the operation of relay circuits", and that one can explore this parallelism "…almost indefinitely" (November 6, 1944; cited in Nyce and Kahn 1991, 62).

In the 1930s and 1940s, the popular scientific conception of mind and memory was a mechanical one. An object or experience was perceived, transferred to the memory-library's receiving station, and then "installed in the memory-library for all future reference" as memories, sheets of data. Memory "stored" information across the neural substrate, in some instances creating further connections, via minute electrical vibrations. According to Bush, memories that were not accessed regularly suffered from this neglect by the conscious mind and were prone to fade; the pathways of the brain, its indexing system, needed constant electrical stimulation to remain strong. This was the problem with the neural network: items are not fully permanent, memory is transitory.

According to Manuel De Landa, there was also a widespread faith in biological-mechanical analogues at the time as models to boost human functions. The military had been attempting to develop technologies which mimicked and subsequently replaced human faculties for many years. This reciprocal modelling took "the image of the machine as the basis for the understanding of man", and vice versa, writes Harold Hatt in his book on cybernetics; scientists "looked for and worked from parallels they saw between neural structure and process and computation". Bush explicitly worked with such methodologies — in fact, he not only thought with and in these terms, he built technological projects with them. The first step was locating the "process" or nature of thought itself; the second step was transferring this process to a machine. So there is a double movement within Bush's work: the location of a "natural" human process within thought, a process which is already machine-like, and the subsequent refinement and modelling of a particular technology on that process. Technology should depart from nature, it should depart from an extant human process: this saves us so much work. If this is done properly, "[it] should be possible to beat the mind decisively in the permanence and clarity of the items resurrected from storage". The machine would take over the "pick-and-shovel" work of the human mind: "The future means of implementing thought are … fully as worthy of attention by one who wonders what comes next as are new ways of extracting natural resources, or of killing men."

So Memex was first and foremost an extension of human memory and the associative movements that the mind makes through information: a mechanical analogue to an already mechanical model of memory.
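The McCulloch-Pitts neuron mentioned above is easy to state in code. The sketch below is a generic illustration of the published 1943 model (binary inputs, a firing threshold, and absolute inhibition), not of anything Bush built; the function names are invented for the example.

```python
def mp_neuron(excitatory, inhibitory, threshold):
    """A McCulloch-Pitts unit: fires (returns 1) when enough excitatory
    inputs are active and no inhibitory input is active. In the 1943
    model, inhibition is absolute: one inhibitory signal blocks firing.
    """
    if any(inhibitory):
        return 0
    return 1 if sum(excitatory) >= threshold else 0

# Two-input logic gates, in the spirit of McCulloch and Pitts' examples.
AND = lambda a, b: mp_neuron([a, b], [], threshold=2)
OR = lambda a, b: mp_neuron([a, b], [], threshold=1)
print(AND(1, 1), AND(1, 0), OR(1, 0))  # 1 0 1
```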
Bush transferred this idea into information management: Memex was distinct from traditional forms of indexing not so much in its mechanism or content, but in the way it organised information based on association. The design did not spring from the ether, however; the first Memex design incorporates the technical architecture of the Rapid Selector and the methodology of the Analyzer — the machines Bush was assembling at the time. In his autobiography, Bush recalled the Comparator as taking over the "drudge" work of cryptography, and he rightly saw it as the first electronic data-processing machine. Nyce and Kahn maintain that Bush took this methodology from the Rapid Selector: "The Memex-like machine proposed in Bush's 1937 memo to Weaver shows just how much [the Selector] and the Memex have in common. In the rapid selector, low-level mechanisms for transporting 35mm film, photo-sensors to detect dot patterns, and precise timing mechanisms combined to support the high-order task of information selection. In Memex, photo-optic selection devices, keyboard controls, and dry photography would be combined … to support the process of the human mind."

The idea of creating a machine to aid the mind did not belong to Bush, nor did the technique of integral calculus (or association, for that matter); he was, however, arguably the first person to externalise this technology on a grand scale. Observing the success of the Analyzer, he selected from the existing technologies of the time and made a case for how they should develop in the future. The difference, of course, was that Bush's proposed Memex would access information stored on microfilm by association rather than by conventional indexing. As Professor of Engineering at MIT (and, after 1939, President of the Carnegie Institute in Washington), Bush was in a unique position: he had access to a pool of ideas, techniques and technologies which the general public, and engineers at other, smaller schools, did not. Bush had a more "global" view of the combinatory possibilities and the technological lineage. Bush himself admitted this; in fact, he believed that engineers and scientists were the only people who could properly judge such possibilities. The public showed "so little true discrimination", being "wont to visualize scientific triumphs as" finished achievements "before they are even ready, even as they are being hatched in the laboratory"; only the engineer could distinguish the workable from the wishful.

Memex was a future technology. It was originally proposed as a desk at which the user could sit, equipped with two "slanting translucent screens" upon which material would be projected for convenient reading, and a "set of buttons and levers" which the user could depress to search the information using an electrically-powered optical recognition system. If the user wished to consult a certain piece of information, "he [tapped] its code on the keyboard, and the title page of the book promptly appear[ed]". "The matter of bulk [was] well taken care of" by this technology: "only a small part of the interior is devoted to storage, the rest to mechanism". It looked like an "ordinary" desk, except that it had screens and a keyboard attached to it. To add new information to the microfilm file, a photographic copying plate was also provided on the desk, but most of the Memex contents would be "purchased on microfilm ready for insertion".

The 1945 Memex design also introduced the concept of "trails", a concept derived from contemporary work in neuronal storage-retrieval networks: a method of connecting information by linking units together in a networked manner, similar to hypertext paths. The process of making trails was called "trailblazing", and was based on a mechanical provision "whereby any item may be caused at will to select immediately and automatically another". Items could thus be "gathered together from widely separated sources and bound together to form a new book". "This is the essential feature of the Memex. The process of tying two items together is the important thing."
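As a reading aid, trails can be sketched as a tiny data structure: named, ordered sequences of links over stored items. This is a loose illustration of the concept only, not a reconstruction of the microfilm mechanism; the class and the example items (echoing the bow-and-arrow scenario from Bush's 1945 essay) are invented here.

```python
class Memex:
    """Toy sketch of Bush's 'trails': named, ordered sequences of
    links tying stored items together in the order they were joined."""

    def __init__(self):
        self.items = {}   # item code -> document text
        self.trails = {}  # trail name -> ordered list of item codes

    def add_item(self, code, text):
        self.items[code] = text

    def tie(self, trail, code):
        """Trailblazing: append an item to a trail, so that one item
        selects 'immediately and automatically another'."""
        self.trails.setdefault(trail, []).append(code)

    def follow(self, trail):
        """Replay a trail, as if its items were 'bound together to
        form a new book'."""
        return [self.items[c] for c in self.trails.get(trail, [])]

mx = Memex()
mx.add_item("T1", "Treatise on the Turkish bow")
mx.add_item("E1", "Article on the elasticity of materials")
mx.tie("bow-and-arrow", "T1")
mx.tie("bow-and-arrow", "E1")
print(mx.follow("bow-and-arrow"))
```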
The Memex described in the later essays went further still: it would build and strengthen trails of association even when the owner was not there. In modern terminology, such a machine is called an intelligent "agent", a concept we shall discuss later in this work; technology has not yet reached Bush's vision for adaptive associative indexing. Bush pointed to Shannon's "mechanical mouse" as just such a self-adaptable machine: "A striking form of self adaptable machine is Shannon's mechanical mouse. Placed in a maze it runs along, butts its head into a wall, turns and tries again, and eventually muddles its way through. But, placed again at the entrance, it proceeds through without error making all the right turns." This was in line with Bush's conception of technical machines as mechanical teachers in their own right. It was a proposal of an active symbiosis between machine and human memory. In our interview, Engelbart claimed it was Bush's concept of a "co-evolution" between humans and machines, and also his conception of our human augmentation system, which inspired him.

In this vision, the machine "remolds" the human mind; it remolds the trails of the user's brain "as one lives and works in close interconnection with a machine": "For the trails of the machine become duplicated in the brain of the user, vaguely as all human memory is vague, but with a concomitant emphasis by repetition, creation and discard … as the cells of the brain become realigned and reconnected, better to utilize the massive explicit memory which is its servant."

Paradoxically, Bush also retreats from this close alignment of memory and machine. In the later essays, he felt the need to demarcate a purely human realm of thought from technics, a realm uncontaminated by technics. The machine "can touch those subtle processes of mind, its logical and rational processes", and alter them; but the logical and rational processes which the machine connected with were human. One of the major themes in these essays is the division of labour between mind and machine: "Two mental processes the machine can do well: first, memory storage and recollection, and this is the primary function of the Memex; and second, logical reasoning, which is the function of the computing and analytical machines." Machines can remember better than human beings can — their trails do not fade, their logic is never flawed. Both of the "mental processes" Bush locates above take place within human thought; they are forms of internal, "repetitive" thought. "Creativity" he reserved as the realm of thought that exists beyond technology: "How far can the machine accompany and aid its master along this path? Certainly to the point at which the master becomes an artist, reaching into the unknown with beauty and versatility, erecting on the mundane thought processes a thing of beauty … this region will always be barred to the machine."

Bush had always been obsessed with memory and technics, as we have explored. But near the end of his career, when machine memory promised to outlast and outperform its human original, he needed a "boundary" between them, between what is personal and belongs to the human alone, and what can be handed over to the machine.

In all versions of the Memex essay, the machine was to serve as a personal memory support. It was not a public archive: users would build their own collections, purchasing items on microfilm or dropping them into their archive via an electro-optical scanning device. In the later adaptive Memex, these trails fade out if not used, and if much in use, the trails become emphasized.
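That last mechanism (reinforcement through use, fading through neglect) can be mimicked in a few lines. The sketch below is a free interpretation of the adaptive Memex; the decay constants are arbitrary, and nothing here corresponds to an actual specification by Bush.

```python
class AdaptiveTrail:
    """Sketch of the later, adaptive Memex: each link carries a weight
    that decays over time and is reinforced every time it is used."""

    DECAY = 0.9       # per time step, unused links fade
    REINFORCE = 1.0   # boost added each time a link is followed
    FORGOTTEN = 0.05  # below this weight, a link effectively vanishes

    def __init__(self):
        self.weights = {}  # (from_item, to_item) -> strength

    def follow(self, src, dst):
        """Using a link emphasizes it."""
        key = (src, dst)
        self.weights[key] = self.weights.get(key, 0.0) + self.REINFORCE

    def tick(self):
        """One step of forgetting: every link fades; faint ones vanish."""
        self.weights = {k: w * self.DECAY
                        for k, w in self.weights.items()
                        if w * self.DECAY >= self.FORGOTTEN}

trail = AdaptiveTrail()
trail.follow("bows", "elasticity")
for _ in range(30):  # a month of neglect
    trail.tick()
print(trail.weights)  # the unused link has faded away: {}
```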
Current hypertext technologies are not quite so private, and tend to emphasise systems which are public rather than personal in nature and that privilege the static record over adaptivity. Bush's concept of a "personal" machine to amplify the mind also flew in the face of the emerging paradigm of human-computer interaction that reached its peak in the late 1950s and early 1960s, which held computers to be rarefied calculating machines used only by qualified technicians in white lab coats, in air-conditioned rooms, at many degrees of separation from the user. "After the summer of 1946," writes Ceruzzi, "computing's path, in theory at least, was clear"; this paradigm was so entrenched that the very idea of a free interaction between users and machines as envisioned by Bush was viewed with hostility by the academic community.

In all versions of the essay, Memex remained profoundly uninfluenced by the paradigm of digital computing. As we have explored, Bush transferred the concept of machine learning from Shannon, but not information theory; he transferred neural and memory models from the cybernetic community, but not digital computation. The analogue computing discourse Bush and Memex created never "mixed" with digital computing: "While the pioneers of digital computing understood that machines would soon accelerate human capabilities by doing massive calculations, Bush continued to be occupied with extending, through replication, human mental experience."

Consequently, the Memex redesigns responded to the advances of the day quite differently to how others were responding at the time. By 1967, for example, great advances had been made in digital memory techniques. As far back as 1951, the Eckert-Mauchly division of Remington Rand had turned over the first commercially produced stored-program digital computer, the UNIVAC, to the US Census Bureau. Delay lines stored 1,000 words as acoustic pulses in tubes of mercury, and reels of magnetic tape which stored invisible bits were used for bulk memory. This was electronic digital technology, and it did not mirror or seek to mirror "natural" processes in any way. It steadily replaced the most popular form of electro-mechanical memory from the late 1940s and early 1950s: drum memory, a large metal cylinder which rotated rapidly beneath a mechanical head, where information was written across the surface magnetically.

Bush, however, remained enamoured of physical recording and inscription. His 1959 essay proposes using organic crystals to record data by means of phase changes in molecular alignment: "[I]n Memex II, when a code on one item points to a second, the first part of the code will pick out a crystal, the next part the level in this, and the remainder the individual item." "The brain does not operate by reducing everything to indices and computation," Bush wrote.

Memex became an image of potentiality for Bush near the end of his life. In the later essays, he writes in a different tone entirely: Memex was an image he would bequeath to the future, a gift to the human race. The 1967 essay takes "a long look ahead"; "the time has come to try it again": "No memex could have been built when that article appeared. In the quarter-century since then, the idea has been with me almost constantly, and I have watched new developments in electronics, physics, chemistry and logic to see how they might help bring it to reality." That "day has come far closer" "in the interval since that paper" first appeared.

For most of his professional life, Bush had been concerned with augmenting human memory and preserving information that might be lost to human beings. He had occasionally written about this project as a larger idea which would boost "the entire process by which man profits by his inheritance of acquired knowledge". Near the end of his life, he thought of Memex as more than just an aid to individual memory; the "ultimate [machine] is far more subtle than this". What it promised was "immortality in a machine", a "longevity" of thought over the individual human mind: "Can a son inherit the memex of his father, refined and polished over the years, and go on from there? In this way can we avoid some of the loss which comes when oxygen is no longer furnished to the brain of the great thinker, when all the patterns of neurons so painstakingly refined become merely a mass of protein and nucleic acid? Can the race thus develop leaders, of such power and intellect, and such forces of conviction, that the world can be saved from its follies? This is an objective of far greater importance than the conquest of disease, even than the conquest of mental aberrations."

Bush died on June 30, 1974. The image of Memex has been passed on beyond his death, and it continues to inspire a host of new machines and technical instrumentalities. But Memex itself has never been built; it exists only on paper, in technical interpretation and in memory. All we have of Memex are the words Bush assembled around it in his lifetime and the drawings created by the artists who illustrated the essays. Had it been built, the "use function" of the machine would itself have changed as it demonstrated its own potentials; the object would have invented itself independently of the outlines Bush cast on paper. This never happened. Memex has instead entered the intellectual capital of new media as an image of potentiality.
the phrase in witness 1 is contained by one element, while the phrase in witness 2 is contained by two elements. However, structural variation does not only occur across documents: when an author indicates the start of a new chapter or paragraph by inserting a metamark of some sort, this is arguably a form of structural intradocumentary variation.

To summarise, we can distinguish different forms of textual variance. Variation can occur on the level of the text characters (linguistic or semantic variation) and on the structure of the text (sentences, paragraphs, etc.). Furthermore, we distinguish between intradocumentary variation (within one witness) and interdocumentary variation (across witnesses). Arguably all forms are relevant for textual scholarship, but taking them into account when processing and comparing texts has both technical and conceptual consequences. These consequences have been discussed extensively elsewhere (Bleeker et al. 2018) and will be briefly repeated in section 5 below. The main goal of the present article is to focus on the question of visualisation. Assuming we have a software program that compares texts in great detail, including structural information and in-witness revisions, how can we best visualise its output? First and foremost, the additional information (structural and linguistic, intradocumentary and interdocumentary) needs to be visualised in an understandable way. The visualisations can be useful for a wide range of research objectives, such as (1) finding a change in markup indicating structural revision like sentence division, (2) presenting the different paths through one witness and the possible matches between tokens from any path, (3) examining complex revisions, like a deletion within a deletion within an addition, and (4) studying patterns of revision. This raises the question: is it even possible or desirable to decide on one visualisation? Is there one ultimate visualisation that reflects the dynamic, temporal nature of the textual object(s) by demonstrating both structural and linguistic variation on an intradocumentary and interdocumentary level? The existing field of Information Visualisation can certainly offer inspiration, but simply adopting its methods and techniques will not suffice, since it deals primarily with objects which are "self-identical, self-evident, ahistorical, and autonomous" (Drucker 2012), adjectives which could hardly be applied to literary texts.

4 Existing Visualisations of collation results

Let us consider the various existing visualisations of collation output and explore to what extent they address the conditions outlined above. We can distinguish roughly five types of visualisation: alignment tables, parallel segmentation, synoptic viewers, variant graphs, and phylogenetic trees or "stemmata". A small example, the collation of two fragments from Woolf's A Sketch of the Past (holograph MS-VW-SoP and typescript TS1-VW-SoP), serves to illustrate the effect of the visualisations:

Witness 1 (MS-VW-SoP): with the boat train arriving, people talking loudly, chains being dropped, and the screws the beginning, and the steamer suddenly hooting

Witness 2 (TS1-VW-SoP): with the boat train arriving; with people quarrelling outside the door; chains clanking; and the steamer giving those sudden stertorous snorts

These two small fragments are transcribed in plain text format and subsequently collated with the software program CollateX.
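For readers who want to reproduce this step, the collation can be run with the Python port of CollateX roughly as follows. This is a minimal sketch assuming the Python collatex package and its documented plain-witness API; option names may differ across versions.

```python
# pip install collatex  (the Python port of CollateX)
from collatex import Collation, collate

collation = Collation()
collation.add_plain_witness(
    "MS-VW-SoP",
    "with the boat train arriving, people talking loudly, chains being "
    "dropped, and the screws the beginning, and the steamer suddenly hooting")
collation.add_plain_witness(
    "TS1-VW-SoP",
    "with the boat train arriving; with people quarrelling outside the door; "
    "chains clanking; and the steamer giving those sudden stertorous snorts")

# Plain-text alignment table; segmentation=False keeps one token per cell.
table = collate(collation, layout="vertical", segmentation=False)
print(table)
```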
Unless indicated otherwise, the result of this collation forms the basis for the visualisation examples below.

4.1 Alignment table

An alignment table presents the text of the witnesses in linear sequence (either horizontally or vertically), making it well suited to studying the relationships between witnesses at a detailed level, but less so to acquiring an overview of patterns in revision. Note that "aligned tokens" are not necessarily the same as "matching tokens": two tokens may be placed above each other because they are at the same relative position between two matches, even though they do not constitute a match. For this reason, alignment tables often have additional markup (e.g. colours) to differentiate between matches and merely aligned tokens. The arrangement of the tokens is also one of the advantages of an alignment table: it shows at first glance the variation between tokens at the same relative position. In other words, this representation indicates tokens which match on a semantic level, such as synonyms or fragments with similar meanings, like "talking loudly" and "quarrelling outside the door" (Fig. 2). Ongoing research into the potential of an alignment table visualisation to explore intradocumentary variation (see Bleeker et al. 2017, visualisations created by Vincent Neyt) focuses on increasing the amount of information in an alignment table by incorporating intradocumentary variation in the cells. The alignment table in Fig. 3 shows that witness 1 (Wit1) contains several paths; matching tokens are displayed in red.

4.2 Synoptic viewers

A synoptic edition contains a visual representation of the collation results from the perspective of one witness, where the variants are indicated by means of a system of signs or diacritical marks. In contrast to an alignment table, a synoptic overview is better suited to examining overall patterns of variation. The following paragraphs discuss two ways of presenting textual variation synoptically: parallel segmentation and an inline apparatus. Both are skeuomorphic in character, in the sense that they mimic the analogue examination and presentation of textual variants. This characteristic should not necessarily be considered negative, however, precisely because it is a tried and tested instrument for textual research.

4.2.1 Parallel segmentation

The term "parallel segmentation" may be confusing, as it is also the name of the (TEI) encoding for a critical apparatus. In this context, parallel segmentation is used to describe the visualisation of textual variation in a side-by-side manner, often with the corresponding segments linked to one another. The quantity of online, open source tools for a parallel segmentation visualisation suggests that it is a popular way of studying textual variation (e.g. the Versioning Machine, the Edition Visualisation Technology (EVT) project, and the visualisation of Juxta Commons). As Fig. 4 shows, parallel segmentation entails presenting the witnesses as reading texts in separate panels which can be read vertically (per witness) or horizontally (interdocumentary variation across witnesses). Colours indicate the matching and non-matching segments. To be clear: this parallel segmentation visualisation concerns the presentation of variance; it is not a collation method in and of itself. The segments are encoded by the editor, for instance using the TEI app and rdg elements.
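Since the discussion above distinguishes matching from merely aligned tokens, it may help to see how collation output can be post-processed for presentation. The sketch below again assumes the Python collatex package and the JSON shape its documentation describes (a "table" holding one row of cells per witness); the match-marking logic is our own illustration, not a feature of CollateX itself.

```python
import json
from collatex import Collation, collate

collation = Collation()
collation.add_plain_witness("Wit1", "the steamer suddenly hooting")
collation.add_plain_witness("Wit2", "the steamer giving those sudden snorts")

# JSON output decouples collation from presentation: the same result can
# feed an alignment table, a synoptic view, or colour highlighting.
result = json.loads(collate(collation, output="json", segmentation=False))

for column in zip(*result["table"]):  # one column per aligned position
    texts = ["".join(tok["t"] for tok in cell).strip() if cell else "-"
             for cell in column]
    mark = "match" if len(set(texts)) == 1 and "-" not in texts else "variant"
    print(f"{mark:8} {' | '.join(texts)}")
```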