Korey Jackson

Publishing | Digital Humanities

suzbookg2.jpg

MLA Special Session Accepted! “Scaling and Sharing: Data Management in the Humanities”

Excited to be taking in part in the upcoming MLA convention in Boston! Our special session, “Scaling and Sharing: Data Management in the Humanities,” will be part of a series of conversations about all things lang, lit, and (open) access–in keeping with the convention theme. Here’s the submitted abstract, which provides a good overview of what will be on offer during the session:

In 2006 the American Council for Learned Societies released a report titled Our Cultural Commonwealth summarizing the promises and challenges of “big data” within the humanities and social sciences. The radical growth of computing, networking, and digital storage promised (or at least prefaced) a new era of “cumulative, collaborative, and synergistic” scholarship. And as we’ve seen in the half-dozen years since the report was issued, much of this promise has been realized. Examples include inter-institutional projects like those sponsored by the Digging into Data program, administered by the NEH’s Office of Digital Humanities; the Mellon-funded Project Bamboo, designed to become a content management and collaboration hub for IT and humanities researchers; and massive data collection undertakings like the Shoah Foundation’s Visual History Archive, a collection of nearly 52,000 testimonies from Holocaust and other genocide survivors.

Of course, most humanities research datasets don’t begin to approach this kind of scale. Single researchers and research teams working with local materials, databases, and storage are still very much the norm. The questions that this roundtable focuses on, then, are: How do we define and support good humanities data practices at the individual and local level? What options exist for managing and distributing data? And how can we ensure that individual datasets are open, interoperable, and accessible by as many researchers as possible? Presenters will address these questions from a number of angles, considering everything from the theoretical redefinitions of what counts as data in humanities research to the tools and expertise necessary for capturing and distributing useful data.

As of a term of art, data has the tendency to conjure up hard stats and bean counting inimical to the “softer side” of the hermeneutic process. However, rather than simply replicating this divisive approach to datasets, or urging humanistic practice to shed its reliance on close reading and take up with cold, hard positivism, this panel seeks to query precisely what data can be and do in a humanities context. One particular type of standardized information set familiar to humanities scholars is the research bibliography. Panelists will discuss how shared bibliographies can become not simply the intertextual foundations of good research, but data points for the creation of new knowledge. In particular, Spencer Keralis, Digital Scholarship Research Associate and CLIR Postdoctoral Fellow at the University of North Texas, will discuss his collaboration on the Susanna Rowson Digital Compendium. The Compendium is designed to pioneer the use of bibliographic data as a core information set for research. It is precisely because bibliographies adhere to specified formats that the data they express can be used for GIS mapping, temporal animations, network visualization, and other digital (and analog) heuristics. Keralis will demonstrate the potential value of reconceiving the bibliography as an information set to drive new work in literary history and history of the book.

Data management has long been a priority for many grant programs offered by the National Endowment for the Humanities (NEH). 2011 became a banner year for formal data management plans. Along with the National Science Foundation (NSF), programs offered by the NEH Office of Digital Humanities began requiring separate data management plans for all grant submissions to programs like the popular Digital Humanities Start-Up Grant program and the new Digital Humanities Implementation Grant program. These plans must describe how the project team will manage and disseminate data generated or collected by the project. Jason Rhody, Senior Program Officer for the Office of Digital Humanities at NEH, will discuss elements now considered fundamental to the data management plan: descriptions of data types, how data will be managed and made available to others, the formal mechanisms and software for data sharing, legal and ethical restrictions relevant to the dataset, and metadata standards to be employed. Rhody will also address the overall decision to require formal data management plans in its humanities grants and the results of the first full year of their inclusion in the submission process.

Presenters are also deeply invested in helping new forms of humanistic data find accessible formats. The use of standard lexical and structural markup, tagging, and other kinds of descriptive indentifiers has the potential to move data from the siloed and idiosyncratic space of the individual research project into the realm of inter-institutional and interdisciplinary collaboration. Michael Ullyot, Assistant Professor of English at the University of Calgary, will describe his endeavors to apply structural metadata to the 865,185 words of Shakespeare’s complete works. In doing so, Ullyot is working to create a training set for Natural Language Processing (NLP) algorithms to automate the markup of all early modern English. In a similar vein, Lisa Rhody, Ph.D. Candidate in English at the University of Maryland, will reflect on the process of building a data collection and management plan from the ground up. Her project, “Review, Revise, Requery: New Methods for Studying Ekphrasis”–supported by a Maryland Institute for Technology in the Humanities (MITH) Winnemore Dissertation Fellowship–required collaborating with the MITH team to engineer an approach to tagging, markup, and version control with the specific aim of building a digital collection of modern poetry suitable for text analysis, including topic modeling and classification.

Finally, through XML, metadata standards, and version control systems like Git, panelists examine how metadata can be deployed to make data accessible to diverse end users. Matt Burton, Ph.D. Candidate in the School of Information at the University of Michigan, and Korey Jackson, Digital Publishing Coordinator and CLIR Postdoctoral Fellow also at Michigan, will examine Git and the popular software development community platform GitHub. Their talk explores Git and GitHub from the perspective of the humanities scholar, asking how Git might best be applied to digital humanities projects and what kinds of training are necessary to make version control a commonplace practice in humanities research.

Overall, in outlining the migration from individual project to scalable dataset, this roundtable explores “big data” not simply as a matter of size or number, but more importantly as a process of granting researchers and educators access to shared information resources.

Hope to see many DH (and good ol’ H) friends there!

Announcing digitalcultureweek (DCW)

(Reposted from the digitalculturebooks blog)

What it is
Fortunately for those of us involved in the digital humanities and new media studies, the information landscape is filled with thought-provoking commentary about research, tools, datasets, new distribution networks, and new ways of legitimating these networks. Of course, riches like this often come at the cost of valuable time. Enter that dreaded word: overload. It’s impossible to read everything. Sometimes it’s a challenge just to read a few things well. This is where digitalcultureweek comes in. DCW will be a weekly space to find syntheses of field-specific material, to read about and discuss trends, hot-button topics, highlights, lowlights, and everything in between. A place, in other words, to make a narrative out of an info stream.

Who it’s for
I’m a tab fiend. I mean my penchant for accumulating browser tabs throughout the day. The thing is, I can’t get myself to close down an article, essay, Storify list, whatever, until I feel like I’ve mined, stored, and curated every piece of information on offer. Sure, I use Pinterest, Delicious, Evernote, Zotero, Hootsuite, a cardboard box full of old receipts with urls scribbled on the backs in barely-legible golf pencil—but these tools, while wonderfully useful (well, barring the box) don’t tend to grant a sense of narrative…rising action, falling action, and, maybe most importantly, resolution. digitalcultureweek is aimed at the similarly narrative-deprived, the tab fiends looking for their week’s dénouement.

How it works
The plan for digitalcultureweek is to put together a stable corral (or a stable stable) of writers who forage in the same DH/new media/scholarly communication turf that so many of us tread. Digital Humanities Now, Scholarly Kitchen, DH Q&A, the new Journal of Digital Humanities, The Chronicle of Higher Ed’s ProfHacker, the many high-quality personal and professional blogs that command attention week after week—all of these are places where DH and new media discourse are being forged and shaped. digitalcultureweek is looking to become a weekly roundtable, an informal Q&A about the week’s doings on sites like those listed above. Think one part McGlaughlin Group, one part Diane Rehm’s Friday News Roundup…and one part Duck Dynasty (okay, not really).

Here’s how it works: digitalculturebooks will tweet out a series of discussion questions throughout the week (using the hashtag #DCWQuestions) that refer to specific contributions from the DH community. On Friday we’ll post responses to these questions from the DCW writing group and from other select Twittersphere contributors. Of course, everyone is welcome to comment on posts or make suggestions for future reading, discussion questions, etc.

An open invitation
We invite anyone interested in becoming a contributor to dcb’s digitalcultureweek to send us a brief introduction/bio (~200 words) along with your particular area of interest or expertise…or a specific ‘beat’ you’d like to cover. Inquiries should be sent to digitalcultureweek@umich.edu. Please remember to include your Twitter handle in the email message (and be sure to follow @Mdigitalculture on Twitter).

digSIG: Visualize This – An Introduction to GIS

The Digital Scholarship Special Interest Group  looks forward to its last meeting of the semester, which will be held on Wednesday, April 25 from 2:30 – 4:00 in the Shapiro Undergraduate Library, Rm 4041. We’ll be hosting Justin Joque, Spatial and Numeric Data Librarian at U-M, for a workshop and discussion about GIS (geographic information system) and data visualization tools and what we can learn (or unlearn) from the use of new techniques for mapping data.

This will be part skill-building exercise and part critical discussion about uses of data and visualization in the humanities & social sciences. As one example, Justin will explore how U.S. Census data can be used as the basis for provocative interrogations into social categorization and critical race studies. Hope to see you there!

UM Digital Scholarship Special Interest Group (digSIG)

Just a quick announcement that the inaugural meeting of the Digital Scholarship Special Interest Group (digSIG) will be held in North Quad Room 2435 on Monday, March 26th from 4-5:30pm. This will be an excellent opportunity for interested members of the U-M library community and other university departments to share ideas and meet with faculty from the recent LSA Digital Enviroments Cluster Hire (Megan Ankerson from Communication Studies, Finn Brunton from the School of Information, John Cheney-Lippold from American Culture, and Tung-Hui Hu from English).

To kick things off, Aaron McCollough (Librarian or English and Comparative Literatures) and I will offer a rendition of our evolving “Introduction to Digital Humanities” talk. We’re hoping to set the stage for discussion of potential programming, projects, readings, training sessions, etc. around the theme of digital scholarship at U-M.

Feel free to contact DIGSig organizers Korey (kbjack@umich.edu), Justin (jjoque@umich.edu) or Aaron (amccollo@umich.edu) with questions.

Hope to see you there!

MLA Call for Papers: Data Management in the Humanities

MLA #MLA13 BostonIn 2006 the American Council for Learned Societies released a report titled Our Cultural Commonwealth summarizing the promises and challenges of “big data” within the humanities and social sciences. The radical growth of computing, networking, and digital storage promised (or at least prefaced) a new era of “cumulative, collaborative, and synergistic” scholarship. And as we’ve seen in the half-dozen years since the report was issued, much of this promise has been borne out. Examples include inter-institutional projects like those sponsored by the Digging into Data program (administered by the NEH’s Office of Digital Humanities); the Mellon-funded Project Bamboo (designed to become a content management and collaboration hub for IT and humanities researchers); and massive data collection undertakings like the Shoah Foundation’s Visual History Archive (a collection of nearly 52,000 testimonies from Holocaust and other genocide survivors).

Of course, most humanities research datasets don’t begin to approach this kind of scale. Single researchers and research teams working with local materials, locally created databases, and local storage are still very much the norm. The question that this roundtable talk focuses on, then, is: How do we define and support good humanities data practices at the individual and local level?

Presenters are encouraged to take a step back from “big” and ask how scholars, librarians, and technologists can help foster better local data collection, storage, and distribution in order to build research practices that promote multi-disciplinary and multi-institutional synergies from the ground up. By sharing local instances of data management, we hope to explore big data as a process of “building toward,” rather than a monumental or sui generis product.

Questions that might be addressed include:

  • What counts as humanities data? The term data is unsettling for many scholars in part because it connotes something definitive and unproblematic. Where humanities scholarship often thrives on complication and constructivism, data seeks repeatability and finality. Datasets are construed as a kind of incontestable bedrock..which, to some, make them not only a little boring, but dangerously and deceptively boring. Is there a way for humanities researchers to have our constructivist cake and eat it, too? Can we, in other words, productively question the constructedness of datasets even as we assemble them? And can we expand the kinds of information that constitute data?
  • Metadata and Occam’s razor. When it comes to metadata, there are any number of fields to fill in, tags to apply, descriptors to append…and not all of them are useful. Or rather, it’s difficult to know what metadata will be useful to current and future researchers and, for this reason, difficult to know when and where to stop. What are best practices for metadata? Is there a standard (Dublin Core, say) that ought to be adhered to? What are the benefits and drawbacks of standardization?
  • Copyright and fair use. Compiling data is one thing. Being able to use it legally is another. Come discuss obstacles, strategies, and successes dealing with copyright and use issues.
  • Grants and funding. Have you successfully (or unsuccessfully) applied for grant funding that requires a data management / preservation strategy? We welcome conversations about how to articulate data management as a component of the grant application process. Funding is also an issue when it comes to supporting the programmers, designers, project managers, and copyright lawyers that may need to be part of a data management team. How do diverse institutions budget these costs? What experiences have you had seeking institutional in-kind support or funding for your own projects?

The roundtable will feature as many as eight presenters and is open to scholars from across the humanities. All presentation formats are welcome, but do let organizers know if you have specific technology needs.

Please send 250 word abstracts and a brief bio to kbjack@umich.edu or spencer.keralis@unt.edu by Monday, March 19th.

Keep in mind that all panelists will need to be registered MLA members (or have their membership waived) on or before April 7th.

Introducing the Digital Humanities

I wanted to share the Prezi portion of the workshop that Aaron McCollough (Subject Librarian for English and Comparative Literature) and I offered recently at the Teaching and Technology Collaborative.

The presentation, “Introducing the Digital Humanities,” was a whirlwind tour of new large-scale databases and tools for conducting and storing research, and a demonstration of some of the interactive platforms for broadcasting and publishing findings. We examined the digital humanities as a dynamic field (or perhaps “set of practices” is a better descriptor) that relies on and benefits from the scale of  the Information Age and which, in equal measure, provides a uniquely humanist take on the  new media making such scale possible.

We hope everyone who attended benefitted in some way from this first step toward increasing the “DH” profile at the University. Stay tuned for follow-up performances in the weeks and months to come!

Below you’ll find a list of resources mentioned in the presentation:

Overview

Tools 

Labs

Crowdsourcing

Publications

Open Access

Organizations

Conferences

Discipline-specific Organizations

Other Resources

Vetting and Funding (a good source for exemplary projects)

Training

Post Navigation