The Archives Unleashed Cloud

This spring, we mark a few special anniversaries for the Archives Unleashed Project! It’s been four years since the start of the project, and this is our third year of running the Archives Unleashed Cloud.

With the support of the Andrew W. Mellon Foundation, our focus has been on developing open-source analysis tools to help make web archives more accessible. One of our primary goals has been “to make petabytes of historical internet content accessible.” Thanks to our community of Cloud users and collaborators, we’ve been able to make that goal a reality.

As we sunset the Cloud at the end of…



Earlier this year, the Archives Unleashed Project announced the Cohort Program, which aims to support web archives research by providing resource support and mentorship. Starting in July, cohorts will engage in collaborative activities and conduct focused research that explores a variety of web archive collections.

We are pleased to introduce the five teams that will make up our inaugural Cohort Program.

AWAC2 — Analysing Web Archives of the COVID Crisis through the IIPC Novel Coronavirus dataset

Valérie Schafer, University of Luxembourg (LU)

Karin De Wild, Leiden University (NL)

Frédéric Clavert, University of Luxembourg (LU)

Niels Brügger, Aarhus University (DK)

Susan…



As North America heads into the 15th month of the COVID pandemic, we are all too aware of the depth and breadth of impacts the pandemic has had on every sector.

In March 2020, our project planned to host our final datathon event at Columbia University, New York, NY. Having monitored ever-growing outbreaks and witnessed the domino effect of cancelled scholarly events, the team proactively shifted our event online.

In reflecting on this experience, our open-access article, Building community at a distance: A datathon during COVID-19, discusses the implications around transitioning events online, provides practice recommendations to event organizers, and…


By: Archives Unleashed and project collaborators

Web archives play a critical role for scholars studying the 1990s onwards. However, gaps in available analytics tools, community infrastructure, and accessible web archival interfaces present high barriers to conducting research with web archives at scale.

Established in 2017 with support from The Andrew W. Mellon Foundation, the Archives Unleashed Project developed open-source analytical tools and community resources, and hosted collaborative learning events, to address the challenges researchers face in accessing, using, and exploring web archives. The marquee product of this first phase was the Archives Unleashed Cloud.

This work continues apace in a second phase (2020–2023)…


Written by Samantha Fritz (Project Manager, Archives Unleashed Project) on behalf of the Archives Unleashed team.

This piece has been cross-posted with the International Internet Preservation Consortium Blog.

The web archiving world blends the work and contributions of many institutions, groups, projects, and individuals. The field is witnessing work and progress in many areas, from policies, to professional development and learning resources, to the development of tools that address replay, acquisition, and analysis.

For over two decades, memory institutions and organizations around the world have engaged in web archiving to ensure the preservation of born-digital content that is vital to…


Fostering Community Engagement through Datathon Events: The Archives Unleashed Experience

Archives Unleashed Washington, DC. Datathon | Gelman Library, George Washington University, 2019. Photo by Samantha Fritz

Our recent open-access article “Fostering Community Engagement through Datathon Events: The Archives Unleashed Experience,” published in Digital Humanities Quarterly, takes a reflective look at the impact of Archives Unleashed Datathons on the professional practices of attendees and community engagement within the web archiving field.

Datathons have been a centerpiece of the Archives Unleashed Project’s community engagement efforts. The four events held as part of our Mellon-funded project engaged over 70 participants from more than 50 institutions, offering hands-on training with analytical tools to explore web archives.

Our article introduces…


Engaging with the Canadian Web Archiving Coalition

Earlier this year, Archives Unleashed had an opportunity to engage with a Canada-based group focusing on web archiving support for practitioners and researchers. The Canadian Web Archiving Coalition (CWAC) is a community that brings together libraries, archives, and other memory institutions across Canada that engage with web archiving. CWAC is part of the Canadian Association of Research Libraries (CARL), which focuses on “identifying gaps and opportunities that could be addressed by nationally coordinated strategies, actions, and services, including collaborative collection development, training, infrastructure development.” (1)

The launch of the World Wide Web created…


Web Archive Datasets via the Internet Archive

Introduction

We have lots of web archival data, but what to do with it? The web archiving ecosystem has many organizations, institutions, projects, and individuals who have captured terabytes of data. These important preservation efforts allow us to explore the recent past. Web archive collections are a critical source for documenting, and in turn studying, our ever-growing online world. We don’t have to look too far back to see how born-digital collections illustrate important social and cultural movements such as #MeToo and #BlackLivesMatter, global health events such as COVID, and various national elections.

We can see that web archives inform fields…


From Archive to Analysis: Access Web Archives at Scale Through a Cloud-Based Interface


In a recently published article, “From Archive to Analysis: Access Web Archives at Scale Through a Cloud-Based Interface,” co-authors Nick Ruest, Samantha Fritz, Ryan Deschamps, Jimmy Lin, and Ian Milligan propose that by moving towards a cloud-based architecture for web archiving analysis, many of the challenges of web archive research can be partially addressed — leaving opportunities for scholars, as well as the broader web archiving community.

We examine how scholars use web archives, emphasizing that these usage models do not line up with the existing command-line and…


#WebArchiveWednesday Network via Netlytic

Thanks to the folks at the International Internet Preservation Consortium (IIPC), the community has an opportunity to contribute to regular Wednesday discussions using the #WebArchiveWednesday Twitter hashtag.

Engaging with this hashtag has given individuals, groups, and organizations a chance to share information, news, and projects with other professionals as well as the public. The focused hashtag also provides an opportunity to support colleagues in the field — their stories, successes, and, more broadly, web archiving discussions.

Looking at how the #WebArchiveWednesday community and conversation have evolved over the past year, the Archives Unleashed project would like to offer congratulations to…

Archives Unleashed

News from the Archives Unleashed Project
