Love Data Workshop: What You Missed

Today’s post was written by library student assistant Judite do Bem Sampaio. Judite is an international student from Portugal double-majoring in economics and entrepreneurship. 

The William H. Hannon Library’s “Love Data” workshop explored how data is collected, preserved, analyzed, and sometimes lost, emphasizing the responsibility of researchers, institutions, and libraries in protecting access to information. Hosted as part of International Love Data Week, the session was structured around three main segments led by different hosts, each focusing on a key area of data work: the Internet Archive and Wayback Machine, the ICPSR database, and Statista.

The first portion of the workshop focused on the Internet Archive and the Wayback Machine, presented in the context of over 30 years of digital preservation. Participants were shown how archived webpages allow users to trace what has been removed, edited, or newly added over time. Visual comparisons highlighted changes to public-facing information, including government agency pages, reinforcing the idea that digital content is not permanent. This section introduced the concept of data fragility. The reality that information can disappear or be altered without preservation efforts and demonstrated how tools like the Wayback Machine can help document and safeguard historical records of the internet.

The second segment introduced ICPSR, a major social science data repository used widely in academic research. The workshop explained why ICPSR is considered a rigorous and citable source, particularly for students conducting empirical work. Participants learned that datasets from ICPSR can be downloaded and analyzed using software such as Stata, Microsoft Excel, and other data tools. The emphasis here was on responsible data use, not only finding information, but understanding its structure, credibility, and application in research.

The final portion focused on tools and initiatives designed to preserve at risk data, including projects like DataLumos and broader data rescue efforts. A key example showed how preservation initiatives began in 2017 with just over 100 datasets and have grown to more than 400 by 2025 through crowdsourced contributions supported by librarians and data professionals. A climate change case study illustrated how preserved datasets continue to support research and public understanding even when original sources change or disappear.

The session also included a live demonstration of Statista as a research tool, showing how students can access industry, market, and societal data through LMU’s library subscription. Presenters walked through how to search by keyword, browse statistics by industry, and download charts and datasets in multiple formats for academic use. Some engaging examples including charts on topics like “worst Valentine’s Day gifts” helped illustrate how accessible and applicable this data can be for both coursework and real-world analysis.

Across all three segments, a consistent message was made clear: data is fragile, and access cannot be taken for granted. Information must be intentionally preserved, properly cited, and critically evaluated. The workshop highlighted how libraries serve as stewards of this process, helping students navigate reliable sources and maintain long-term access to knowledge.

For those interested in exploring the tools and databases introduced, additional resources are available through the library’s LibGuides. Students can learn more through the Love Data Workshop guide and the Statistics & Demographics guide, which provide direct access to databases, research tools, and support for working with data.

It is important that we, as students, recognize the value of these tools and not take them for granted. They guide our research not only academically, but also in how we navigate information and make decisions. Stay informed!