<<< Back to the guide, "Sample ARCH datasets and how to explore them."
Introduction
Web archives contain myriad forms of expression beyond text. You can aggregate their media to enable access more instinctively and intuitively, outside of the search box. This tutorials creates an image gallery that the end user may browse and interact with in order to access and replay the contents of a web archive collection visually.
Used in this tutorial:
- Dataset: Image graph from the Art Galleries web archive collection
- Tools: Palladio
- Time: ~15-20 minutes to complete
Watch
Follow along with a video demonstration of the instructions below:
Instructions
In this section:
Get to know your data
- Locate the .image-graph_Baltimore.csv. file in the ARCH workshop archive and open it with your preferred spreadsheet program (Excel, Calc, Numbers, Sheets, etc.). The file includes all of the data derived from the eight seeds in the Art Galleries collection contributed by the Maryland Institute College of Art, covering the city of Baltimore.
- Take note of the four attributes included in each Image graph download from ARCH. Each row in the spreadsheet represents the characteristics of an image preserved in the archive:
- .crawl_date.: a timestamp representing when each image file was collected.
- .source.: the location of the web page on which the image file appeared when it was collected from the “live” web.
- .url.: the location of the image file when it was collected from the “live” web.
- .alt_text.: the descriptive text that accompanied the image file’s embed markup at the time that both were collected.
- .crawl_date.: a timestamp representing when each image file was collected.
Create an image gallery
- Open Palladio in your preferred web browser here: https://hdlab.stanford.edu/palladio-app
- Drag the CSV file from your local storage into the editor pane at the center of the page.
- Confirm that the data from your spreadsheet appears in the editor as comma-separated values (through line 935 in the case of this sample data file) and click the “Load” button:
- Palladio enables you to represent your data as a map, graph, table, or gallery. For this exercise, let’s select the “Gallery” option in the top navigation bar.
- Locate the hamburger menu icon (≡) at the top-right corner of the screen and click on it to expand your gallery’s settings. Configure the gallery’s dimensions to match the following attributes from your data:
- Click on the .Title. box and select the .crawl_date. option from the ensuing dialog.
- Set the .Text. field to .alt_text..
- Set .Link. to .source..
- Set .Image URL. to .url.. The images should begin to load in the grid on the page:
- Click on the .Title. box and select the .crawl_date. option from the ensuing dialog.
- Click on a sample image or two in the grid to see them render in their original context. Where do they take you?
Interpret the results
- Now let’s sort and filter our collection in order to find the artists or works that interest us.
- Set the .Sort by. value to .source.. This re-organizes the images into the alphabetical order of the pages on which we can find them, by those pages’ URLs.
- Scroll down until you come to the first example of a video installation. Who is the artist? What is the title of the work, where did it exhibit, and when?
- Click on the “Facet” button at the bottom-left corner of the screen to filter your view by one of the dimensions you’ve set from the dataset. In the .Description. field, name this view “Artists.”
- Set the .Dimensions. value at the right side of the screen to .alt_text. in order to filter your current view by the content of the embedded images’ alt text tags.
- Find the .alt_text. values at the bottom-left corner of the screen, listed in descending order of frequency. What artist’s name appears most frequently? Click on it to filter your view to show only their work.
- Re-organize your values alphabetically by clicking on the “↓AZ” button at the top-right corner of the table. Scroll through the results until you find works by Marisa Takal. When were they exhibited? Which other artists were part of this show?
- Set the .Sort by. value to .source.. This re-organizes the images into the alphabetical order of the pages on which we can find them, by those pages’ URLs.
Comments
0 comments
Please sign in to leave a comment.