Web and Social Media Preservation: Capturing Today’s Websites for Future Archival Research

Smithsonian Libraries and Archives

Object Details

Creator
Smithsonian Institution Archives
Description
Stefana Breitwieser, Intern, Digital Services Division

Websites are important records of institutional history, but they are also always being updated, redesigned, or taken down. How do we access important information from outdated versions of websites? The Archives is currently using Archive-It, a tool created by the Internet Archive, to capture Smithsonian websites and social media accounts for future use. Archive-It uses a crawler - a program that browses the Internet much as Google does - to replicate a website as it existed at a specific moment. These “crawls” are later accessible using the Wayback tool. While the research potential for these crawls is enormous, two uses stand out in particular: documenting the evolution of website features, and capturing public participation during a specific event or program through social media.
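
For readers curious about what a crawler does under the hood, here is a minimal sketch in Python. It is an illustration only and not Archive-It's actual crawler: the `crawl` function, the `LinkParser` class, and the tiny in-memory `SITE` are hypothetical names made up for this example, and the "site" is a dictionary so that the sketch runs without touching the network. The idea is simply to start from one page, follow its links to other pages on the same host, and keep a copy of each page as it looked at that moment.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkParser(HTMLParser):
    """Collects the href of every <a> tag on a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl(seed, fetch, max_pages=50):
    """Breadth-first crawl from seed, staying on the seed's host.

    fetch is any function mapping a URL to HTML text (or None if missing).
    Returns a dict of URL -> HTML: a snapshot of the site at crawl time.
    """
    host = urlparse(seed).netloc
    snapshot = {}
    queue = deque([seed])
    while queue and len(snapshot) < max_pages:
        url = queue.popleft()
        if url in snapshot:
            continue  # already captured
        html = fetch(url)
        if html is None:
            continue  # page could not be retrieved
        snapshot[url] = html
        parser = LinkParser()
        parser.feed(html)
        for href in parser.links:
            target = urljoin(url, href)  # resolve relative links
            if urlparse(target).netloc == host and target not in snapshot:
                queue.append(target)
    return snapshot


# Hypothetical two-page site held in memory so the sketch runs offline.
SITE = {
    "http://example.org/": (
        '<a href="/news.html">News</a> '
        '<a href="http://other.example/">Away</a>'
    ),
    "http://example.org/news.html": '<a href="/">Home</a>',
}

snapshot = crawl("http://example.org/", SITE.get)
print(sorted(snapshot))
```

The link to the other host is skipped, so the snapshot holds only the two pages of the example site. A production crawler also has to handle images, scripts, stylesheets, and politeness rules, none of which this sketch attempts.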

A screenshot of the website for the Virtual Echinoderm Newsletter, crawled June 25, 2014, Accession 14-260 - National Museum of Natural History, Website Records, 1996-2014, Smithsonian Institution Archives.

Crawls show how the use of technology has progressed and how websites have evolved over time. Above and below, we have two examples from the National Museum of Natural History (NMNH). This is the Virtual Echinoderm Newsletter, which was last updated in 2002. Though it may seem simplistic to us today, it is representative of a typical website from the early 2000s.

A screenshot of the website for the Virtual Echinoderm Newsletter, crawled June 25, 2014, Accession 14-260 - National Museum of Natural History, Website Records, 1996-2014, Smithsonian Institution Archives.

Fast-forward to 2014 and the new Human Origins Initiative website: we have a slideshow of features, live updates from Facebook and Twitter, and a text box that allows visitors to participate in the project - all located on the first page. While both of these sites are fairly typical of the years in which they were created, they also demonstrate how much websites have changed in just over a decade.

A screenshot of the website for the Human Origins Initiative, crawled November 22, 2013, Accession 14-079 - National Museum of Natural History, Website Records, 2013, Smithsonian Institution Archives.

The Archive-It tool is also being used to capture certain programs and events using social media. A great example of this is the crawl of the National Museum of American History’s #HistoryTalkBack Tumblr page. This site documented an ongoing project at the museum where curators invited visitors to respond to a question every day and to post their answers on a wall at the museum. The Tumblr page broadcast some of the favorite posts and then invited commenters to respond to the question as well. We were pleased with the amount of public participation captured in our crawl - not only do we have the visitors’ comments, but because the site is Tumblr-based, we also captured the number of likes and re-blogs. Now that this site is defunct, this crawl becomes important for documenting the scope and impact of this project.

A screenshot of the website for the NMAH #TalkBackHistory Tumblr, crawled June 6, 2013, Accession 14-039 - National Museum of American History, Website Records, 2011-2013, Smithsonian Institution Archives.

I especially like these social media crawls. Social media - instantaneous, constantly updated, and therefore often thought of as transient - is transformed into something more lasting. By looking at crawls from blogs, Facebook, Twitter, Tumblr, and Flickr, we can examine the public’s response to a project and the strategies museums use to engage with their audiences. The #HistoryTalkBack crawl shows this. Tumblr users spread these images, sharing the posts to express their own love of history to friends and followers, while the National Museum of American History used this platform to engage both its real-life and virtual visitors. Capturing these moments using social media gives us a greater understanding of how the public participates in museum programs, and also how museums reach out to people.

A screenshot of the website for the NMAH #TalkBackHistory Tumblr, crawled June 6, 2013, Accession 14-039 - National Museum of American History, Website Records, 2011-2013, Smithsonian Institution Archives.

The Archive-It tool holds incredible potential for the coming years, especially as the Archives continues to grow. If you’d like to learn more, you can check out the Archives’ Archive-It crawls.

Related Resources

  • Smithsonian Now Using Archive-It to Crawl Websites, The Bigger Picture blog, Smithsonian Institution Archives
  • Connecting the Dots: Issues with Preserving Complex Websites, The Bigger Picture blog, Smithsonian Institution Archives
  • Saving the Smithsonian’s Web, The Bigger Picture blog, Smithsonian Institution Archives

Related Collections

  • Accession 14-039 - National Museum of American History, Website Records, 2011-2013, Smithsonian Institution Archives
  • Accession 14-079 - National Museum of Natural History, Website Records, 2013, Smithsonian Institution Archives
Blog Categories: 
Behind the Scenes
Blog Tags: 
Archive
Web/Tech
Published Date
Tue, 12 Aug 2014 11:00:00 +0000
Type
Blog posts
Smithsonian staff publications
Blog posts
Smithsonian Institution Archives
Topic
Archive
Record ID
posts_d6afd3dbb537a1d8299dd7e454391167
Metadata Usage (text)
Usage conditions apply