Skip to content

User Guide

Getting Started

  1. Check out Loading Archived Items to point ReplayWeb.page towards the file you wish to view.
  2. Once your file has been loaded, learn how to navigate the archived content!

Supported Formats

While ReplayWeb.page aims to support a variety of formats for archived web content, for best performance we recommend using WACZ files. To archive webpages to WACZ files on a per-page basis, we recommend ArchiveWeb.page. If you need to archive entire websites, check out Browsertrix.

WARC? WACZ? What's the difference?

Due to the nature of the WARC format, the entire file must be read on first use to generate an index — a list of all the pages and content available in the file used by ReplayWebpage to rebuild and display the archived content.

Inversely, WACZ files are always loaded on-demand and no indexing is required. The initial archive view should load very quickly as a result.

For users comfortable with the command line, we recommend converting WARC files to WACZ files by using the wacz python library, see py-wacz on GitHub for more information.

ReplayWeb.page supports the following file formats listed as "Supported". Format type is determined based on the file's extension.

Format Extensions Status
WACZ .wacz Supported
WARC .warc, .warc.gz Supported
HAR .har Supported
WBN .wbn Not Supported
ARC .arc Not Supported
CDX .cdx, .cdxj Supported