Website Archiving Configuration Detail

General Information:

Each website will have specific settings that are available to set.  Note, the default settings are sufficient for basic website archiving and no adjustments are required.

Archive Address – This is the email address that is used when we create messages for archiving websites.  This address can be used when searching for a specific website.

Last Update – This is the last time a change was made to your website settings or page list.

Active – This shows the status of the website, it can be made inactive if you wish to pause archiving of the site and can be re-activated at a later time to resume archiving.

Archive Frequency – This shows the schedule that has been assigned to the site.   Monthly, Weekly or Daily (also indicates which day(s) are used for the scheduled archiving).

SiteMap Url – This is an optional setting if our service can’t locate your website xml sitemap, it should only be used with the direction of our support team.

Archive Type – This will show “default” which indicates that your site will be archived at a rate to avoid your webhosting service provider or webserver from being overloaded.   Websites that are greater than 1500 pages may need to be set to a “Fast” archive by contacting support.

Archive Configuration:

Modified Policy – Enabling this setting and adding the associated policy will flag every webpage that changes.  This setting allows you to flag all websites when a page changes, this will alert your team to review the changes under the flagged review menu.

Individual webpages can be excluded from this policy if needed.  The modified website policy can be updated under the Policies menu – “Modified Website Policy”.   Note, if you do not have this policy, you can add it using the “Add New Policy” button and selecting it from the templates.

Skip Mandatory URL Policy – When disabled this will flag the website if the site doesn’t contain all the mandatory urls that have been added to the mandatory url policy.  The “Mandatory URL Policy” can be updated or added under the Policies menu.

Skip Policies – This allows you to skip all policy scanning for the selected website if you do not wish to flag any of your webpages for review.   This is used when your firm already approves website changes so they do not need to be reviewed using our platform.

Duplicate Check – This is enabled by default and will check your website pages and only archive a page if it has changed.   This can be disabled if you need a full website archive every time regardless of if the pages have changed.  This should only be enabled after contacting support.

External URL Policy – This will use the external url policy and will flag a website if pages have external urls listed that aren’t in the approved list.   The external url policy can be added or edited under the Policies menu.

WebCrawl Information

Our service will automatically “crawl” your website and discover new pages and remove any outdated pages.   We automatically update your website webpage list so that new pages will be automatically archived and pages that have been removed will be removed from archiving.   Please note that pages that are removed are still retained in the archive.

Name – This is a unique identifier that is used to track the webcrawls that have been run.

Last Update – This is the last date/time that the website had detected changes and updated the website.

Status – This should show “Success” indicating that our service is able to crawl your site, if this shows failed, please contact support.

Action – This will show “None Required” unless you run a manual webcrawl and need to apply the changes after reviewing the results.

Run WebCrawl – This is only used if you need to run a webcrawl manually, generally this is only needed if you have made website changes and don’t want to wait for the next weekly automatic webcrawl.

WebCrawl Report – This report will show the status of the automated crawls and any manual crawls showing the number of pages added/removed by the crawl.

Webpage Configuration:

Webpages will be automatically added/removed from your website.  No manual settings are required.  However, we list the definitions below for your reference.

Active – Active page that will be archived, Inactive pages will not be archived

Duplicate Check – Disabling this will force the page to be archived every time even if it hasn’t change.   Only change this after contacting support.

External URL Policy – enable or disable the policy for this page only

URL Encoded – this is enabled if your page has special characters in the URL

Data Type – This will indicate if it is a standard webpage of if the link is to a file attachment (like PDF)

Update TypeDynamic is the default which indicates that our system will update the page automatically if the URL changes or remove it if the page is not found.   Static indicates that you want to archive a page that may normally not be found when performing a webcrawl of the website.   For example, a page that isn’t directly linked from your website but is still accessible so you want it included in the archiving.

Modified Policy – This allows you to enable or disable the modified policy if you have it enabled for your website.

Image Quality – The default of Standard should be used unless support recommends using a higher quality for specific pages.

HTTP URL – this is the address of the page that will be archived.

Context Credential List:

The context credentials are only used when archiving a website that requires a logon/password.   Please only use after contacting support.