Skip to content

Warcio does not support replay of sites hosted on NCSA 1.5 #141

@omgoo

Description

@omgoo

Here is an interesting one for you Ilya.

The original NCSA 1.5 web server responds with "HTTP 200 Document follows" rather than HTTP/1.0.

In recorderloader.py HTTP_TYPES is only looking for 'HTTP/1.0', 'HTTP/1.1'.

Modifying HTTP_TYPES to look for 'HTTP/1.0', 'HTTP/1.1', 'HTTP' does allow the request web page to replay. I'd add this as a PR but I doubt this is the best idea.

Here is the header from the ARC file in question:

http://www.open.gov.uk:80/ofsted/nursery/rp511200.htm 193.32.28.8 19970616061332 text/html 30594
HTTP 200 Document follows
Date: Mon, 16 Jun 1997 07:09:23 GMT
Server: NCSA/1.5.1
Last-modified: Fri, 09 May 1997 20:24:52 GMT
Content-type: text/html
Content-length: 30414

This is the url in question but you'll only see a 500 error:

https://webarchive.nationalarchives.gov.uk//ukgwa/19970616061332/http://www.open.gov.uk:80/ofsted/nursery/rp511200.htm

I'll share the ARC file with you if I can get permission to release it.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions