warcio raises warcio.exceptions.ArchiveLoadFailed: Invalid WARC record, first line: WARC-Type: response
at the second WARC record (after the warc-info record) in a WARC file written with ArchiveSpark.
Both warcio and ArchiveSpark state that they follow the ISO 28500 draft: http://bibnum.bnf.fr/WARC/WARC_ISO_28500_version1_latestdraft.pdf
warcio works fine for WARC files written with Heritrix.
I posted an issue on warcio as well.
warcio also logs a warning before the error:
WARNING: Record not followed by newline, perhaps Content-Length is invalid
Offset: 433
Remainder: b'WARC/1.0\r\n'
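It could be that ArchiveSpark should write an additional empty line between the records. The WARC specification defines a record as header, CRLF, content block, CRLF, CRLF, so every record must end with two CRLF sequences, and the remainder above suggests that the next record's version line follows the previous content block directly.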
warcio.statusandheaders.StatusAndHeadersParserException: Expected Status Line starting with ['WARC/1.1', 'WARC/1.0', 'WARC/0.17', 'WARC/0.18'] - Found: WARC-Type: response
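To illustrate the suspected cause, here is a minimal stdlib-only sketch. It is not ArchiveSpark's or warcio's code. The names `serialize_record` and `find_bad_boundaries` are hypothetical, and it assumes an uncompressed WARC/1.0 byte stream whose payloads do not themselves contain a `WARC/1.0` line. The first function writes a record with the two trailing CRLFs the spec requires; the second reports the offsets where that terminator is missing.

```python
CRLF = b"\r\n"


def serialize_record(warc_headers, block):
    """Serialize one WARC/1.0 record: version line, headers, empty line,
    content block, then the CRLF CRLF record terminator."""
    lines = [b"WARC/1.0"]
    for name, value in warc_headers:
        lines.append(f"{name}: {value}".encode("utf-8"))
    lines.append(f"Content-Length: {len(block)}".encode("utf-8"))
    header = CRLF.join(lines) + CRLF + CRLF  # empty line ends the header
    return header + block + CRLF + CRLF      # two CRLFs end the record


def find_bad_boundaries(data):
    """Return the byte offsets at which a content block is NOT followed
    by CRLF CRLF (hypothetical helper, uncompressed input only)."""
    bad = []
    pos = 0
    while pos < len(data):
        header_end = data.find(CRLF + CRLF, pos)
        if header_end == -1:
            break
        length = None
        # Skip the version line, then look for Content-Length.
        for line in data[pos:header_end].split(CRLF)[1:]:
            name, _, value = line.partition(b":")
            if name.strip().lower() == b"content-length":
                length = int(value.strip())
        if length is None:
            break
        block_end = header_end + 4 + length
        if data[block_end:block_end + 4] == CRLF + CRLF:
            pos = block_end + 4
        else:
            bad.append(block_end)
            # Resynchronize on the next version line, if there is one.
            nxt = data.find(b"WARC/1.0" + CRLF, block_end)
            if nxt == -1:
                break
            pos = nxt
    return bad


info = serialize_record([("WARC-Type", "warcinfo")], b"software: example\r\n")
resp = serialize_record([("WARC-Type", "response")], b"HTTP/1.1 200 OK\r\n\r\nhello")

good = info + resp          # records separated by CRLF CRLF
broken = info[:-4] + resp   # terminator of the first record stripped

print(find_bad_boundaries(good))
print(find_bad_boundaries(broken))
```

In the `broken` stream the bytes right after the first content block are the `WARC/1.0` version line of the next record, which matches the remainder shown in the warning. If the ArchiveSpark output has the same shape, appending the missing CRLFs after each content block should be enough for warcio to read it.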