Archive of TG.no (TG25 and beyond)

Archive of TG25 site using WARC standard. See ./browsertrix-crawler folder for files and details.

Browsertrix-crawler (Webrecorder tools)

Generated using go-archive repo that uses browsertrix-crawler to generates an interactive (and timetravelable) archive using the WARC (Web ARChive) standard.

Capturing

To capture a new snapshot of gathering.org run the crawler command in go-archive repo with tgnos crawl configuration file. Then update this repo with additional archive files generated.

PS. As we start using WARC as our new archive standard, we expect to transition to a semi-automatic archive setup, where we generate snapshots of the site on a set interval.

Displaying

The recommended setup is just running go-archive repo/service since that is the known working setup that is used on Gathering.org archive.

To run manually install pywb and use the wayback command and a local collection configuration file (see their docs or examples in go-archive).

Due to size of repo/files we use Git LFS for storage of WARC files.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
tgno		tgno
.gitattributes		.gitattributes
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Archive of TG.no (TG25 and beyond)

Browsertrix-crawler (Webrecorder tools)

Capturing

Displaying

About

Releases

Packages

gathering/go-archive-tgno

Folders and files

Latest commit

History

Repository files navigation

Archive of TG.no (TG25 and beyond)

Browsertrix-crawler (Webrecorder tools)

Capturing

Displaying

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages