data
README.md
add_oids.py
bulk_import.sql
deep_copy.py
events_meta_schema.yml
identify_tags_supporters.py
increment_oids.py
list_tags_supporters.py
parse.py
scrape_events.py
scrape_news.py
src

README.md

Scrape data from old website

The old Jekyll website has a lot of content (events, people, news items...) embedded in markdown files. This content needs to be scraped and parsed into a set of TSV files that can be bulk-imported into the new site database.

This folder contains the scripts that were used for this purpose, though they are not currently packaged for re-use. If there is demand for another Galaxy to use this site, I could package this code so that others can parse their Jekyll data and load it into the new app database.
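
To illustrate the general idea of turning Jekyll markdown into TSV rows, here is a minimal sketch. It is not the code in `parse.py` or the scrape scripts; the function names (`parse_front_matter`, `to_tsv`), the sample event and its column names are all hypothetical, and it assumes the markdown files carry simple `key: value` front matter between `---` delimiters.

```python
import csv
import io


def parse_front_matter(text):
    """Split a Jekyll markdown file into (metadata dict, body string).

    Assumption: front matter is flat `key: value` pairs between two
    `---` lines. Nested YAML is not handled in this sketch.
    """
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}, text
    meta = {}
    for i, line in enumerate(lines[1:], start=1):
        if line.strip() == "---":
            # Everything after the closing delimiter is the page body.
            body = "\n".join(lines[i + 1:]).strip()
            return meta, body
        # Skip indented (nested) lines and comments.
        if ":" in line and not line.startswith((" ", "#")):
            key, _, value = line.partition(":")
            meta[key.strip()] = value.strip().strip("\"'")
    return {}, text  # no closing delimiter found


def to_tsv(records, columns):
    """Serialize a list of dicts to a TSV string with a header row."""
    buf = io.StringIO()
    writer = csv.writer(buf, delimiter="\t", lineterminator="\n")
    writer.writerow(columns)
    for rec in records:
        writer.writerow([rec.get(col, "") for col in columns])
    return buf.getvalue()


# Hypothetical event page, for illustration only.
sample = """---
title: "Galaxy workshop"
date: 2021-03-04
location: Online
---
Join us for a hands-on workshop.
"""

meta, body = parse_front_matter(sample)
print(to_tsv([meta], ["title", "date", "location"]))
```

A real front matter block may contain lists or nested keys, in which case a full YAML parser would be a better fit than the line-based parsing used here.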
