How to get started

People to contact...

Matthew Woolhouse - Professor
Jotthi Bansal - Lab instructor
Kurt - Developer
Michael - Developer
Michael Mallon - IT Specialist

Server info

Server IP address: 130.113.103.46
Server host: user_name@woolhouse-g.humanities.mcmaster.ca

How to use the Humanities Servers

Get access to the Humanities Server by asking Dr. Woolhouse or Jotthi.
Install the VPN (link) so that you can reach the server when you are not connected to McMaster's network.
SSH into the server: ssh user_name@woolhouse-g.humanities.mcmaster.ca
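
To avoid retyping the full host name, an SSH config entry can help. The sketch below only prints such an entry. The alias grail is a hypothetical name chosen for illustration, user_name is the same placeholder used above, and port 22 is taken from the rsync command later in this guide.

```shell
# Print an SSH config stanza for the lab server.
# $1: your server user name (defaults to the placeholder user_name)
# $2: a short alias for the host ("grail" is a hypothetical choice)
ssh_config_stanza() {
  printf 'Host %s\n' "${2:-grail}"
  printf '    HostName %s\n' 'woolhouse-g.humanities.mcmaster.ca'
  printf '    User %s\n' "${1:-user_name}"
  printf '    Port %s\n' '22'
}
```

Appending the output to ~/.ssh/config (ssh_config_stanza your_user >> ~/.ssh/config) should let you connect with ssh grail.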

Redis Setup

MySQL Setup

Download MySQL Community Edition (GPL) (link).
Change the temporary password. Start by opening the MySQL terminal:

mysql -u root -h 127.0.0.1 -p

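On macOS there is no default MySQL config file. Create /usr/local/mysql/etc/my.cnf with the contents below and restart the MySQL server. This makes the server skip password checks, so you can log in without the temporary password.

[mysqld]
skip-grant-tables

Set the new password: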

ALTER USER 'root'@'localhost' IDENTIFIED BY 'new-password';
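
A server started with skip-grant-tables normally rejects ALTER USER until the grant tables are reloaded, so FLUSH PRIVILEGES usually has to run first. The sketch below only prints the two statements. The function name is hypothetical, and it assumes the new password contains no single quotes or backslashes.

```shell
# Print the SQL that sets a new root password.
# FLUSH PRIVILEGES comes first because a server running with
# skip-grant-tables refuses ALTER USER until the grant tables are reloaded.
# $1: the new password (assumed to contain no single quotes or backslashes)
root_password_sql() {
  printf 'FLUSH PRIVILEGES;\n'
  printf "ALTER USER 'root'@'localhost' IDENTIFIED BY '%s';\n" "$1"
}
```

The output can be pasted into the MySQL terminal. Once the password is changed, it is probably wise to remove skip-grant-tables from my.cnf and restart the server so that passwords are enforced again.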

Workflow (locally)

Clone the repo, get the credentials, and install MySQL and Redis.
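
Before starting, it may be worth checking that the required command-line tools are installed. The helper below is a minimal sketch with a hypothetical name; it prints each listed command that is not found on your PATH.

```shell
# Print the name of every listed command that is not on PATH.
# Prints nothing when all of them are available.
missing_tools() {
  for tool in "$@"; do
    command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
  done
}
```

For this workflow a plausible call is missing_tools git npm mysql redis-cli rsync. The list is an assumption based on the commands used in this guide.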

1. MySQL Tables

The crawler cannot add or delete rows until the tables exist in the database.

Run rsync -avz -e 'ssh -p 22' user_name@130.113.103.46:~/grail-dump.sql.gz ~/local_directory
- This downloads a copy of the current database dump to your local machine.
Run mysql -u username -p database_name < file.sql
- Run this from the directory you downloaded the dump into. It creates the tables and columns in your local database and loads the data into them. The downloaded file is gzip-compressed, so decompress it first (for example with gunzip) and use the resulting file in place of file.sql.
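
Because the dump arrives as a .sql.gz file, the import can also read it without a separate decompression step. The helper below is a sketch with a hypothetical name; it writes the SQL to standard output whether or not the file is compressed.

```shell
# Write the SQL from a dump file to stdout.
# Files ending in .gz are decompressed on the fly; others are passed through.
emit_sql() {
  case "$1" in
    *.gz) gunzip -c "$1" ;;
    *)    cat "$1" ;;
  esac
}
```

With the placeholders used above, the import would then look like emit_sql ~/local_directory/grail-dump.sql.gz | mysql -u username -p database_name.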

2. Getting the Grails Crawler to crawl

To populate the job queue and start the crawler:

Run npm run development:master
- This initializes the dashboard and job queue.
Go to the dashboard (link)
- Note: the port is listed in the confs.
Run npm run seed:musicbrainz:track
- This loads the queue with jobs taken from the seed files (TSV).
Run npm run development:worker
- This takes jobs from the job queue and processes them.
- It also inserts the results into the database.
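
The seed and worker steps can be wrapped in a small script. This is a sketch only: the function names are hypothetical, and it assumes npm run development:master is already running in another terminal, since this guide does not say whether that command returns.

```shell
# Echo a command, then run it. With DRY_RUN=1 the command is only echoed.
run_step() {
  printf '+ %s\n' "$*"
  [ "${DRY_RUN:-0}" = "1" ] || "$@"
}

# Fill the queue, then start a worker.
# Assumes the master process is already running elsewhere (an assumption).
seed_and_work() {
  run_step npm run seed:musicbrainz:track &&
  run_step npm run development:worker
}
```

Setting DRY_RUN=1 before calling seed_and_work prints the two commands without running them, which is a cheap way to check the order.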

3. Resetting the Job Queue

To test previous crawls you will need to reset the job queue.

Run redis-cli
- The jobs are stored in a Redis database, so you need to start the Redis command prompt.
Run flushall
- This deletes all the queued jobs. Note that flushall clears every database on the Redis server, not only the job queue.
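
Since flushall is destructive, a confirmation prompt can guard against running it by accident. The wrapper below is a minimal sketch with a hypothetical name; it assumes redis-cli connects to your local Redis server with its default settings.

```shell
# Ask for confirmation, then clear Redis.
# FLUSHALL removes every key in every database on the server,
# not only the crawler's queued jobs.
reset_job_queue() {
  printf 'Delete ALL keys on the local Redis server? [y/N] '
  read -r answer
  case "$answer" in
    y|Y) redis-cli flushall ;;
    *)   echo "Aborted."; return 1 ;;
  esac
}
```

Any answer other than y or Y leaves Redis untouched and returns a non-zero status.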
