GitHub - tud-zih-tools/JobLog: Queries job information from a HPC job scheduler like SLURM.

JobLog

This python script queries job information from an HPC job scheduler. The information are stored in a JSON file. Currently, JobLog supports only SLURM. JobLog queries the following fields of the running job.

This software was developed as part of the EC H2020 funded project NEXTGenIO (Project ID: 671951) http://www.nextgenio.eu.

Job Information

JobId
JobName
StartTime
EndTime
SubmitTime
NumNodes
NumCPUs
NumTasks
Dependency
ExitCode

Job Step Information

JobID
NNodes
NTasks
NCPUS
Start
End
Elapsed
JobName
NodeList
ExitCode
State

Job information is once only included in the JSON file and job step information may be contained multiple times.

Requirements

Python3
SLURM and permissions to execute scontrol and sacct

Usage

> python joblog.py $JOBID /path/to/output

The $JOBID tells JobLog which job is running and will be used. The last parameter is used to specify the output directory. After a successful execution, JobLog created the file job_log.json in the output directory.

Interactive Jobs

SLURM provides the environment variable SLURM_EPILOG to specify an epilog script. JobLog can be intergrated using an additional epilog script:

#!/bin/sh

module load Python

joblog=/path/to/joblog.py

python3 $joblog $SLURM_JOBID $HOME

This script can be set before executing the srun command.

> SLURM_EPILOG="/path/to/epilog_script.sh" srun -n 16 ./app

Job Script

Just add the execution of JobLog into your job script like that

#!/bin/bash
#SBATCH -J HELLOWORLD
#SBATCH --account=zihforschung
#SBATCH --ntasks=4
#SBATCH --time=00:02:00
#SBATCH --partition=haswell

module load OpenMPI Python

srun -n 4 ./mpi_helloworld

python3 /path/to/joblog.py $SLURM_JOBID $HOME

You can also use the SLURM epilog variable but than the script will be called multiple times. Basically, this is not an issue because JobLog will override an existing JSON file.

example_log.json contains an example output of a job run with one step.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
LICENSE		LICENSE
README.md		README.md
example_log.json		example_log.json
joblog.py		joblog.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

JobLog

Job Information

Job Step Information

Requirements

Usage

Interactive Jobs

Job Script

About

Releases

Packages

Languages

License

tud-zih-tools/JobLog

Folders and files

Latest commit

History

Repository files navigation

JobLog

Job Information

Job Step Information

Requirements

Usage

Interactive Jobs

Job Script

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages