covid-19-classification

Overview

This work showcases a binary classifier for chest X-ray images that distinguishes between COVID-19 and no finding (healthy) using a convolutional neural network. In addition, LIME and Grad-CAM explainers are integrated into a web interface.

Web Interface Screenshot

Prerequisites

Install

pip install -r requirements.txt
npm install
cd src/frontend
npm install

Training

The classification model is trained from a Jupyter notebook. The notebook further documents the training steps and the required datasets and models.

See training.ipynb.

Usage

The application is divided into a backend and a frontend.

The backend consists of Python workers that classify and explain chest X-ray images. The API itself is written in JavaScript (Node.js) and merely forwards requests to a Python worker (server.py), which starts a thread for each classification/explanation task. The API and the Python worker communicate over the standard input/output streams as follows:

  • A request for a method, e.g. classification, containing an X-ray image payload is sent to the corresponding endpoint, e.g. POST /v1/classifier.
  • The node.js API (index.js) validates the request, stores the image on disk and assigns a unique identifier to it.
  • The node.js API sends a single line to the stdin of the server.py-process consisting of: METHOD ID, e.g. classify f00091ff-cb7a.
  • The server.py-process starts a thread for the specific task and prints METHOD ID RESULT on stdout once the task finishes.
  • Finally, the API can answer the HTTP-request.

The server.py-process is initialized on startup and kept running for the entire lifecycle of the API-process.
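
The line protocol described above can be illustrated with a small Python sketch. This is not the code in server.py; it is a simplified illustration that assumes one message per line and one thread per task. The names `serve`, `HANDLERS` and the placeholder `classify` function are hypothetical and only serve the example.

```python
import threading


def classify(image_id):
    # Placeholder (assumption): a real worker would load
    # CACHE_DIR_PATH/IMAGE_ID.png and run the classification model.
    return "ok"


# Hypothetical mapping from METHOD to the function that handles it.
HANDLERS = {"classify": classify}


def serve(stdin, stdout, handlers):
    """Read 'METHOD ID' lines and reply with 'METHOD ID RESULT' lines."""
    write_lock = threading.Lock()  # keeps concurrent replies from interleaving
    threads = []

    def run(method, image_id):
        result = handlers[method](image_id)
        with write_lock:
            stdout.write(f"{method} {image_id} {result}\n")
            stdout.flush()

    for line in stdin:
        parts = line.split()
        if len(parts) != 2 or parts[0] not in handlers:
            continue  # this sketch silently ignores malformed or unknown messages
        thread = threading.Thread(target=run, args=tuple(parts))
        thread.start()
        threads.append(thread)

    for thread in threads:
        thread.join()  # wait for outstanding tasks once stdin is closed
```

Calling `serve(sys.stdin, sys.stdout, HANDLERS)` would turn the sketch into a stdin/stdout worker. Because each task runs in its own thread, replies may arrive in a different order than the requests, which is why every reply repeats the METHOD and ID.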

The frontend is based on react.js.

  1. Start the backend/API:

    $ node src/index
    usage: index [-h] -c MODEL_PATH -s SEGMENTATION_MODEL_PATH --cache-dir-path
                CACHE_DIR_PATH [--disable-api-cache]
                [--api-cache-lifetime API_CACHE_LIFETIME] [-p PORT] [-ip HOST]
    
    Covid-19 Classification API
    
    optional arguments:
      -h, --help            show this help message and exit
      -c MODEL_PATH, --model-path MODEL_PATH
                            path to classification model
      -s SEGMENTATION_MODEL_PATH, --segmentation-model-path SEGMENTATION_MODEL_PATH
                            path to segmentation model (U-Net)
      --cache-dir-path CACHE_DIR_PATH
                            path to cache dir
      --training-dir-path TRAINING_DIR_PATH
                            path to training queue dir
      --disable-api-cache   path to cache dir
      --api-cache-lifetime API_CACHE_LIFETIME
                            api cache lifetime in minutes
      -p PORT, --port PORT  api port
      -ip HOST, --host HOST
                            api host
    

    --model-path, required, contains the path to the CNN / classification model.
    --segmentation-model-path, required, contains the path to the U-Net, which is used to segment the lungs prior to classification.
    --cache-dir-path, required, contains the path to a cache directory. Some tasks such as segmentation need a directory where artifacts such as masks can be stored. The path needs to point to an (arbitrary) writeable directory.
    --training-dir-path, required, contains the path to a training directory. A directory is needed to store new training images, which can be uploaded via the web interface. The path needs to point to an (arbitrary) writeable directory.
    --disable-api-cache, optional, use this flag to disable the API cache. Usually, identical requests (e.g. classification of the same image) are resolved using a cache.
    --api-cache-lifetime, optional, default: 5 minutes, use this parameter to change the lifetime of the cache entries.
    --port, optional, default: 3005, change the port of the API.
    --host, optional, default: localhost, change the host of the API.

  2. Start the frontend:

    cd src/frontend
    npm start
    

    The frontend will try to start on port 3000 and forward API requests to http://localhost:3005/ if possible. You can change your proxy path to the actual API host/port in frontend/package.json:

    "proxy": "http://localhost:3005",
    

Python worker interface

Internally, the node.js API starts the Python worker and manages the communication, so there is normally no need to access the worker directly. However, it can be used on its own to run a batch of classifications and explanations on many images. Its command-line parameters are a subset of the API parameters and provide the model files; the actual tasks are provided via stdin.

$ ./src/server.py
usage: server.py [-h] -c MODEL_PATH -s SEGMENTATION_MODEL_PATH
                 --cache-dir-path CACHE_DIR_PATH

Covid-19-Classification Server

The server accepts messages in the form of
"command image_id" e.g. "explain_lime f00091ff-cb7a"
on stdin. Once a command finished, the server
replies with the same message on stdout, 
followed by optional response parameters.
Allowed message types are: "classify", 
"explain_lime" and "explain_gradcam". 

The images have to be located in
"CACHE_DIR_PATH/IMAGE_ID.png".

optional arguments:
  -h, --help            show this help message and exit
  -c MODEL_PATH, --model-path MODEL_PATH
                        path to classification model
  -s SEGMENTATION_MODEL_PATH, --segmentation-model-path SEGMENTATION_MODEL_PATH
                        path to segmentation model (U-Net)
  --cache-dir-path CACHE_DIR_PATH
                        path to cache dir

A single image can be classified using:

echo "classify f00091ff-cb7a" | ./src/server.py -c data/model20200905-193900.h5 -s data/trained_model.hdf5 --cache-dir-path cache
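
To process many images in one run, the same idea can be scripted. The following Python sketch builds one message per image, pipes them to a single worker process and collects the replies. The helpers `build_messages`, `parse_replies` and `run_batch` are hypothetical names, not part of the repository. The sketch also assumes that the worker finishes outstanding tasks and exits once stdin is closed, as the `echo` example above suggests.

```python
import subprocess

# Message types listed in the server.py help text.
ALLOWED_METHODS = {"classify", "explain_lime", "explain_gradcam"}


def build_messages(method, image_ids):
    """Return the stdin payload: one 'METHOD ID' line per image."""
    return "".join(f"{method} {image_id}\n" for image_id in image_ids)


def parse_replies(stdout_text):
    """Map (method, image_id) to the result text of each 'METHOD ID RESULT' line."""
    replies = {}
    for line in stdout_text.splitlines():
        parts = line.split(maxsplit=2)
        if len(parts) < 2 or parts[0] not in ALLOWED_METHODS:
            continue  # skip blank lines and any unrelated output
        replies[(parts[0], parts[1])] = parts[2] if len(parts) == 3 else ""
    return replies


def run_batch(command, method, image_ids):
    """Pipe all messages to one worker process and collect its replies."""
    completed = subprocess.run(
        command,
        input=build_messages(method, image_ids),
        capture_output=True,
        text=True,
        check=True,  # raise if the worker exits with an error
    )
    return parse_replies(completed.stdout)
```

With the paths from the example above, a call could look like `run_batch(["./src/server.py", "-c", "data/model20200905-193900.h5", "-s", "data/trained_model.hdf5", "--cache-dir-path", "cache"], "classify", ["f00091ff-cb7a"])`. Each image has to exist as `cache/IMAGE_ID.png` beforehand. Starting the worker once for the whole batch avoids reloading the models for every image.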

Attributions

Icon made by Freepik from www.flaticon.com