Skip to content

Codes for our paper -- Micro-expression recognition with layered relations and more input frames.

Notifications You must be signed in to change notification settings

ini17/Micro-expression-recognition-with-layered-relations-and-more-input-frames

Repository files navigation

Micro-expression Recognition with Layered Relations and More Input Frames

Introduction

This repo contains all the code needed to reproduce the results of this paper - "Micro-expression-recognition-with-layered-relations-and-more-input-frames" - [paper]. It consists of pure Python codes.

Installation for Python

Requirements

# Install requirement
$ pip install -r requirements.txt

# Download landmarks weight for DLIB
$ mkdir -p PythonCodes/dataloader/weight
$ wget https://github.com/davisking/dlib-models/raw/master/mmod_human_face_detector.dat.bz2 -P dataloader/weight
$ bzip2 -d PythonCodes/dataloader/weight/mmod_human_face_detector.dat.bz2
$ wget https://github.com/davisking/dlib-models/raw/master/shape_predictor_68_face_landmarks.dat.bz2 -P dataloader/weight
$ bzip2 -d PythonCodes/dataloader/weight/shape_predictor_68_face_landmarks.dat.bz2

Instructions for use

In general, this paper conducts end-to-end MER model training on the public datasets, which is divided into the following steps.

  1. Video interpolation is performed for each ME sample (with RIFE).
  2. Motion magnification and extraction on interpolated frames (with MagNet).
  3. Action Units correlation calculation to produce npz files.
  4. Preprocessing : Extract a specified number of ROIs for the optical flow map; and process the original ME sample, convert it to the corresponding data format.
  5. End-to-end training is based on the above ROIs data and AU data.

Since there are many data files involved, the data path needs to be carefully modified according to the comments.

DLIB with GPU (not necessary)

# Remove the cpu version first
$ pip uninstall dlib
# Install cudnn and its toolkit
$ conda install cudnn cudatoolkit
# Build from source
$ git clone https://github.com/davisking/dlib.git
$ cd dlib
$ mkdir build & cd build
$ cmake .. \
    -DDLIB_USE_CUDA=1 \
    -DUSE_AVX_INSTRUCTIONS=1 \
    -DCMAKE_PREFIX_PATH=<path to  conda env>\
    -DCMAKE_C_COMPILER=gcc-6 -DCMAKE_CXX_COMPILER=g++-6
$ cmake --build .
$ cd ..
$ python setup.py install \
    --set USE_AVX_INSTRUCTIONS=1 \
    --set DLIB_USE_CUDA=1 \
    --set CMAKE_PREFIX_PATH=<path to  conda env>  \
    --set CMAKE_C_COMPILER=gcc-6 \
    --set CMAKE_CXX_COMPILER=g++-

Pretrained models

RIFE

The RIFE (Real-time Intermediate Flow Estimation) model is used in the video interpolation step, and the required model is included in this repo due to its small size.

MagNet

The structure of MagNet was adapted from here. Download the pra-trained models, place it in the appropriate directory, and modify the path in generate_MagNet_images.py to use it for data preprocessing.

Datasets

Training

usage: train.py [-h] --parallel parallel --csv_path CSV_PATH 
				--mat_dir mat_dir --npz_file npz_dir
                --catego CATEGO [--num_classes NUM_CLASSES]
                [--batch_size BATCH_SIZE]
                [--weight_save_path WEIGHT_SAVE_PATH] [--epochs EPOCHS]
                [--learning_rate LEARNING_RATE]

optional arguments:
  -h, --help            show this help message and exit
  --parallel parallel	Number of frames
  --csv_path CSV_PATH   Path for the csv file for training data
  --mat_dir mat_dir		Root for the training images
  --npz_file npz_dir  	Files root for npz
  --catego CATEGO       SAMM or CASME dataset
  --num_classes NUM_CLASSES
                        Classes to be trained
  --batch_size BATCH_SIZE
                        Training batch size
  --weight_save_path WEIGHT_SAVE_PATH
                        Path for the saving weight
  --epochs EPOCHS       Epochs for training the model
  --learning_rate LEARNING_RATE
                        Learning rate for training the model

Citation

@inproceedings{huang2023micro,
  title={Micro-Expression Recognition with Layered Relations and More Input Frames},
  author={Huang, Pinyi and Wang, Lei and Cai, Tianfu and Guo, Kehua},
  booktitle={2023 IEEE International Conference on Image Processing (ICIP)},
  pages={2210--2214},
  year={2023},
  organization={IEEE}
}

About

Codes for our paper -- Micro-expression recognition with layered relations and more input frames.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages