Skip to content

IJCAI 2024: Dialogue Cross-Enhanced Central Engagement Attention Model

Notifications You must be signed in to change notification settings

wujiekd/Dialogue-Cross-Enhanced-CEAM

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

34 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Dialogue Cross-Enhanced Central Engagement Attention Model for Real-Time Engagement Estimation

An illustration of our proposed model architecture.

image

1. Environment setting

1.0. Package

  • Several important packages
    • torch == 1.8.0+cu111
    • trochvision == 0.9.0+cu111

1.1. Dataset

Dataset | MultiMediate:Multi-modal Group Behaviour Analysis for Artificial Mediation

Download the dataset from the above link to the 'noxi' folder.

├── Engagemet Estiamtion/
│   ├── code/
│   │   ├── data
│   │   ├── src
└── └── └── output_model
│   ├── noxi/
│   │   ├── train
│   │   ├── val
└── └── └── test

1.2. OS

  • Windows10
  • Ubuntu20.04
  • macOS (CPU only)

2. Train

  • Single GPU Training
  • DataParallel (single machine multi-gpus)
  • DistributedDataParallel

(more information: https://pytorch.org/tutorials/intermediate/ddp_tutorial.html)

2.1. Dataset preprocessing

Perform preprocessing operations, normalization, and merging of multi-modal features.

python process.py

Use the center-based sliding window to partition multi-modal feature sequences for training, validation, and testing.

python CSW.py
├── code/
│   ├── data/
└── └── └── all_data

2.3. Train CEAM

python train.py \
    --N 3 \
    --M 0 \
    --K 0 \
    --save_dir CEAM

2.4. Train Dialogue Cross-Enhanced CEAM

python train.py \
    --N 1 \
    --M 1 \
    --K 2 \
    --save_dir DCECEAM

3. Evaluation

3.1. Eval CEAM

python eval.py \
    --N 3 \
    --M 0 \
    --K 0 \
    --save_dir CEAM

3.2. Eval Dialogue Cross-Enhanced CEAM

python eval.py \
    --N 1 \
    --M 1 \
    --K 2 \
    --save_dir DCECEAM

4. Result

Model Method Val CCC Test CCC Inference speed (FPS) Params  (M)
SA-based model [Yu et al.] Sliding window 0.796 - 4537 22.67
BiLSTM-based model [Yu et al.] Sliding window 0.818 0.689 1310 36.17
CEAM (Ours) Center-based sliding window 0.821 0.691 6455 23.98
Dialogue Cross-Enhanced CEAM (Ours) Center-based sliding window 0.835 0.704 6185 31.07

5. Model checkpoints

It can be downloaded from Google Cloud Disk.

It can be directly used for inference and to get the final result.

References

  • Müller P, Balazia M, Baur T, et al. MultiMediate'23: Engagement Estimation and Bodily Behaviour Recognition in Social Interactions[C]//Proceedings of the 31st ACM International Conference on Multimedia. 2023: 9640-9645.
  • Yu J, Lu K, Jing M, et al. Sliding Window Seq2seq Modeling for Engagement Estimation[C]//Proceedings of the 31st ACM International Conference on Multimedia. 2023: 9496-9500.

About

IJCAI 2024: Dialogue Cross-Enhanced Central Engagement Attention Model

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages