[ACCV 2024] ELLAR: An Action Recognition Dataset for Extremely Low-Light Conditions with Dual Gamma Adaptive Modulation

By Minse Ha^★, Wan-Gi Bae^★, Geunyoung Bae, and Jong Taek Lee^†.

This repository is the official implementation of "ELLAR: An Action Recognition Dataset for Extremely Low-Light Conditions with Dual Gamma Adaptive Modulation". It is based on mmaction2 and Video Swin Transformer.

Updates

12/04/2024 Our paper is now published at CVF, here.

10/03/2024 Initial commit. The project page is now available here.

About

In this research, we address the challenging problem of action recognition in extremely low-light environments. We present a new dataset with more than 12K video samples, named Extremely Low-Light condition Action Recognition (ELLAR). The dataset is constructed to reflect the characteristics of extremely low-light conditions. Furthermore, we propose a simple yet strong baseline method, DGAM (Dual Gamma Adaptive Modulation), which enables models to adapt flexibly to a range of low illuminance levels. Our approach surpasses the previous state of the art by 3.39 percentage points in top-1 accuracy on the ELLAR dataset.

ELLAR Dataset

This dataset is divided into two parts based on the illumination of the locations: low-light (LL) and extremely low-light (ELL). The LL part is captured at three outdoor locations under low-light conditions and the ELL part is recorded at two extremely low-light indoor settings.

Model and experimental results

The core idea of DGAM (Dual Gamma Adaptive Modulation) is its dual Mixture of Experts structure. This structure first identifies the characteristics of each sample and then performs adaptive image enhancement tuned for action recognition. The dual mixture-of-experts design allows the action recognition model to respond dynamically to inputs from diverse dark settings.
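To make the idea of adaptive gamma modulation concrete, here is a minimal sketch in plain Python. It is not the DGAM implementation: the expert gamma values, the brightness-based gating rule, and all function names are assumptions chosen for illustration, and only a single mixture is shown rather than the dual structure. In a trained model the gate would presumably be a learned network rather than a hand-written rule.

```python
import math

# Hypothetical expert gammas (assumption): values below 1 brighten dark pixels.
EXPERT_GAMMAS = [0.3, 0.5, 0.8]


def gamma_correct(pixels, gamma):
    """Apply out = in ** gamma to intensities in [0, 1]."""
    return [p ** gamma for p in pixels]


def gate_weights(pixels, temperature=0.1):
    """Toy gating rule (assumption): darker inputs favour smaller gammas.

    Scores are the negative distance between the mean brightness and each
    expert gamma, turned into weights with a softmax.
    """
    mean = sum(pixels) / len(pixels)
    scores = [-abs(mean - g) / temperature for g in EXPERT_GAMMAS]
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def modulate(pixels):
    """Blend the gamma-corrected outputs of all experts with the gate weights."""
    weights = gate_weights(pixels)
    outputs = [gamma_correct(pixels, g) for g in EXPERT_GAMMAS]
    return [
        sum(w * out[i] for w, out in zip(weights, outputs))
        for i in range(len(pixels))
    ]


# A very dark toy "frame" with intensities in [0, 1].
dark_frame = [0.01, 0.02, 0.05, 0.03]
enhanced = modulate(dark_frame)
```

Because every expert uses a gamma below 1, each enhanced value is brighter than its input, and the gate decides how strongly to brighten based on how dark the sample is.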

Comparison Result on ELLAR Dataset

Model	Pretrained	Input Size	Top-1 (%)	Top-5 (%)
ResNet101	K700	3×16×112²	10.46	45.69
ResNeXt101	K400	3×16×112²	9.63	39.37
DarkLight	IG-65M	3×64×112²	28.58	64.31
TimeSformer	K400	3×96×224²	15.51	55.96
Video-Swin-B	K400	3×32×224²	35.03	68.87
DGAM (Ours)	K400	3×32×224²	38.42	74.44
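The margin over the strongest baseline can be recomputed directly from the comparison table above. The short snippet below only uses the top-1 numbers listed there; the variable names are illustrative.

```python
# Top-1 accuracy (%) copied from the comparison table above.
top1 = {
    "ResNet101": 10.46,
    "ResNeXt101": 9.63,
    "DarkLight": 28.58,
    "TimeSformer": 15.51,
    "Video-Swin-B": 35.03,
    "DGAM (Ours)": 38.42,
}

ours = top1["DGAM (Ours)"]
# Strongest competing model, excluding DGAM itself.
best_name, best_score = max(
    ((name, score) for name, score in top1.items() if name != "DGAM (Ours)"),
    key=lambda item: item[1],
)
gain = round(ours - best_score, 2)
print(f"Gain over {best_name}: {gain} percentage points")
```

The strongest baseline is Video-Swin-B, and the difference is the 3.39-point top-1 improvement quoted in the About section.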

Our model is pretrained on Kinetics-400 and fine-tuned on the ELLAR dataset. You can download the checkpoint file (DGAM_ELLAR.pth) here.

The config file follows the mmaction2 format. The config file for DGAM is located at ./configs/recognition/swin/hydra_config.py.

Usage

Installation

Please refer to the mmaction2 and Video Swin Transformer setup instructions for installation.

Detailed installation instructions will be added soon.

Data Preparation

You can download the ELLAR dataset, including both the videos and the annotation files, from here.

Expected Data Directory Structure:

.data/
|-- Other dataset/
|-- Another dataset/
|-- ELLAR/
|------- ELLAR_label_train.txt
|------- ELLAR_label_val.txt
|------- ELLAR_label_test.txt
|------- videos/
|------------- Walking/
|------------- Running/
|------------- Stretching/
|------------- ...
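Before training or testing, it can help to confirm that the layout above is in place. The following is a minimal sketch of such a check. The function name is hypothetical, and the annotation format is an assumption: each non-empty line is taken to be `<video path relative to videos/> <integer label>`, as in mmaction2's usual video annotation lists. Adjust it if the released annotation files differ.

```python
from pathlib import Path

SPLITS = ("train", "val", "test")


def check_ellar_layout(root):
    """Return a list of problems found under the ELLAR data directory."""
    root = Path(root)
    problems = []
    if not (root / "videos").is_dir():
        problems.append("missing directory: videos/")
    for split in SPLITS:
        ann = root / f"ELLAR_label_{split}.txt"
        if not ann.is_file():
            problems.append(f"missing annotation file: {ann.name}")
            continue
        for line_no, line in enumerate(ann.read_text().splitlines(), start=1):
            if not line.strip():
                continue
            # Assumed format: "<video path relative to videos/> <integer label>"
            parts = line.rsplit(maxsplit=1)
            if len(parts) != 2 or not parts[1].isdigit():
                problems.append(f"{ann.name}:{line_no}: unexpected format")
                continue
            if not (root / "videos" / parts[0]).is_file():
                problems.append(f"{ann.name}:{line_no}: missing video {parts[0]}")
    return problems
```

Calling `check_ellar_layout(...)` with the path to your ELLAR folder returns an empty list when the three annotation files exist and every listed video is found.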

Inference

python tools/test.py configs/recognition/swin/hydra_config.py work_dirs/hydra_den/DGAM_ELLAR.pth --eval top_k_accuracy
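The `--eval top_k_accuracy` option reports the Top-1 and Top-5 numbers shown in the comparison table. As a reminder of what the metric measures, here is a small standalone sketch. It is not mmaction2's implementation, and the scores are made up for illustration.

```python
def top_k_accuracy(scores, labels, k=1):
    """Fraction of samples whose true label is among the k highest scores."""
    hits = 0
    for sample_scores, label in zip(scores, labels):
        # Class indices sorted from highest to lowest score.
        ranked = sorted(
            range(len(sample_scores)),
            key=lambda c: sample_scores[c],
            reverse=True,
        )
        if label in ranked[:k]:
            hits += 1
    return hits / len(labels)


# Three samples, four classes (made-up scores for illustration).
scores = [
    [0.1, 0.6, 0.2, 0.1],  # true class 1 has the highest score
    [0.4, 0.1, 0.3, 0.2],  # true class 2 has the second-highest score
    [0.5, 0.2, 0.2, 0.1],  # true class 3 has the lowest score
]
labels = [1, 2, 3]

top1 = top_k_accuracy(scores, labels, k=1)  # one of three samples is a hit
top2 = top_k_accuracy(scores, labels, k=2)  # two of three samples are hits
```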

Training

python tools/train.py configs/recognition/swin/hydra_config.py --cfg-options load_from=work_dirs/hydra_den/DGAM_ELLAR.pth model.backbone.use_checkpoint=True --validate
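The `--cfg-options` flag takes `key=value` pairs whose dotted keys address nested fields of the config, so `model.backbone.use_checkpoint=True` sets `use_checkpoint` inside `model.backbone`. The sketch below illustrates that idea on a plain dictionary. It is a simplified stand-in rather than mmcv's actual parser: the function name, the toy config, and the placeholder checkpoint path are all assumptions.

```python
def apply_overrides(cfg, overrides):
    """Merge 'a.b.c=value' strings into a nested dict and return it."""
    for item in overrides:
        dotted_key, raw = item.split("=", 1)
        # Toy value parsing (assumption): only booleans are converted.
        value = {"True": True, "False": False}.get(raw, raw)
        node = cfg
        *parents, leaf = dotted_key.split(".")
        for name in parents:
            # Create intermediate levels when they do not exist yet.
            node = node.setdefault(name, {})
        node[leaf] = value
    return cfg


# Toy config with placeholder values (not the real hydra_config.py).
cfg = {"model": {"backbone": {"type": "ExampleBackbone", "use_checkpoint": False}}}

apply_overrides(
    cfg,
    ["load_from=path/to/checkpoint.pth", "model.backbone.use_checkpoint=True"],
)
```

After the call, `cfg["load_from"]` holds the checkpoint path and `cfg["model"]["backbone"]["use_checkpoint"]` is `True`, while the other backbone fields are left unchanged.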

Citation

If you find our work useful in your research, please cite:

@InProceedings{Ha_2024_ACCV,
    author    = {Ha, Minse and Bae, Wan-Gi and Bae, Geunyoung and Lee, Jong Taek},
    title     = {ELLAR: An Action Recognition Dataset for Extremely Low-Light Conditions with Dual Gamma Adaptive Modulation},
    booktitle = {Proceedings of the Asian Conference on Computer Vision (ACCV)},
    month     = {December},
    year      = {2024},
    pages     = {800-817}
}
