← Back to feed
2025-12-08vision

DFIR-DETR: Frequency-Domain Iterative Refinement and Dynamic Feature Aggregation for Small Object Detection

Bo Gao, Jingcheng Tong, Xingsheng Chen, Han Yu, Zichen Li

Key claim

DFIR-DETR significantly enhances small object detection performance.

DFIR-DETR improves small object detection by addressing issues in attention mechanisms and feature upsampling. It achieves a mean Average Precision (mAP50) of 92.9% on NEU-DET and 51.6% on VisDrone with a compact model size of 11.7M parameters. This demonstrates effective performance across different detection scenarios.

Novelty
8.0/10

The paper introduces a new model addressing specific deficiencies in existing object detection frameworks.

Reliability
7.0/10

The evaluation on two distinct datasets supports the claims, though more extensive testing could enhance reliability.

Deep reliability assessment

The methodology supports improvements in small object detection by addressing specific architectural deficiencies in existing models. However, the claims of general applicability across diverse domains may be overclaimed without further validation on additional datasets.

Reproducibility

No open source code or dataset link is provided in the paper.

Discussion questions

  1. How does the frequency-domain approach specifically enhance small object detection compared to purely spatial methods?
  2. What are the practical implications of using DFIR-DETR for real-time applications in industrial and UAV settings?
  3. What experimental results or scenarios would challenge the claim that DFIR-DETR generalizes well across different domains?

Key figure

Figure 1 illustrates the overall architecture of DFIR-DETR, highlighting the integration of DCFA, DFPN, and FIRC3 modules.

Benchmark results

NEU-DETmAP50: 92.9vs RT-DETR+4.2%SOTA
VisDronemAP50: 51.6vs RT-DETR+3.4%SOTA
Read on arXiv →