Super-resolution reconstruction from spike stream via multi-scale aggregation representation and attention guided alignment*
DOI:
CSTR:
Author:
Affiliation:

Tianjin University

Clc Number:

Fund Project:

The National Natural Science Foundation of China (General Program, Key Program, Major Research Plan)

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
  • |
  • Comments
    Abstract:

    Spike camera is a kind of neuromorphic vision sensor that records high-speed scenes as binary spike streams. While it enables an ultra-high temporal sampling rate, its spatial resolution is limited. In this paper, an end-to-end network is designed to reconstruct high-resolution images directly from low-resolution spike streams. Specifically, a Multi-Scale Spatio-Temporal Aggregation Representation Module (MSSTARM) is proposed to aggregate discrete spike signals through multi-scale spatio-temporal branches, effectively capturing local correlation. Additionally, an Attention Guided Deformation Alignment Module (AGDAM) is introduced, which combines deformable convolution with pixel-wise guided attention to model long-range correlation. Extensive experiments on both synthetic and real-captured data demonstrate the effectiveness of the proposed method.

    Reference
    Related
    Cited by
Get Citation
Related Videos

Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:July 01,2025
  • Revised:July 20,2025
  • Adopted:August 20,2025
  • Online:
  • Published:
Article QR Code