MoNET: No-reference image quality assessment based on a multi-depth output network

Qingbing Sang*, Chenfei Su, Lingying Zhu, Lixiong Liu, Xiaojun Wu, Alan C. Bovik

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

3 Citations (Scopus)

Abstract

When deep convolutional neural networks perform feature extraction, the features computed at each layer express different abstractions of visual information. The earlier layers extract highly compact low-level features such as bandpass and directional primitives, whereas deeper layers extract structural features of increasing abstraction, similar to contours, shapes, and edges, becoming less effable as the depth increases. We propose a different kind of end-to-end no-reference (NR) image quality assessment (IQA) model, which is defined as a multi-depth output convolutional neural network (MoNET). It accomplishes this by mapping both shallow and deep features to perceived quality. MoNET delivers three outputs that express shallow (lower-level) and deep (high-level) features, and maps them to subjective quality scores. The multiple outputs are combined into a single, final quality score. MoNET does this by combining the responses of three learning machines, so it may be viewed as a form of ensemble learning. The experimental results on three public image quality databases show that our proposed model achieves better performance than other state-of-the-art NR IQA algorithms.

Original languageEnglish
Article number043007
JournalJournal of Electronic Imaging
Volume30
Issue number4
DOIs
Publication statusPublished - 1 Jul 2021

Keywords

  • ensemble learning
  • image quality assessment
  • multi-depth output convolutional neural network
  • no-reference

Fingerprint

Dive into the research topics of 'MoNET: No-reference image quality assessment based on a multi-depth output network'. Together they form a unique fingerprint.

Cite this