Translation of Aerial Image into Digital Map via Discriminative Segmentation and Creative Generation

Ying Fu; Shuaizhe Liang; Dongdong Chen; Zhanlong Chen

doi:10.1109/TGRS.2021.3110894

Translation of Aerial Image into Digital Map via Discriminative Segmentation and Creative Generation

Ying Fu^*, Shuaizhe Liang, Dongdong Chen, Zhanlong Chen

^*Corresponding author for this work

School of Computer Science and Technology

Research output: Contribution to journal › Article › peer-review

12 Citations (Scopus)

Abstract

Automatic translation of aerial images into digital maps is an important and challenging task which is widely used in practical applications. Most of the existing works view it either as a creative image-to-image translation problem or a discriminative semantic segmentation problem. However, we notice that human annotators need to extract and understand the information in aerial images first and then translate them to online maps in a creative way, which helps them draw accurate and visually appealing online maps. In this article, we propose an end-to-end online map generation method that combines a discriminative module with a creative module based on this observation to mimic human behavior. Specifically, we first utilize a semantic segmentation module to obtain a rough aerial map, in which each region is labeled with its category, and then further improve its quality with a creative module. To train a robust network that generalizes well to unfamiliar regions, we also collect a large aerial image dataset for online map generation (AIDOMG). AIDOMG consists of 40 087 pairs of aerial images and corresponding online maps collected from nine regions of six continents. We conduct extensive experiments to verify the superiority of the new design that combines discrimination and creativity and experimental results show that the performance of the proposed method significantly outperforms baseline methods.

Original language	English
Journal	IEEE Transactions on Geoscience and Remote Sensing
Volume	60
DOIs	https://doi.org/10.1109/TGRS.2021.3110894
Publication status	Published - 2022

Keywords

Aerial image
Generative adversarial networks (GAN)
Map generation
Semantic segmentation

Access to Document

10.1109/TGRS.2021.3110894

Cite this

Fu, Y., Liang, S., Chen, D., & Chen, Z. (2022). Translation of Aerial Image into Digital Map via Discriminative Segmentation and Creative Generation. IEEE Transactions on Geoscience and Remote Sensing, 60. https://doi.org/10.1109/TGRS.2021.3110894

@article{0c2274760ef04fea9902d31dca415ec4,

title = "Translation of Aerial Image into Digital Map via Discriminative Segmentation and Creative Generation",

abstract = "Automatic translation of aerial images into digital maps is an important and challenging task which is widely used in practical applications. Most of the existing works view it either as a creative image-to-image translation problem or a discriminative semantic segmentation problem. However, we notice that human annotators need to extract and understand the information in aerial images first and then translate them to online maps in a creative way, which helps them draw accurate and visually appealing online maps. In this article, we propose an end-to-end online map generation method that combines a discriminative module with a creative module based on this observation to mimic human behavior. Specifically, we first utilize a semantic segmentation module to obtain a rough aerial map, in which each region is labeled with its category, and then further improve its quality with a creative module. To train a robust network that generalizes well to unfamiliar regions, we also collect a large aerial image dataset for online map generation (AIDOMG). AIDOMG consists of 40 087 pairs of aerial images and corresponding online maps collected from nine regions of six continents. We conduct extensive experiments to verify the superiority of the new design that combines discrimination and creativity and experimental results show that the performance of the proposed method significantly outperforms baseline methods.",

keywords = "Aerial image, Generative adversarial networks (GAN), Map generation, Semantic segmentation",

author = "Ying Fu and Shuaizhe Liang and Dongdong Chen and Zhanlong Chen",

note = "Publisher Copyright: {\textcopyright} 1980-2012 IEEE.",

year = "2022",

doi = "10.1109/TGRS.2021.3110894",

language = "English",

volume = "60",

journal = "IEEE Transactions on Geoscience and Remote Sensing",

issn = "0196-2892",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - Translation of Aerial Image into Digital Map via Discriminative Segmentation and Creative Generation

AU - Fu, Ying

AU - Liang, Shuaizhe

AU - Chen, Dongdong

AU - Chen, Zhanlong

PY - 2022

Y1 - 2022

N2 - Automatic translation of aerial images into digital maps is an important and challenging task which is widely used in practical applications. Most of the existing works view it either as a creative image-to-image translation problem or a discriminative semantic segmentation problem. However, we notice that human annotators need to extract and understand the information in aerial images first and then translate them to online maps in a creative way, which helps them draw accurate and visually appealing online maps. In this article, we propose an end-to-end online map generation method that combines a discriminative module with a creative module based on this observation to mimic human behavior. Specifically, we first utilize a semantic segmentation module to obtain a rough aerial map, in which each region is labeled with its category, and then further improve its quality with a creative module. To train a robust network that generalizes well to unfamiliar regions, we also collect a large aerial image dataset for online map generation (AIDOMG). AIDOMG consists of 40 087 pairs of aerial images and corresponding online maps collected from nine regions of six continents. We conduct extensive experiments to verify the superiority of the new design that combines discrimination and creativity and experimental results show that the performance of the proposed method significantly outperforms baseline methods.

AB - Automatic translation of aerial images into digital maps is an important and challenging task which is widely used in practical applications. Most of the existing works view it either as a creative image-to-image translation problem or a discriminative semantic segmentation problem. However, we notice that human annotators need to extract and understand the information in aerial images first and then translate them to online maps in a creative way, which helps them draw accurate and visually appealing online maps. In this article, we propose an end-to-end online map generation method that combines a discriminative module with a creative module based on this observation to mimic human behavior. Specifically, we first utilize a semantic segmentation module to obtain a rough aerial map, in which each region is labeled with its category, and then further improve its quality with a creative module. To train a robust network that generalizes well to unfamiliar regions, we also collect a large aerial image dataset for online map generation (AIDOMG). AIDOMG consists of 40 087 pairs of aerial images and corresponding online maps collected from nine regions of six continents. We conduct extensive experiments to verify the superiority of the new design that combines discrimination and creativity and experimental results show that the performance of the proposed method significantly outperforms baseline methods.

KW - Aerial image

KW - Generative adversarial networks (GAN)

KW - Map generation

KW - Semantic segmentation

UR - http://www.scopus.com/inward/record.url?scp=85115196418&partnerID=8YFLogxK

U2 - 10.1109/TGRS.2021.3110894

DO - 10.1109/TGRS.2021.3110894

M3 - Article

AN - SCOPUS:85115196418

SN - 0196-2892

VL - 60

JO - IEEE Transactions on Geoscience and Remote Sensing

JF - IEEE Transactions on Geoscience and Remote Sensing

ER -

Translation of Aerial Image into Digital Map via Discriminative Segmentation and Creative Generation

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this