Abstract
This work tackles scene understanding for outdoor robotic navigation, relying solely on images captured by an on-board camera. Conventional visual scene understanding interprets the environment based on specific descriptive categories. However, such a representation is not directly interpretable for decision-making and constrains robot operation to a specific domain. Thus, we propose to segment egocentric images directly in terms of how a robot can navigate in them, and tailor the learning problem to an autonomous navigation task. Building around an image segmentation network, we present a generic affordance consisting of three driveability levels which applies broadly to both urban and off-road scenes. By encoding these levels with soft ordinal labels, we incorporate inter-class distances during learning, which improves segmentation compared to standard "hard" one-hot labelling. In addition, we propose a navigation-oriented pixel-wise loss weighting method which assigns higher importance to safety-critical areas. We evaluate our approach on large-scale public image segmentation datasets ranging from sunny city streets to snowy forest trails. In a cross-dataset generalization experiment, we show that our affordance learning scheme can be applied across a diverse mix of datasets and improves driveability estimation in unseen environments compared to general-purpose, single-dataset segmentation.
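The soft ordinal labelling idea from the abstract can be sketched as a softmax over negative inter-class distances, so that probability mass decays with ordinal distance from the true driveability level. This is a minimal illustrative sketch: the function name, the temperature parameter, and the absolute-distance choice are assumptions for illustration, not the paper's exact formulation.

```python
import numpy as np

def soft_ordinal_labels(true_level, num_levels=3, temperature=1.0):
    """Encode an ordinal class index as a soft label distribution.

    Probability mass decreases with distance from the true level, so
    neighbouring driveability levels receive more weight than distant
    ones, unlike one-hot "hard" labels which treat all errors equally.
    """
    levels = np.arange(num_levels)
    # Negative absolute distance to the true level, scaled by temperature
    logits = -np.abs(levels - true_level) / temperature
    # Softmax normalisation (shifted by the max for numerical stability)
    exp = np.exp(logits - logits.max())
    return exp / exp.sum()

# One-hot vs. soft encoding for the middle driveability level (index 1):
hard = np.eye(3)[1]            # [0., 1., 0.] — all mass on one class
soft = soft_ordinal_labels(1)  # symmetric mass shared with levels 0 and 2
```

Training against such targets (e.g. with a KL-divergence loss instead of cross-entropy on one-hot labels) is one common way to penalise a prediction of "undriveable" for a "possibly driveable" pixel less than an outright confusion of the two extremes.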
Original language | English
---|---
Article number | 9689949
Journal | IEEE Robotics and Automation Letters
Volume | 7
Issue number | 2
Pages (from-to) | 2913-2920
Number of pages | 8
ISSN | 2377-3766
DOIs |
Publication status | Published - Apr 2022
Keywords
- Affordances
- Deep learning for visual perception
- Image segmentation
- Labeling
- Navigation
- Roads
- Robots
- Semantics
- Computer vision for transportation
- Semantic scene understanding
Fingerprint
Dive into the research topics of 'Navigation-Oriented Scene Understanding for Robotic Autonomy: Learning to Segment Driveability in Egocentric Images'.
Activities per year
- AI for the People 2022 poster session - "Soft labelling for semantic segmentation"
  Galadrielle Humblot-Renaux (Speaker), 10 Nov 2022. Activity: Talks and presentations › Conference presentations
- ICRA 2022 session - Computer Vision for Transportation
  Galadrielle Humblot-Renaux (Speaker), 24 May 2022. Activity: Talks and presentations › Conference presentations
- Visual scene understanding for outdoor robot navigation
  Galadrielle Humblot-Renaux (Lecturer), 18 Mar 2022. Activity: Talks and presentations › Talks and presentations in private or public companies