Abstract
Testing was recently proposed as a method to gain trust in learned action policies in classical planning. Test cases in this setting are states generated by a fuzzing process that performs random walks from the initial state. A fuzzing bias attempts to bias these random walks towards policy bugs, that is, states where the policy performs sub-optimally. Prior work explored a simple fuzzing bias based on policy-trace cost. Here, we investigate this topic more deeply. We introduce three new fuzzing biases based on analyses of policy-trace shape, estimating whether a trace is close to looping back on itself, whether it contains detours, and whether its goal-distance surface does not smoothly decline. Our experiments with two kinds of neural action policies show that these new biases improve bug-finding capabilities in many cases.
Original language | English |
---|---|
Title of host publication | Proceedings of the 34th International Conference on Automated Planning and Scheduling, ICAPS 2024 |
Editors | Sara Bernardini, Christian Muise |
Number of pages | 6 |
Publisher | Association for the Advancement of Artificial Intelligence |
Publication date | 30 May 2024 |
Pages | 162-167 |
ISBN (Electronic) | 9781577358893 |
DOIs | |
Publication status | Published - 30 May 2024 |
Externally published | Yes |
Event | 34th International Conference on Automated Planning and Scheduling, ICAPS 2024 - Banaff, Canada Duration: 1 Jun 2024 → 6 Jun 2024 |
Conference
Conference | 34th International Conference on Automated Planning and Scheduling, ICAPS 2024 |
---|---|
Country/Territory | Canada |
City | Banaff |
Period | 01/06/2024 → 06/06/2024 |
Series | Proceedings International Conference on Automated Planning and Scheduling, ICAPS |
---|---|
Volume | 34 |
ISSN | 2334-0835 |
Bibliographical note
Publisher Copyright:Copyright © 2024, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.