Interpreting Deep Q-Networks: A Rule-Based Comparison with First-Order Logic in Wumpus World
Abstract
Deep reinforcement learning models such as Deep Q-Networks (DQNs) achieve strong performance in both simple and complex environments, but their decision-making process remains largely opaque. This work addresses the interpretability challenge by proposing a method for extracting symbolic rules from a trained DQN and comparing them with those of logic-based agents. The method is showcased in the popular Wumpus World domain. Rules are extracted from each agent by training decision trees to mimic its behavior, and the resulting rule sets are compared using Jaccard similarity and simple structural metrics. Results show that, despite similar performance, DQNs and logic agents rely on partially overlapping but structurally largely distinct decision rules. This highlights the feasibility of translating subsymbolic policies into interpretable rules and reveals meaningful structural differences between learned and symbolic strategies.
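The rule-set comparison described in the abstract can be sketched as follows; this is a minimal illustration of Jaccard similarity over extracted rules, where the rule encoding and the example (condition, action) pairs are hypothetical, not the paper's actual extracted rules.

```python
def jaccard_similarity(rules_a: set, rules_b: set) -> float:
    """Jaccard similarity between two rule sets: |A ∩ B| / |A ∪ B|."""
    union = rules_a | rules_b
    if not union:
        return 1.0  # two empty rule sets are trivially identical
    return len(rules_a & rules_b) / len(union)

# Hypothetical rules distilled from each agent, encoded as
# (condition, action) pairs; the names are illustrative only.
dqn_rules = {
    ("breeze_ahead", "turn_left"),
    ("stench_ahead", "shoot"),
    ("glitter_here", "grab"),
}
logic_rules = {
    ("breeze_ahead", "turn_left"),
    ("stench_ahead", "turn_right"),
    ("glitter_here", "grab"),
}

# Two of four distinct rules are shared, so the similarity is 0.5.
print(jaccard_similarity(dqn_rules, logic_rules))  # → 0.5
```

In practice, the rule sets would come from decision trees fitted to each agent's state-action pairs, with each root-to-leaf path serving as one rule.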
Keywords:
Explainable AI, Deep Q-Network, First-Order Logic
Details
- Issue
- Vol. 28 No. 4 (2024)
- Section
- Research article
- Published
- 2025-12-09
- DOI:
- https://doi.org/10.34808/tq2024/28.4/c
- License:
Copyright (c) 2025 TASK Quarterly

This work is licensed under a Creative Commons Attribution 4.0 International License.
