Observed Adversaries in Deep Reinforcement Learning, Eugene Lim★ and Harold Soh★, AAAI Fall Symposium Series, Artificial Intelligence for Human-Robot Interaction, 2022
Links: Paper | GitHub

In this work, we point out the problem of observed adversaries for deep policies. Specifically, recent work has shown that deep reinforcement learning is susceptible to adversarial attacks in which an observed adversary acts under environmental constraints to induce natural but adversarial observations. This setting is particularly relevant for HRI, since robots in HRI settings are expected to perform their tasks around and with other agents. We demonstrate that this effect persists even with low-dimensional observations. We further show that these adversarial attacks transfer across victims, which potentially allows a malicious attacker to train an adversary without access to the target victim.
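
To make the setting concrete, here is a minimal sketch of the observed-adversary training loop: a victim policy is frozen, and a second, embodied agent is trained to minimize the victim's return using only its own legitimate actions, which the victim then observes. This is an illustrative sketch, not the paper's code; the toy environment, network sizes, and hyperparameters below are all assumptions.

# Minimal sketch of the observed-adversary setting (illustrative; not the
# paper's actual environment or training setup). The adversary never edits
# the victim's observation directly -- it only moves its own body, which
# appears in the victim's observation, so the resulting attack is "natural".
import numpy as np
import torch
import torch.nn as nn

class ToyMarkovGame:
    """Hypothetical 1-D world: the victim is rewarded for reaching x >= 1;
    the adversary is a second embodied agent the victim can observe."""
    def reset(self):
        self.victim_pos, self.adv_pos, self.t = 0.0, 0.5, 0
        return self._obs()

    def _obs(self):
        # The adversary's position is part of the victim's observation --
        # the only channel through which the attack can act.
        return np.array([self.victim_pos, self.adv_pos], dtype=np.float32)

    def step(self, victim_action, adv_action):  # actions in {-1, +1}
        self.victim_pos += 0.1 * victim_action
        self.adv_pos = float(np.clip(self.adv_pos + 0.1 * adv_action, -1.0, 1.0))
        self.t += 1
        reward = 1.0 if self.victim_pos >= 1.0 else 0.0
        done = reward > 0.0 or self.t >= 50
        return self._obs(), reward, done

env = ToyMarkovGame()
victim = nn.Sequential(nn.Linear(2, 32), nn.Tanh(), nn.Linear(32, 2))
victim.eval()  # stands in for a pretrained victim policy; frozen throughout
adversary = nn.Sequential(nn.Linear(2, 32), nn.Tanh(), nn.Linear(32, 2))
optimizer = torch.optim.Adam(adversary.parameters(), lr=1e-2)

for episode in range(500):
    obs, done = env.reset(), False
    log_probs, victim_return = [], 0.0
    while not done:
        x = torch.from_numpy(obs)
        with torch.no_grad():  # the victim is only queried, never updated
            v_act = 2 * victim(x).argmax().item() - 1
        dist = torch.distributions.Categorical(logits=adversary(x))
        a = dist.sample()
        log_probs.append(dist.log_prob(a))
        obs, r, done = env.step(v_act, 2 * a.item() - 1)
        victim_return += r
    # REINFORCE with the adversary's reward set to the negative of the
    # victim's return: the adversary is rewarded for the victim's failure.
    loss = torch.stack(log_probs).sum() * victim_return
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()

The structural point of the sketch is that the adversary's influence is mediated entirely by the environment dynamics: because it can only act through legitimate moves, the observations it induces remain plausible, unlike pixel-level perturbation attacks.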

Resources

You can find our paper here.

Citation

Please consider citing our paper if you build upon our results and ideas.

Eugene Lim★ and Harold Soh★, “Observed Adversaries in Deep Reinforcement Learning”, AAAI Fall Symposium Series, Artificial Intelligence for Human-Robot Interaction, 2022

@inproceedings{lim2022observed,
  title     = {Observed Adversaries in Deep Reinforcement Learning},
  author    = {Lim, Eugene and Soh, Harold},
  booktitle = {AAAI Fall Symposium Series, Artificial Intelligence for Human-Robot Interaction},
  year      = {2022}
}

Contact

If you have questions or comments, please contact Eugene.

