Research

Ashish Gaurav

September 17, 2025

Here is a collection of my research publications, sorted by date (most recent first).

Techniques to learn constraints from demonstrations
Ph.D. thesis
[link] [code]

Understanding Constraint Inference in Safety-Critical Inverse Reinforcement Learning
ICLR 2025
B. Yue, S. Wang, A. Gaurav, J. Li, P. Poupart, G. Liu
[openreview] [code]

A Comprehensive Survey on Inverse Constrained RL: Definitions, Progress and Challenges
TMLR 2024
G. Liu, S. Xu, S. Liu, A. Gaurav, S.G. Subramanian, P. Poupart
[openreview] [code]

Learning Soft Constraints From Constrained Expert Demonstrations
ICLR 2023 (spotlight)
A. Gaurav, K. Rezaee, G. Liu, P. Poupart
[openreview] [code] [patent in submission]

Benchmarking Constraint Inference in Inverse Reinforcement Learning
ICLR 2023
G. Liu, Y. Luo, A. Gaurav, K. Rezaee, P. Poupart
[openreview] [code]

Transfer RL for Autonomous Driving: From WiseMove to WiseSim
ACM Transactions on Modeling and Computer Simulation (TOMACS), Vol. 31, Issue 3
A. Balakrishnan, J. Lee, A. Gaurav, K. Czarnecki, S. Sedwards
[link]

Safety-Oriented Stability Biases for Continual Learning
M. Math. Thesis
A Gaurav
[link] [code]

Simple Continual Learning Strategies for Safer Classifers
Workshop on AI Safety, AAAI 2020
A Gaurav, S Vernekar, J Lee, V Abdelzad, K Czarnecki, S Sedwards
[paper] [code]

Out-of-distribution Detection in Classifiers via Generation
Safety & Robustness in Decision Making Workshop, NeurIPS 2019
S Vernekar, A Gaurav, V Abdelzad, T Denouden, R Salay, K Czarnecki
[paper] [arXiv] [code]

WiseMove: A Framework to Investigate Safe Deep Reinforcement Learning for Autonomous Driving
Quantitative Evaluation of SysTems (QEST) 2019
J Lee*, A Balakrishnan*, A Gaurav*, K Czarnecki, S Sedwards*
[paper] [arXiv] [code]

Analysis of Confident Classifiers for Out-of-distribution Detection
SafeML Workshop, ICLR 2019
S Vernekar*, A Gaurav*, T Denouden, B Phan, V Abdelzad, R Salay, K Czarnecki
[paper] [arXiv] [code]

Design Space of Behaviour Planning for Autonomous Driving
M Ilievski, S Sedwards, A Gaurav, A Balakrishnan, A Sarkar, J Lee, F Bouchard, R De Iaco, K Czarnecki
[arXiv]