default search action

combined dblp search
author search
venue search
publication search

ask others

Josiah Hanna

Josiah P. Hanna

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[j5]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/aim/Hanna24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/aim/Hanna24
Josiah P. Hanna:
Toward the confident deployment of real-world reinforcement learning agents. AI Mag. 45(3): 396-403 (2024)
[c38]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/Hanna24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/Hanna24
Josiah P. Hanna:
Scaling Offline Evaluation of Reinforcement Learning Agents through Abstraction. AAAI 2024: 22667
[c37]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/aistats/MukherjeeXHN24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aistats/MukherjeeXHN24
Subhojyoti Mukherjee, Qiaomin Xie, Josiah P. Hanna, Robert D. Nowak:
SPEED: Experimental Design for Policy Evaluation in Linear Heteroscedastic Bandits. AISTATS 2024: 2962-2970
[c36]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/CorradoH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/CorradoH24
Nicholas Corrado, Josiah P. Hanna:
Understanding when Dynamics-Invariant Data Augmentations Benefit Model-free Reinforcement Learning Updates. ICLR 2024
[c35]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/MukherjeeHN24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/MukherjeeHN24
Subhojyoti Mukherjee, Josiah P. Hanna, Robert D. Nowak:
SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP. ICML 2024
[c34]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/PavseZ0XH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/PavseZ0XH24
Brahma S. Pavse, Matthew Zurek, Yudong Chen, Qiaomin Xie, Josiah P. Hanna:
Learning to Stabilize Online Reinforcement Learning in Unbounded State Spaces. ICML 2024
[i34]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-07102
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-07102
Jeongyeol Kwon, Liu Yang, Robert D. Nowak, Josiah Hanna:
Future Prediction Can be a Strong Evidence of Good History Representation in Partially Observable Environments. CoRR abs/2402.07102 (2024)
[i33]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-07838
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-07838
Arushi Jain, Josiah P. Hanna, Doina Precup:
Adaptive Exploration for Data-Efficient General Value Function Evaluations. CoRR abs/2405.07838 (2024)
[i32]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-02165
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-02165
Subhojyoti Mukherjee, Josiah P. Hanna, Robert D. Nowak:
SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP. CoRR abs/2406.02165 (2024)
[i31]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-05064
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-05064
Subhojyoti Mukherjee, Josiah P. Hanna, Qiaomin Xie, Robert D. Nowak:
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning. CoRR abs/2406.05064 (2024)
[i30]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-17168
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-17168
Abhinav Narayan Harish, Larry Heck, Josiah P. Hanna, Zsolt Kira, Andrew Szot:
Reinforcement Learning via Auxiliary Task Distillation. CoRR abs/2406.17168 (2024)
[i29]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-01643
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-01643
Brahma S. Pavse, Yudong Chen, Qiaomin Xie, Josiah P. Hanna:
Stable Offline Value Function Learning with Bisimulation-based Representations. CoRR abs/2410.01643 (2024)
2023
[c33]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/PavseH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/PavseH23
Brahma S. Pavse, Josiah P. Hanna:
Scaling Marginalized Importance Sampling to High-Dimensional State-Spaces via State Abstraction. AAAI 2023: 9417-9425
[c32]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/DunionMLHA23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/DunionMLHA23
Mhairi Dunion, Trevor McInroe, Kevin Sebastian Luck, Josiah P. Hanna, Stefano V. Albrecht:
Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning. ICLR 2023
[c31]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/DunionMLHA23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/DunionMLHA23
Mhairi Dunion, Trevor McInroe, Kevin Sebastian Luck, Josiah Hanna, Stefano V. Albrecht:
Conditional Mutual Information for Disentangled Representations in Reinforcement Learning. NeurIPS 2023
[c30]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/MukherjeeXHN23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/MukherjeeXHN23
Subhojyoti Mukherjee, Qiaomin Xie, Josiah Hanna, Robert D. Nowak:
Multi-task Representation Learning for Pure Exploration in Bilinear Bandits. NeurIPS 2023
[c29]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/PavseH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/PavseH23
Brahma S. Pavse, Josiah Hanna:
State-Action Similarity-Based Representations for Off-Policy Evaluation. NeurIPS 2023
[i28]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2301-12357
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-12357
Subhojyoti Mukherjee, Qiaomin Xie, Josiah Hanna, Robert D. Nowak:
SPEED: Experimental Design for Policy Evaluation in Linear Heteroscedastic Bandits. CoRR abs/2301.12357 (2023)
[i27]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-14133
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-14133
Mhairi Dunion, Trevor McInroe, Kevin Sebastian Luck, Josiah P. Hanna, Stefano V. Albrecht:
Conditional Mutual Information for Disentangled Representations in Reinforcement Learning. CoRR abs/2305.14133 (2023)
[i26]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-01896
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-01896
Brahma S. Pavse, Yudong Chen, Qiaomin Xie, Josiah P. Hanna:
Tackling Unbounded State Spaces in Continuing Task Reinforcement Learning. CoRR abs/2306.01896 (2023)
[i25]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-17786
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-17786
Nicholas E. Corrado, Josiah P. Hanna:
Understanding when Dynamics-Invariant Data Augmentations Benefit Model-Free Reinforcement Learning Updates. CoRR abs/2310.17786 (2023)
[i24]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-18247
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-18247
Nicholas E. Corrado, Yuxiao Qu, John U. Balis, Adam Labiosa, Josiah P. Hanna:
Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning. CoRR abs/2310.18247 (2023)
[i23]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-18409
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-18409
Brahma S. Pavse, Josiah P. Hanna:
State-Action Similarity-Based Representations for Off-Policy Evaluation. CoRR abs/2310.18409 (2023)
[i22]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-00327
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-00327
Subhojyoti Mukherjee, Qiaomin Xie, Josiah P. Hanna, Robert D. Nowak:
Multi-task Representation Learning for Pure Exploration in Bilinear Bandits. CoRR abs/2311.00327 (2023)
[i21]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-08290
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-08290
Nicholas E. Corrado, Josiah P. Hanna:
On-Policy Policy Gradient Reinforcement Learning Without On-Policy Sampling. CoRR abs/2311.08290 (2023)
2022
[c28]
- view
  authority control:
- export record
  dblp key:
  - conf/atal/0001CHA22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/0001CHA22
Lukas Schäfer, Filippos Christianos, Josiah P. Hanna, Stefano V. Albrecht:
Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration. AAMAS 2022: 1146-1154
[c27]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/collas/CorradoQH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/collas/CorradoQH22
Nicholas Corrado, Yuxiao Qu, Josiah P. Hanna:
Simulation-Acquired Latent Action Spaces for Dynamics Generalization. CoLLAs 2022: 661-682
[c26]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/ZhongZ0AH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ZhongZ0AH22
Rujie Zhong, Duohan Zhang, Lukas Schäfer, Stefano V. Albrecht, Josiah Hanna:
Robust On-Policy Sampling for Data-Efficient Policy Evaluation in Reinforcement Learning. NeurIPS 2022
[c25]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/uai/MukherjeeHN22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/uai/MukherjeeHN22
Subhojyoti Mukherjee, Josiah P. Hanna, Robert D. Nowak:
ReVar: Strengthening policy evaluation via reduced variance sampling. UAI 2022: 1413-1422
[i20]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-04510
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-04510
Subhojyoti Mukherjee, Josiah P. Hanna, Robert D. Nowak:
ReVar: Strengthening Policy Evaluation via Reduced Variance Sampling. CoRR abs/2203.04510 (2022)
[i19]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-14323
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-14323
Chi Zhang, Olga Papaemmanouil, Josiah Hanna:
Multi-agent Databases via Independent Learning. CoRR abs/2205.14323 (2022)
[i18]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-05480
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-05480
Mhairi Dunion, Trevor McInroe, Kevin Sebastian Luck, Josiah Hanna, Stefano V. Albrecht:
Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning. CoRR abs/2207.05480 (2022)
[i17]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-09446
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-09446
Sheelabhadra Dey, Sumedh Pendurkar, Guni Sharon, Josiah P. Hanna:
A Joint Imitation-Reinforcement Learning Framework for Reduced Baseline Regret. CoRR abs/2209.09446 (2022)
[i16]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2212-07486
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-07486
Brahma S. Pavse, Josiah P. Hanna:
Scaling Marginalized Importance Sampling to High-Dimensional State-Spaces via State Abstraction. CoRR abs/2212.07486 (2022)
[i15]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2212-08302
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-08302
Hager Radi, Josiah P. Hanna, Peter Stone, Matthew E. Taylor:
Safe Evaluation For Offline Learning: Are We Ready To Deploy? CoRR abs/2212.08302 (2022)
2021
[j4]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/ml/HannaNS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ml/HannaNS21
Josiah P. Hanna, Scott Niekum, Peter Stone:
Importance sampling in reinforcement learning with an estimated behavior policy. Mach. Learn. 110(6): 1267-1317 (2021)
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/ml/HannaDKWS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ml/HannaDKWS21
Josiah P. Hanna, Siddharth Desai, Haresh Karnan, Garrett Warnell, Peter Stone:
Grounded action transformation for sim-to-real reinforcement learning. Mach. Learn. 110(9): 2469-2499 (2021)
[c24]
- view
  authority control:
- export record
  dblp key:
  - conf/iros/DeyPSH21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iros/DeyPSH21
Sheelabhadra Dey, Sumedh Pendurkar, Guni Sharon, Josiah P. Hanna:
A Joint Imitation-Reinforcement Learning Framework for Reduced Baseline Regret. IROS 2021: 3485-3491
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/iros/HannaRFEDRRA21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iros/HannaRFEDRRA21
Josiah P. Hanna, Arrasy Rahman, Elliot Fosong, Francisco Eiras, Mihai Dobre, John Redford, Subramanian Ramamoorthy, Stefano V. Albrecht:
Interpretable Goal Recognition in the Presence of Occluded Factors for Autonomous Vehicles. IROS 2021: 7044-7051
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/paams/AhmedHFA21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/paams/AhmedHFA21
Ibrahim Ahmed, Josiah P. Hanna, Elliot Fosong, Stefano V. Albrecht:
Towards Quantum-Secure Authentication and Key Agreement via Abstract Multi-Agent Interaction. PAAMS 2021: 14-26
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2107-08966
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-08966
Lukas Schäfer, Filippos Christianos, Josiah Hanna, Stefano V. Albrecht:
Decoupling Exploration and Exploitation in Reinforcement Learning. CoRR abs/2107.08966 (2021)
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2108-02530
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2108-02530
Josiah P. Hanna, Arrasy Rahman, Elliot Fosong, Francisco Eiras, Mihai Dobre, John Redford, Subramanian Ramamoorthy, Stefano V. Albrecht:
Interpretable Goal Recognition in the Presence of Occluded Factors for Autonomous Vehicles. CoRR abs/2108.02530 (2021)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2111-14552
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-14552
Rujie Zhong, Josiah P. Hanna, Lukas Schäfer, Stefano V. Albrecht:
Robust On-Policy Data Collection for Data-Efficient Policy Evaluation. CoRR abs/2111.14552 (2021)
2020
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/ral/PavseTHWS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ral/PavseTHWS20
Brahma S. Pavse, Faraz Torabi, Josiah Hanna, Garrett Warnell, Peter Stone:
RIDM: Reinforced Inverse Dynamics Modeling for Learning from a Single Observed Demonstration. IEEE Robotics Autom. Lett. 5(4): 6262-6269 (2020)
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/atal/AultHS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/AultHS20
James Ault, Josiah P. Hanna, Guni Sharon:
Learning an Interpretable Traffic Signal Control Policy. AAMAS 2020: 88-96
[c20]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/PavseDHS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/PavseDHS20
Brahma S. Pavse, Ishan Durugkar, Josiah Hanna, Peter Stone:
Reducing Sampling Error in Batch Temporal Difference Learning. ICML 2020: 7543-7552
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/iros/KarnanDHWS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iros/KarnanDHWS20
Haresh Karnan, Siddharth Desai, Josiah P. Hanna, Garrett Warnell, Peter Stone:
Reinforced Grounded Action Transformation for Sim-to-Real Transfer. IROS 2020: 4397-4402
[c18]
- view
  authority control:
- export record
  dblp key:
  - conf/iros/DesaiKHWS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iros/DesaiKHWS20
Siddharth Desai, Haresh Karnan, Josiah P. Hanna, Garrett Warnell, Peter Stone:
Stochastic Grounded Action Transformation for Robot Learning in Simulation. IROS 2020: 6106-6111
[c17]
- view
  - electronic edition @ neurips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/DesaiDKWHS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/DesaiDKWHS20
Siddharth Desai, Ishan Durugkar, Haresh Karnan, Garrett Warnell, Josiah Hanna, Peter Stone:
An Imitation from Observation Approach to Transfer Learning with Dynamics Mismatch. NeurIPS 2020
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2007-09327
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-09327
Ibrahim Ahmed, Josiah P. Hanna, Stefano V. Albrecht:
Quantum-Secure Authentication via Abstract Multi-Agent Interaction. CoRR abs/2007.09327 (2020)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2008-01279
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-01279
Haresh Karnan, Siddharth Desai, Josiah P. Hanna, Garrett Warnell, Peter Stone:
Reinforced Grounded Action Transformation for Sim-to-Real Transfer. CoRR abs/2008.01279 (2020)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2008-01281
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-01281
Siddharth Desai, Haresh Karnan, Josiah P. Hanna, Garrett Warnell, Peter Stone:
Stochastic Grounded Action Transformation for Robot Learning in Simulation. CoRR abs/2008.01281 (2020)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2008-01594
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-01594
Siddharth Desai, Ishan Durugkar, Haresh Karnan, Garrett Warnell, Josiah Hanna, Peter Stone:
An Imitation from Observation Approach to Sim-to-Real Transfer. CoRR abs/2008.01594 (2020)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2008-06738
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-06738
Brahma S. Pavse, Ishan Durugkar, Josiah Hanna, Peter Stone:
Reducing Sampling Error in Batch Temporal Difference Learning. CoRR abs/2008.06738 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c16]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/HannaSBS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/HannaSBS19
Josiah P. Hanna, Guni Sharon, Stephen D. Boyles, Peter Stone:
Selecting Compliant Agents for Opt-in Micro-Tolling. AAAI 2019: 565-572
[c15]
- view
  - electronic edition @ acm.org
  - no references & citations available
- export record
  dblp key:
  - conf/atal/HannaS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/HannaS19
Josiah P. Hanna, Peter Stone:
Reducing Sampling Error in Policy Gradient Learning. AAMAS 2019: 1016-1024
[c14]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/HannaNS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/HannaNS19
Josiah Hanna, Scott Niekum, Peter Stone:
Importance Sampling Policy Evaluation with an Estimated Behavior Policy. ICML 2019: 2605-2613
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1906-07372
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-07372
Brahma S. Pavse, Faraz Torabi, Josiah P. Hanna, Garrett Warnell, Peter Stone:
RIDM: Reinforced Inverse Dynamics Modeling for Learning from a Single Observed Demonstration. CoRR abs/1906.07372 (2019)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1912-11023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1912-11023
James Ault, Josiah Hanna, Guni Sharon:
Learning an Interpretable Traffic Signal Control Policy. CoRR abs/1912.11023 (2019)
2018
[c13]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/ChenASHSMS18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/ChenASHSMS18
Haipeng Chen, Bo An, Guni Sharon, Josiah P. Hanna, Peter Stone, Chunyan Miao, Yeng Chai Soh:
DyETC: Dynamic Electronic Toll Collection for Traffic Congestion Alleviation. AAAI 2018: 757-765
[c12]
- view
  - electronic edition @ aaai.org
  - no references & citations available
- export record
  dblp key:
  - conf/aaaiss/HannaS18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaaiss/HannaS18
Josiah P. Hanna, Peter Stone:
Towards a Data Efficient Off-Policy Policy Gradient. AAAI Spring Symposia 2018
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1806-01347
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1806-01347
Josiah Hanna, Scott Niekum, Peter Stone:
Importance Sampling Policy Evaluation with an Estimated Behavior Policy. CoRR abs/1806.01347 (2018)
2017
[c11]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/HannaS17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/HannaS17
Josiah P. Hanna, Peter Stone:
Grounded Action Transformation for Robot Learning in Simulation. AAAI 2017: 3834-3840
[c10]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/HannaS17a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/HannaS17a
Josiah P. Hanna, Peter Stone:
Grounded Action Transformation for Robot Learning in Simulation. AAAI 2017: 4931-4932
[c9]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/HannaSN17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/HannaSN17
Josiah P. Hanna, Peter Stone, Scott Niekum:
Bootstrapping with Models: Confidence Intervals for Off-Policy Evaluation. AAAI 2017: 4933-4934
[c8]
- view
  - electronic edition @ acm.org
  - no references & citations available
- export record
  dblp key:
  - conf/atal/HannaSN17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/HannaSN17
Josiah P. Hanna, Peter Stone, Scott Niekum:
Bootstrapping with Models: Confidence Intervals for Off-Policy Evaluation. AAMAS 2017: 538-546
[c7]
- view
  - electronic edition @ acm.org
  - no references & citations available
- export record
  dblp key:
  - conf/atal/Hanna17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/Hanna17
Josiah P. Hanna:
Bridging the Gap Between Simulation and Reality. AAMAS 2017: 1834-1835
[c6]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/HannaTSN17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/HannaTSN17
Josiah P. Hanna, Philip S. Thomas, Peter Stone, Scott Niekum:
Data-Efficient Policy Evaluation Through Behavior Policy Search. ICML 2017: 1394-1403
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/robocup/MenasheKGHLNZS17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/robocup/MenasheKGHLNZS17
Jacob Menashe, Josh Kelle, Katie Genter, Josiah Hanna, Elad Liebman, Sanmit Narvekar, Ruohan Zhang, Peter Stone:
Fast and Precise Black and White Ball Detection for RoboCup Soccer. RoboCup 2017: 45-58
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/HannaTSN17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/HannaTSN17
Josiah P. Hanna, Philip S. Thomas, Peter Stone, Scott Niekum:
Data-Efficient Policy Evaluation Through Behavior Policy Search. CoRR abs/1706.03469 (2017)
2016
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/expert/GenterMMHLNZS16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/expert/GenterMMHLNZS16
Katie Genter, Patrick MacAlpine, Jacob Menashe, Josiah Hanna, Elad Liebman, Sanmit Narvekar, Ruohan Zhang, Peter Stone:
UT Austin Villa: Project-Driven Research in AI and Robotics. IEEE Intell. Syst. 31(2): 94-101 (2016)
[c4]
- view
  - electronic edition @ ceur-ws.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/ijcai/SharonHRASB16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/SharonHRASB16
Guni Sharon, Josiah Hanna, Tarun Rambha, Michael Albert, Peter Stone, Stephen D. Boyles:
Delta-Tolling: Adaptive Tolling for Optimizing Traffic Throughput. ATT@IJCAI 2016
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/HannaSN16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/HannaSN16
Josiah P. Hanna, Peter Stone, Scott Niekum:
High Confidence Off-Policy Evaluation with Models. CoRR abs/1606.06126 (2016)
2015
[c3]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/robocup/MacAlpineHLS15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/robocup/MacAlpineHLS15
Patrick MacAlpine, Josiah Hanna, Jason Liang, Peter Stone:
UT Austin Villa: RoboCup 2015 3D Simulation League Competition and Technical Challenges Champions. RoboCup 2015: 118-131
2013
[c2]
- view
  - electronic edition @ aaai.org (archived)
  - no references & citations available
- export record
  dblp key:
  - conf/aaai/PernyWGH13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/PernyWGH13
Patrice Perny, Paul Weng, Judy Goldsmith, Josiah Hanna:
Approximation of Lorenz-Optimal Solutions in Multiobjective Markov Decision Processes. AAAI (Late-Breaking Developments) 2013
[c1]
- view
  - electronic edition @ dslpitt.org (archived)
  - no references & citations available
- export record
  dblp key:
  - conf/uai/PernyWGH13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/uai/PernyWGH13
Patrice Perny, Paul Weng, Judy Goldsmith, Josiah Hanna:
Approximation of Lorenz-Optimal Solutions in Multiobjective Markov Decision Processes. UAI 2013
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/PernyWGH13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/PernyWGH13
Patrice Perny, Paul Weng, Judy Goldsmith, Josiah Hanna:
Approximation of Lorenz-Optimal Solutions in Multiobjective Markov Decision Processes. CoRR abs/1309.6856 (2013)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.