Hosein Hasanbeig
Short Bio:
I am at Microsoft Research, New York. I completed my Ph.D. in 2020 in the Computer Science Department of the University of Oxford, under the supervision of Alessandro Abate and Daniel Kroening. I was also a lecturer at St Catherine's College, University of Oxford, teaching Computer-aided Formal Verification from 2018 to 2022. Prior to Oxford, I was a research assistant in the Systems Control Lab at the University of Toronto, where I received my M.Sc. in Electrical and Computer Engineering in 2016.
My research focus is on the design and analysis of safe, interpretable, and explainable machine learning algorithms in decision-making problems. Most of my work is at the intersection of reinforcement learning, automatic control, formal methods, and game theory.
Next
- • 24 Oct 2022 ¦ Two papers have been accepted to NeurIPS ML Safety Workshop .
- • 20 Jun 2022 ¦ "LCRL: Logically-Constrained Reinforcement Learning" has been accepted to QEST'22 as a tool paper! The codebase is available here and the reprint will be online soon.
- • 15 Jul 2021 ¦ Our Work on deep RL for continuous motion planning with temporal logic has been accepted to IROS'21 and IEEE Robotics and Automation.
- • 17 Dec 2020 ¦ "Shielding Atari Games with Bounded Prescience" has been accepted to AAMAS'21! Preprint will be online soon.
- • 01 Dec 2020 ¦ "DeepSynth : Automata Synthesis for Automatic Task Segmentation in Deep RL" has been accepted to AAAI'21.
- • 01 Sep 2020 ¦ Our Work on modular deep RL with temporal logic has been accepted to FORMATS'20.
- • 03 Mar 2020 ¦ Our invited submission to OVERLAY is now available here .
- • 15 Jan 2020 ¦ Our Work on safe RL has been accepted to AAMAS'20, Auckland, New Zealand.
- • 05 Dec 2019 ¦ Alessandro will present our work at CDC'19, December 12, 2019, 18:10-18:30.
- • 19 Jul 2019 ¦ Our paper has been accepted to CDC'19, Nice Acropolis, France.
- • 15 May 2019 ¦ I'll be presenting our work on Logical Neural FQ at AAMAS'19, Montreal, Canada.
Publications in Reverse Chronological Order
2022
- • Mitta, R., Hasanbeig, H., Kroening, D., Abate, A., "Risk-aware Bayesian Reinforcement Learning for Cautious Exploration", NeurIPS, MLSW, 2022. [Bib ] [PDF ]
- • Barez, F., Hasanbeig, H., Abate, A., "System III: Learning with Domain Knowledge for Safety Constraints", NeurIPS, MLSW, 2022. [Bib ] [PDF ]
- • Hasanbeig, H., Kroening, D., Abate, A., "LCRL: Certified Policy Synthesis via Logically-Constrained Reinforcement Learning", CONFEST, 2022. [Bib ] [PDF ] [Code ]
2021
- • Hasanbeig, M., Jeppu, N. Y., Abate, A., Melham, T., and Kroening, D., "DeepSynth: Automata Synthesis for Automatic Task Segmentation in Deep Reinforcement Learning", AAAI, 2021. [Bib ] [PDF ] [Code ]
- • Cai, M., Hasanbeig, M., Xiao, S., Abate, A., and Kan, Z., "Modular Deep Reinforcement Learning for Continuous Motion Planning with Temporal Logic", IROS, 2021. [Bib ] [PDF ]
- • Giacobbe, M., Hasanbeig, M., Kroening, D., and Wijk, H., "Shielding Atari Games with Bounded Prescience", AAMAS, 2021. [Bib ] [PDF ]
2020
- • Hasanbeig, M., Kroening, D., and Abate, A., "Deep Reinforcement Learning with Temporal Logics", International Conference on Formal Modeling and Analysis of Timed Systems, 2020. [Bib ] [PDF ]
- • Hasanbeig, M., Abate, A., and Kroening, D., "Cautious Reinforcement Learning with Logical Constraints", International Conference on Autonomous Agents and Multi-agent Systems, 2020. [Bib ] [PDF ]
- • Hasanbeig, M., Kroening, D., and Abate, A., "Towards Verifiable and Safe Model-Free Reinforcement Learning", Workshop on Artificial Intelligence and Formal Verification, Logics, Automata and Synthesis (OVERLAY), 2020 [invited]. [Bib ] [PDF ]
- • Ringstrom, T.J., Hasanbeig, M., and Abate, A., "Jump Operator Planning: Goal-Conditioned Policy Ensembles and Zero-Shot Transfer", CoRR abs/2007.02527, 2020. [Bib ] [PDF ]
2019
- • Hasanbeig, M., Kantaros, Y., Abate, A., Kroening, D., Pappas, G. J., and Lee, I., "Reinforcement Learning for Temporal Logic Control Synthesis with Probabilistic Satisfaction Guarantees", IEEE Conference on Decision and Control, 2019. [Bib ] [PDF ]
- • Lim Zun Yuan, Hasanbeig, M., Abate, A., and Kroening, D., "Modular Deep Reinforcement Learning with Temporal Logic Specifications", CoRR abs/1909.11591, 2019. [Bib ] [PDF ]
- • Hasanbeig, M. , Abate, A., and Kroening, D., "Certified Reinforcement Learning with Logic Guidance", CoRR abs/1902.00778, 2019. [Bib ] [PDF ]
- • Hasanbeig, M. , Abate, A., and Kroening, D., "Logically-Constrained Neural Fitted Q-Iteration", International Conference on Autonomous Agents and Multi-agent Systems, 2019. [Bib ] [PDF ]
2018
- • Hasanbeig, M., and Pavel, L., "From Game-theoretic Multi-agent Log Linear Learning to Reinforcement Learning ", Journal of Autonomous Agents and Multi-Agent Systems, 2018. [under review] [Bib ] [PDF ]
- • Hasanbeig, M. , Abate, A., and Kroening, D., "Logically-Constrained Reinforcement Learning", CoRR abs/1801.08099, 2018. [Bib ] [PDF ]
2017
- • Hasanbeig, M., and Pavel, L., "On Synchronous Binary Log-linear Learning and Second Order Q-learning", International Federation of Automatic Control, 2017. [Bib ] [PDF ]
- • Hasanbeig, M., and Pavel, L., "Distributed Coverage Control by Robot Networks in Unknown Environments Using a Modified EM Algorithm", International Journal of Computer, Electrical, Automation, Control and Information Engineering, 2017. [Bib ] [PDF ]
2016
- • Hasanbeig, M. "Multi-agent Learning in Coverage Control Games", MSc Thesis, 2016. [Bib ] [PDF ]