Publications

2022

  1. StaTeS-SQL: Soft Q Learning with State-Dependent Temperature Scheduling
    Dailin Hu
    2022
  2. Reducing variance in temporal-difference value estimation via ensemble of deep networks
    Litian Liang , Yaosheng Xu , Stephen McAleer , and 4 more authors
    In International Conference on Machine Learning , 2022

2021

  1. Temporal-difference value estimation via uncertainty-guided soft updates
    Litian Liang , Yaosheng Xu , Stephen McAleer , and 4 more authors
    NeurIPS Deep Reinforcement Learning workshop, 2021
  2. Target entropy annealing for discrete soft actor-critic
    Yaosheng Xu , Dailin Hu , Litian Liang , and 3 more authors
    NeurIPS Deep Reinforcement Learning workshop, 2021
  3. Count-Based Temperature Scheduling for Maximum Entropy Reinforcement Learning
    Dailin Hu , Pieter Abbeel , and Roy Fox
    NeurIPS Deep Reinforcement Learning workshop, 2021