Publications
2022
- StaTeS-SQL: Soft Q Learning with State-Dependent Temperature Scheduling2022
- Reducing variance in temporal-difference value estimation via ensemble of deep networksIn International Conference on Machine Learning , 2022
2021
- Temporal-difference value estimation via uncertainty-guided soft updatesNeurIPS Deep Reinforcement Learning workshop, 2021
- Target entropy annealing for discrete soft actor-criticNeurIPS Deep Reinforcement Learning workshop, 2021
- Count-Based Temperature Scheduling for Maximum Entropy Reinforcement LearningNeurIPS Deep Reinforcement Learning workshop, 2021