Skip to content

Commit

Permalink
Fix PBVI bug for deterministic environments
Browse files Browse the repository at this point in the history
  • Loading branch information
markkho committed Jan 10, 2025
1 parent 2d2cf37 commit 41572c6
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions msdm/algorithms/pointbasedvalueiteration.py
Original file line number Diff line number Diff line change
Expand Up @@ -40,6 +40,8 @@ def expand_beliefs(pomdp : PartiallyObservableMDP, belief_set):
continue
max_L1_nbs = nbs[L1_nb_dist == max_L1_nb_dist]
new_bs.append(max_L1_nbs)
if len(new_bs) == 0:
return belief_set
new_bs = np.concatenate(new_bs)
belief_set = np.concatenate((belief_set, new_bs))
belief_set = np.unique(belief_set, axis=0)
Expand Down

0 comments on commit 41572c6

Please sign in to comment.