Tags: markkho/msdm
Tags
Merge pull request #24 from markkho/tabular_mdp_opt Major overhaul of core and tabular methods: - States/actions are assumed to be hashable (e.g., Gridworld now uses frozendict; no built-in hashing functions; dictionaries are the main way to create maps) - The distribution classes have been streamlined (Multinomial has been removed and DictDistribution is the main way to represent categorical distributions; .sample() takes a random number generator) - Policy classes have been simplified - More thorough type hints
PreviousNext