Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Spelling #1214

Open
wants to merge 38 commits into
base: master
Choose a base branch
from
Open
Changes from 1 commit
Commits
Show all changes
38 commits
Select commit Hold shift + click to select a range
135c679
spelling: algorithm
jsoref Jan 24, 2023
6b1149a
spelling: auxiliary
jsoref Jan 24, 2023
7135c55
spelling: coefficient
jsoref Jan 24, 2023
9b18680
spelling: contrasting
jsoref Jan 24, 2023
7458d59
spelling: demonstrations
jsoref Jan 24, 2023
f57a63c
spelling: deterministic
jsoref Jan 24, 2023
8033327
spelling: directory
jsoref Jan 24, 2023
6818376
spelling: evaluate
jsoref Jan 24, 2023
22b8c4f
spelling: evaluation
jsoref Jan 24, 2023
a7a7511
spelling: gradient
jsoref Jan 24, 2023
5498b5e
spelling: graph
jsoref Jan 24, 2023
e9bf097
spelling: implementation
jsoref Jan 24, 2023
a8d2ff1
spelling: importance
jsoref Jan 24, 2023
6895b22
spelling: independent
jsoref Jan 24, 2023
1a933f0
spelling: indexes
jsoref Jan 24, 2023
b8be389
spelling: intended
jsoref Jan 24, 2023
782bbdb
spelling: macos
jsoref Jan 24, 2023
01f44c9
spelling: meaning
jsoref Jan 24, 2023
9de11b2
spelling: nothing
jsoref Jan 24, 2023
80c5812
spelling: observed
jsoref Jan 24, 2023
7f0bd23
spelling: optimize
jsoref Jan 24, 2023
cd7aea9
spelling: parameters
jsoref Jan 24, 2023
cb1b27b
spelling: policy
jsoref Jan 24, 2023
b39692b
spelling: published
jsoref Jan 24, 2023
4cc118a
spelling: redundant
jsoref Jan 24, 2023
811937a
spelling: rescale
jsoref Jan 24, 2023
eff93cb
spelling: sequence
jsoref Jan 24, 2023
cf226ed
spelling: specified
jsoref Jan 24, 2023
6f037c7
spelling: stabilize
jsoref Jan 24, 2023
810b4d1
spelling: stateful
jsoref Jan 24, 2023
c3e81e3
spelling: stochastic
jsoref Jan 24, 2023
df25a9d
spelling: strategy
jsoref Jan 24, 2023
36f9c0a
spelling: subprocesses
jsoref Jan 24, 2023
294f2c0
spelling: to
jsoref Jan 24, 2023
663e81f
spelling: training
jsoref Jan 24, 2023
dce56c8
spelling: update
jsoref Jan 24, 2023
78d1551
spelling: variable
jsoref Jan 24, 2023
d24c669
spelling: visualizing
jsoref Jan 24, 2023
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
spelling: gradient
Signed-off-by: Josh Soref <[email protected]>
  • Loading branch information
jsoref committed Jan 24, 2023
commit a7a7511756a595434c796cb2dd6585ef515fc595
2 changes: 1 addition & 1 deletion baselines/acer/acer.py
Original file line number Diff line number Diff line change
Expand Up @@ -123,7 +123,7 @@ def custom_getter(getter, *args, **kwargs):
# entropy = tf.reduce_mean(strip(train_model.pd.entropy(), nenvs, nsteps))
entropy = tf.reduce_mean(cat_entropy_softmax(f))

# Policy Graident loss, with truncated importance sampling & bias correction
# Policy Gradient loss, with truncated importance sampling & bias correction
v = strip(v, nenvs, nsteps, True)
check_shape([qret, v, rho_i, f_i], [[nenvs * nsteps]] * 4)
check_shape([rho, f, q], [[nenvs * nsteps, nact]] * 2)
Expand Down