Fix Metta's Env Training #346
Open
harshbhatt7585 wants to merge 68 commits into PufferAI:3.0 from harshbhatt7585:fix-metta-train
aef91f5 Fix env creation (harshbhatt7585)
1d51d81 remove metta.yaml (harshbhatt7585)
3bb8380 remove yaml args (harshbhatt7585)
8d8f6ae revert num_workers (harshbhatt7585)
84ab0f3 fix import (harshbhatt7585)
3141dba make like arena_basic_easy_shaped (harshbhatt7585)
1040fcc use MettaGridPufferBase (harshbhatt7585)
1dba00c soft code values (harshbhatt7585)
f1ede99 fix env (harshbhatt7585)
95a5134 change agent (harshbhatt7585)
e836e8b lint (harshbhatt7585)
a4db8f6 remove env_id (harshbhatt7585)
7e8191f pass state (harshbhatt7585)
7e0c8bc remove recurrent (harshbhatt7585)
a19ec18 clean code (harshbhatt7585)
029a33c Refactor & Clean (harshbhatt7585)
a127966 basic env (harshbhatt7585)
427f6a8 fix action space issue (harshbhatt7585)
8110f56 Hyperparamter change (harshbhatt7585)
1378605 Clamp log ratio (harshbhatt7585)
1f08189 reduce workers (harshbhatt7585)
17fa6a3 remove rnn (harshbhatt7585)
172f08f restore minibatch size (harshbhatt7585)
a1df6c5 use easy_shaped_arena_basic configuration (harshbhatt7585)
76dbf60 restore pufferl (harshbhatt7585)
bc16929 fix log prob (harshbhatt7585)
72ed0cd fix sample logits (harshbhatt7585)
cdd9557 log prob change (harshbhatt7585)
3157d26 clean (harshbhatt7585)
e9998a6 restore rnn (harshbhatt7585)
a6631d6 restore recurrent architecture (harshbhatt7585)
372a402 max vec (harshbhatt7585)
b44d270 clamp max_vec (harshbhatt7585)
f4d9d8e normalize (harshbhatt7585)
ba5b207 update env config (harshbhatt7585)
4c9a094 remove config (harshbhatt7585)
fb316d9 fix num of layers (harshbhatt7585)
aa6ac2f add lstm (harshbhatt7585)
997c27c remove forward (harshbhatt7585)
efd84eb Merge branch 'PufferAI:3.0' into fix-metta-train (harshbhatt7585)
71b499e refactor encode_observation (harshbhatt7585)
388fe5c increase cnn (harshbhatt7585)
fb43c1a add standard deviation (harshbhatt7585)
6a2f1f0 do not clamp reward for mettagrid (harshbhatt7585)
5549e95 remove override and trust the parent (harshbhatt7585)
99fb972 revert (harshbhatt7585)
9eb3d8a fix max_vec (harshbhatt7585)
b5777ef adjust recent metta refactor (harshbhatt7585)
fe2e2ba set buffer how metta does (harshbhatt7585)
4bc0f6c remove comments (harshbhatt7585)
65aff9c remove print comments (harshbhatt7585)
9e7f456 env cleanup (harshbhatt7585)
141f425 revert num_of_workers (harshbhatt7585)
0c919b9 remove clamping rewards (harshbhatt7585)
26b37c5 clean code (harshbhatt7585)
51d17b2 clean code (harshbhatt7585)
bbefce1 clean torch.py (harshbhatt7585)
1538779 change action space init (harshbhatt7585)
04e67f5 revert action space (harshbhatt7585)
c121d1d correct git urls and path (harshbhatt7585)
284d477 concise (harshbhatt7585)
bc8f720 revert timestep (harshbhatt7585)
92b2132 change version (harshbhatt7585)
124ee1a Merge remote-tracking branch 'origin' into fix-metta-train (harshbhatt7585)
280d843 change gymnasium version (harshbhatt7585)
d992a4d change pettingzoo version (harshbhatt7585)
10e7bde change version (harshbhatt7585)
ea7646a remove space (harshbhatt7585)