[BugFix] patch rand_action in TransformedEnv to read the base_env method #2699

vmoens · 2025-01-17T13:29:44Z

Stack from ghstack (oldest at bottom):

[ghstack-poisoned]

pytorch-bot · 2025-01-17T13:29:48Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2699

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 4 New Failures, 7 Pending, 3 Unrelated Failures

As of commit 2794f70 with merge base 256a700 ():

NEW FAILURES - The following jobs have failed:

Build Windows Wheels / pytorch/rl / upload / wheel-py3_9-cuda12_6 (gh)
Unable to download artifact(s): Artifact not found for name: pytorch_rl__3.9_cu126_
Continuous Benchmark (PR) / CPU Pytest benchmark (gh)
Process completed with exit code 1.
Continuous Benchmark (PR) / GPU Pytest benchmark (gh)
Process completed with exit code 1.
Unit-tests on Windows / unittests-cpu / windows-job (gh)
Process completed with exit code 1.

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

Build Windows Wheels / pytorch/rl / upload / wheel-py3_9-cpu (gh) (trunk failure)
Unable to download artifact(s): Artifact not found for name: pytorch_rl__3.9_cpu_
Build Windows Wheels / pytorch/rl / upload / wheel-py3_9-cuda11_8 (gh) (trunk failure)
Build Windows Wheels / pytorch/rl / upload / wheel-py3_9-cuda12_4 (gh) (trunk failure)
Unable to download artifact(s): Artifact not found for name: pytorch_rl__3.9_cu124_

This comment was automatically generated by Dr. CI and updates every 15 minutes.

vmoens · 2025-01-17T15:34:29Z

@kurtamohler I'm not super happy with this, as the comment says it's far from accounting for transforms that do some kind of inverse mapping but the point is that without that, a transformed chess env cannot generate random actions (because the rand_action is not the same as the one in EnvBase that would otherwise be used by TransformedEnv).

[ghstack-poisoned]

kurtamohler · 2025-01-17T23:25:23Z

torchrl/envs/transforms/transforms.py

+            #  env = PendulumEnv().append_transform(ActionDiscretizer(num_intervals=4))
+            #  env.rand_action will NOT have a discrete action!
+            #  Getting a discrete action would require coding the inverse transform of an action within
+            #  ActionDiscretizer (ie, float->int, not int->float).


Is there a reason we couldn't use self.action_spec.rand()?

>>> import torchrl >>> env = torchrl.envs.PendulumEnv().append_transform(torchrl.envs.ActionDiscretizer(num_intervals=4)) >>> env.action_spec.rand() tensor([3]) >>> env.action_spec.rand().dtype torch.int64

Update

764e2e0

[ghstack-poisoned]

This was referenced Jan 17, 2025

[Feature] example_data for NonTensor spec #2698

Open

[Feature] UnaryTransform for input entries #2700

Open

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jan 17, 2025

This was referenced Jan 17, 2025

[Feature] Tokenizer transform #2701

Open

[Feature,Refactor] Chess improvements: fen, pgn, pixels, san #2702

Open

vmoens added the bug Something isn't working label Jan 17, 2025

vmoens requested a review from kurtamohler January 17, 2025 15:20

Update

2794f70

[ghstack-poisoned]

kurtamohler reviewed Jan 17, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BugFix] patch rand_action in TransformedEnv to read the base_env method #2699

[BugFix] patch rand_action in TransformedEnv to read the base_env method #2699

vmoens commented Jan 17, 2025 •

edited

Loading

pytorch-bot bot commented Jan 17, 2025 •

edited

Loading

vmoens commented Jan 17, 2025

kurtamohler Jan 17, 2025 •

edited

Loading

[BugFix] patch rand_action in TransformedEnv to read the base_env method #2699

Are you sure you want to change the base?

[BugFix] patch rand_action in TransformedEnv to read the base_env method #2699

Conversation

vmoens commented Jan 17, 2025 • edited Loading

pytorch-bot bot commented Jan 17, 2025 • edited Loading

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2699

❌ 4 New Failures, 7 Pending, 3 Unrelated Failures

vmoens commented Jan 17, 2025

kurtamohler Jan 17, 2025 • edited Loading

Choose a reason for hiding this comment

vmoens commented Jan 17, 2025 •

edited

Loading

pytorch-bot bot commented Jan 17, 2025 •

edited

Loading

kurtamohler Jan 17, 2025 •

edited

Loading