Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] FileNotFoundError: [Errno 2] No such file or directory: '/tmp/nlew7mo9hzl' #88

Open
skezle opened this issue Jun 4, 2023 · 2 comments
Labels
bug Something isn't working

Comments

@skezle
Copy link

skezle commented Jun 4, 2023

🐛 Bug

We get a file not found error when looking for the NetHack temporary directory after running our RL algorithm for > 1M steps. This error doesn't occur on all runs, only some runs get this error. We use the following environment wrapper over MiniHack:
https://github.com/AGI-Labs/continual_rl/blob/develop/continual_rl/experiments/tasks/make_minihack_task.py.

To Reproduce

Steps to reproduce the behavior:

We run our RL algorithm on any Minihack environments (RoomRandom-15x15 or RoomTrap-15x15). Occasionally after around 1M steps we obtain the following FileNotFoundError when looking for the NetHack tmp directory:

Screen Shot 2023-06-04 at 11 14 29 PM

Expected behavior

No FileNotFoundError when running our RL algorithm on MiniHack.

Environment

I'm running this on a SLURM scheduler so didn't collect the GPU info.

Collecting environment information...
MiniHack version: 0.1.3
NLE version: 0.8.1
Gym version: 0.23.1
PyTorch version: N/A
Is debug build: N/A
CUDA used to build PyTorch: N/A

OS: CentOS Linux release 8.1.1911 (Core)
GCC version: (conda-forge gcc 10.3.0-16) 10.3.0
CMake version: version 3.23.2

Python version: 3.8
Is CUDA available: N/A
CUDA runtime version: Could not collect
GPU models and configuration: Could not collect
Nvidia driver version: Could not collect
cuDNN version: Could not collect

Versions of relevant libraries:
[pip3] numpy==1.19.5
[conda] Could not collect

Additional context

@skezle skezle added the bug Something isn't working label Jun 4, 2023
@JupiLogy
Copy link

This looks like an NLE problem as the directory creation goes back to there. That, and/or a problem with your computer deleting temp files automatically after a certain amount of time?

@Bpoole908
Copy link

Was there any solution to this?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

3 participants