AIRI-Institute/IGOR

IGOR – Instruction Following with Goal-Conditioned Reinforcement Learning in Virtual Environments

In this study, we address the challenge of enabling an artificial intelligence agent to execute complex language instructions within virtual environments. Our framework assumes that these instructions involve intricate linguistic structures and multiple interdependent tasks that must be navigated successfully to achieve the desired outcomes. To manage these complexities effectively, we propose a hierarchical framework that combines the deep language comprehension of large language models (LLMs) with the adaptive action-execution capabilities of reinforcement learning (RL) agents. The language module (based on an LLM) translates the language instruction into a high-level action plan, which is then executed by a pre-trained RL agent. We demonstrate the effectiveness of our approach in two different environments: IGLU, where agents are instructed to build structures, and Crafter, where agents perform tasks and interact with objects in the surrounding environment according to language commands.
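The two-level pipeline described above can be sketched at toy scale. Everything here (class names, the `plan`/`execute` methods, the splitting heuristic) is an illustrative assumption, not the repository's actual API; a real language module would call a fine-tuned LLM, and a real agent would act in the environment.

```python
from dataclasses import dataclass
from typing import List

@dataclass
class Subtask:
    """One step of the high-level plan produced by the language module."""
    name: str

class LanguageModule:
    """Stand-in for the LLM: maps a free-form instruction to subtasks."""
    def plan(self, instruction: str) -> List[Subtask]:
        # Toy heuristic: split on "and" to separate interdependent tasks.
        return [Subtask(part.strip()) for part in instruction.split(" and ")]

class RLAgent:
    """Stand-in for the pre-trained goal-conditioned RL agent."""
    def execute(self, subtask: Subtask) -> str:
        return f"done: {subtask.name}"

def run_instruction(instruction: str) -> List[str]:
    """The language module plans; the RL agent executes each subtask in order."""
    planner, agent = LanguageModule(), RLAgent()
    return [agent.execute(task) for task in planner.plan(instruction)]

print(run_instruction("collect wood and defeat zombie"))
```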

Paper: Instruction Following with Goal-Conditioned Reinforcement Learning in Virtual Environments

Citation:

@article{volovikova2024instruction,
      title={Instruction Following with Goal-Conditioned Reinforcement Learning in Virtual Environments}, 
      author={Zoya Volovikova and Alexey Skrynnik and Petr Kuderov and Aleksandr I. Panov},
      year={2024},
      eprint={2407.09287},
      archivePrefix={arXiv},
      primaryClass={cs.AI},
      url={https://arxiv.org/abs/2407.09287}, 
}

Installation

With Conda Virtual Environment (available only for LLM experiments)

  1. Create the environment:

     conda create --name Igor python=3.9

  2. Activate the environment:

     conda activate Igor

  3. Install the required packages:

     pip install -r docker/requirements.txt

With Docker

  1. Build the container image:

     cd ./docker
     sh build.sh
     cd ../

  2. Run the container:

     docker run --shm-size 20G --env WANDB_API_KEY=$WANDB_API_KEY --rm -it -v $(pwd):/code -w /code --gpus all igor bash

🤗 Links to Hugging Face LLMs

Example (IGLU)

  • Input: <Architect> Make 3 red blocks in the middle of the grid
  • Output: [0, 5, 5], [1, 1, 3], skyeast, red

Example (Crafter)

  • Input: Vanquish the undead foe, gather a single unit of metallic mineral, and forge an iron weapon
  • Output: ['Defeat Zombie', 'Collect Iron with count 1']
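The Crafter-style output above is a Python-style list literal, so it can be parsed safely without `eval`. The function name below is a hypothetical helper for illustration; only the output format shown in the example is taken from the source.

```python
import ast

def parse_crafter_plan(output: str) -> list:
    """Parse a Crafter-style model output such as
    "['Defeat Zombie', 'Collect Iron with count 1']" into a list of subtasks.
    ast.literal_eval only evaluates literals, unlike eval, so it is a
    safer choice for untrusted model output.
    """
    plan = ast.literal_eval(output)
    if not isinstance(plan, list):
        raise ValueError(f"expected a list of subtasks, got: {output!r}")
    return [str(task) for task in plan]

print(parse_crafter_plan("['Defeat Zombie', 'Collect Iron with count 1']"))
```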

Datasets

Crafter Dataset Generation

To generate the Crafter dataset, run:

python3 scripts/crafter_dataset_generator.py
  • The datasets for the Crafter environment can be found at ./datasets/crafter.

  • The datasets for the IGLU environment, including its augmented and primitive versions, can be found at ./datasets/iglu.
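Each dataset pairs a language instruction with its decomposition into subtasks. The record schema below is an invented illustration of that pairing; the actual files under ./datasets/crafter and ./datasets/iglu may use a different layout.

```python
import json

# Hypothetical record schema, shown only to illustrate the
# instruction -> subtask pairing; the real dataset files may differ.
record = {
    "instruction": "Vanquish the undead foe and gather a single unit of metallic mineral",
    "subtasks": ["Defeat Zombie", "Collect Iron with count 1"],
}

line = json.dumps(record)   # one JSON object per line (JSONL style)
decoded = json.loads(line)
print(decoded["subtasks"])
```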

Run Experiments with LLM

Run LLM Tuning on the Crafter Dataset

To run LLM tuning on the Crafter dataset, execute:

sh scripts/crafter/train_llm.sh

Run LLM Tuning on the Original IGLU Dataset

To run LLM tuning on the original IGLU dataset, execute:

sh scripts/iglu/train_llm.sh

Run LLM Tuning in IGLU with Subtasks as Primitives

To run LLM tuning in IGLU with subtasks as primitives, execute:

sh scripts/iglu/train_llm_prim.sh
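All three tuning scripts fine-tune an LLM on instruction/plan pairs. A common way to prepare such pairs for supervised fine-tuning is a prompt/completion record like the sketch below; the template and function name are assumptions for illustration, and the exact format used by the train_llm.sh scripts may differ.

```python
def to_training_example(instruction: str, plan: list) -> dict:
    """Turn an (instruction, plan) pair into a prompt/completion record
    of the kind commonly used for supervised LLM fine-tuning.
    Hypothetical template -- not necessarily the one the scripts use.
    """
    return {
        "prompt": f"Instruction: {instruction}\nPlan:",
        "completion": " " + str(plan),
    }

example = to_training_example(
    "Defeat the zombie and collect one iron",
    ["Defeat Zombie", "Collect Iron with count 1"],
)
print(example["prompt"])
```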

Run Experiments with RL

Run RL Training

To run RL training in the Crafter environment, execute:

sh scripts/crafter/train_rl.sh
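The goal-conditioned idea behind the RL training can be shown at toy scale: the agent's value function takes the goal as part of its input, so one policy serves many subtasks. Everything below (the 1-D corridor environment, the hyperparameters, tabular Q-learning itself) is an invented illustration, not the repository's agent or training script.

```python
import random

def train_goal_conditioned_q(n_states=5, episodes=2000, seed=0):
    """Tabular goal-conditioned Q-learning on a 1-D corridor.

    The Q-table is keyed by (state, goal), so the goal is part of the
    agent's input -- the same idea, at toy scale, as conditioning the
    RL agent on a subtask produced by the language module.
    """
    rng = random.Random(seed)
    moves = (-1, +1)                    # action 0: left, action 1: right
    q = {}                              # (state, goal) -> [q_left, q_right]
    alpha, gamma = 0.5, 0.9

    for _ in range(episodes):
        goal = rng.randrange(1, n_states)
        state = 0
        for _ in range(4 * n_states):   # bounded episode length
            a = rng.randrange(2)        # random behavior policy (off-policy)
            nxt = min(max(state + moves[a], 0), n_states - 1)
            reward = 1.0 if nxt == goal else 0.0
            qs = q.setdefault((state, goal), [0.0, 0.0])
            nqs = q.setdefault((nxt, goal), [0.0, 0.0])
            # Terminal transitions bootstrap with reward only.
            target = reward if nxt == goal else gamma * max(nqs)
            qs[a] += alpha * (target - qs[a])
            state = nxt
            if state == goal:
                break
    return q

def greedy_rollout(q, goal, n_states=5):
    """Follow the learned greedy policy toward the given goal."""
    state, path = 0, [0]
    for _ in range(4 * n_states):
        qs = q.get((state, goal), [0.0, 0.0])
        a = qs.index(max(qs))
        state = min(max(state + (-1, +1)[a], 0), n_states - 1)
        path.append(state)
        if state == goal:
            break
    return path

q_table = train_goal_conditioned_q()
print(greedy_rollout(q_table, goal=3))
```

Because Q-learning is off-policy, a purely random behavior policy still yields a greedy policy that heads to whichever goal it is conditioned on.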