[KDD 2022] AutoShard: Automated Embedding Table Sharding for Recommender Systems

This is the implementation for the paper AutoShard: Automated Embedding Table Sharding for Recommender Systems. We proposed a reinforcement learning approach for embedding table sharding in distributed recommender systems, which aims to put emebedding tables to multiple GPU devices to achieve a load balance. Please refer the paper for more deteails.

Miscellaneous Resources: Have you heard of data-centric AI? Please check out our data-centric AI survey and awesome data-centric AI resources!

Cite this Work

If you find this project helpful, please cite

@inproceedings{zha2022autoshard,
  title={AutoShard: Automated Embedding Table Sharding for Recommender Systems},
  author={Zha, Daochen and Feng, Louis and Bhushanam, Bhargav and Choudhary, Dhruv and Nie, Jade and Tian, Yuandong and Chae, Jay and Ma, Yinbin and Kejariwal, Arun and Hu, Xia},
  booktitle={KDD},
  year={2022}
}

Installation

Step 1: install PyTorch

pip3 install torch

Step 2: install FBGEMM

Follow the instructions in https://github.com/pytorch/FBGEMM to install the embedding operators

Step 3: install AutoShard

pip3 install -r requirements.txt
pip3 install -e .

Run AutoShard on Synthetic Data

Step 1: download DLRM dataset

Download the data with git lfs at https://github.com/facebookresearch/dlrm_datasets

Step 2: process the dataset

python3 gen_dlrm_data.py

Note that you need to change --data argument to the path of the downloaded DLRM dataset.

Step 3: train AutoShard

python3 run_autoshard.py

Note that you need to specify --gpu-devices and --max-memory based on your GPU.

Step 4: evaluate AutoShard

python3 eval.py --alg autoshard

Not that you need to specify --gpu-devices and --max-memory based on your GPU.

Run Baselines

python3 eval.py --alg random
python3 eval.py --alg dim_greedy
python3 eval.py --alg lookup_greedy
python3 eval.py --alg size_greedy

Note that you need to specify --gpu-devices and --max-memory based on your GPU.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
autoshard		autoshard
imgs		imgs
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
eval.py		eval.py
gen_dlrm_data.py		gen_dlrm_data.py
requirements.txt		requirements.txt
run_autoshard.py		run_autoshard.py
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

[KDD 2022] AutoShard: Automated Embedding Table Sharding for Recommender Systems

Cite this Work

Installation

Run AutoShard on Synthetic Data

Run Baselines

About

Releases

Packages

Languages

License

daochenzha/autoshard

Folders and files

Latest commit

History

Repository files navigation

[KDD 2022] AutoShard: Automated Embedding Table Sharding for Recommender Systems

Cite this Work

Installation

Run AutoShard on Synthetic Data

Run Baselines

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages