Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] Properly preparing NVTabular preprocessed data for HugeCTR inference #707

Closed
vinhngx opened this issue Apr 8, 2021 · 0 comments · Fixed by #712
Closed

[BUG] Properly preparing NVTabular preprocessed data for HugeCTR inference #707

vinhngx opened this issue Apr 8, 2021 · 0 comments · Fixed by #712
Assignees
Labels
bug Something isn't working

Comments

@vinhngx
Copy link
Contributor

vinhngx commented Apr 8, 2021

Describe the bug
NVTab encodes categorical data is not directly compatible with HugeCTR requirement and would require the value to be shifted - so that they occupy different integer ranges

Concrete conceptual example: provided there are 10 users and 10 items, NVtab encodes users in 0-9 range, and items in 0-9 range. HugeCTR expect users in 0-9 range and items in 10-19 range

Steps/Code to reproduce bug
A concrete example of shifting:

https://gitlab-master.nvidia.com/dl/hugectr/hugectr/-/blob/master/notebooks/ecommerce-example.ipynb

Expected behavior
A clear and concise description of what you expected to happen.

Environment details (please complete the following information):

  • Environment location: [Bare-metal, Docker, Cloud(specify cloud provider)]
  • Method of NVTabular install: [conda, Docker, or from source]
    • If method of install is [Docker], provide docker pull & docker run commands used

Additional context
Add any other context about the problem here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants