Commit
Implement the Categorify() start_index feature request of #1074
This commit implements the feature requested in issue #1074, which asks for a start_index argument on Categorify() that offsets the categorical values assigned to vocabulary items. The start_index argument is added to the implementation of Categorify() in nvtabular/ops/categorify.py, a change that touches several places in that module. The _encode() and _write_uniques() methods in categorify.py also gain docstrings for readability. Finally, the test_categorify_lists_with_start_index() test in tests/unit/test_ops.py is updated to cover several start_index values.
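The offset idea described above can be illustrated with a small, library-free sketch. It is not the code from categorify.py: the helpers build_vocab() and encode_with_offset() are hypothetical names introduced here, the 0-based first-seen code assignment is an assumption, and null or out-of-vocabulary handling (which the commit message does not describe) is left out entirely.

```python
def build_vocab(rows):
    """Map each distinct item to a 0-based code, in first-seen order.

    Hypothetical helper for illustration only; NVTabular builds its
    vocabulary differently.
    """
    vocab = {}
    for row in rows:
        for item in row:
            if item not in vocab:
                vocab[item] = len(vocab)
    return vocab


def encode_with_offset(rows, vocab, start_index=0):
    """Translate list-valued rows to integer codes shifted by start_index."""
    return [[vocab[item] + start_index for item in row] for row in rows]


# A list column, similar in shape to what a list-based Categorify test would use.
rows = [["a"], ["a", "d"], ["c", "b"], ["b"]]
vocab = build_vocab(rows)

encoded_default = encode_with_offset(rows, vocab)                  # no offset
encoded_shifted = encode_with_offset(rows, vocab, start_index=16)  # offset by 16

print(encoded_default)
print(encoded_shifted)
```

Under these assumptions every encoded value in the shifted output is exactly start_index larger than its counterpart in the default output, which is the kind of property a test over several start_index values could check.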
Adam Lesnikowski committed Sep 3, 2021
1 parent 642e5f5, commit 8cfdd83
Showing 2 changed files with 50 additions and 6 deletions.