Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Updates SG PropertyGraph and cugraph-service to apply DataFrame.fillna() based on latest cuDF changes #4408

Merged

Conversation

rlratzel
Copy link
Contributor

@rlratzel rlratzel commented May 9, 2024

This handles a recent cuDF change by applying non-dict and non-Series values for a fillna() call on PropertyGraph instances only to the user-defined columns, with the assumption that savvy users that intend to update the "internal" columns, or users that are aware of their own categorical dtype columns, will use a dict or Series value to properly apply dtypes as needed.

This also updates code in cugraph-service that serializes dataframes to numpy bytes to properly convert NA values when categoricals are present.

Notes:

  • This is only applied to the SG PropertyGraph class. The MG class needs further review as to how to best apply the same policy (and because there are other MG failing tests that need addressed). Since this is blocking CI for the SG case only, this PR is being submitted now and MG will be addressed later, which should be okay since PropertyGraph is experimental.
  • This could be considered a breaking change if PropertyGraph was not experimental.

…perations on the entire vertex or edge DataFrame, to accommodate an update to cuDF which does not accept all fillna values as-is if applied to a categorical dtype.
@rlratzel rlratzel added improvement Improvement / enhancement to an existing function non-breaking Non-breaking change labels May 9, 2024
@rlratzel rlratzel self-assigned this May 9, 2024
@rlratzel rlratzel requested a review from a team as a code owner May 9, 2024 14:24
@github-actions github-actions bot added the python label May 9, 2024
…ategoricals, updates tests to use latest datasets API.
@rlratzel rlratzel changed the title Updates SG PropertyGraph to apply fillna() only to user-defined columns when non-dict/non-Series values passed in Updates SG PropertyGraph and cugraph-service to apply DataFrame.fillna() based on latest cuDF changes May 9, 2024
Copy link
Member

@alexbarghi-nv alexbarghi-nv left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

👍

@alexbarghi-nv
Copy link
Member

/merge

@rapids-bot rapids-bot bot merged commit 0639797 into rapidsai:branch-24.06 May 10, 2024
136 checks passed
@alexbarghi-nv
Copy link
Member

/merge

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
improvement Improvement / enhancement to an existing function non-breaking Non-breaking change python
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants