Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] list.sort_values returning incorrect results in some cases. #16455

Closed
ayushdg opened this issue Jul 31, 2024 · 1 comment · Fixed by #16463
Closed

[BUG] list.sort_values returning incorrect results in some cases. #16455

ayushdg opened this issue Jul 31, 2024 · 1 comment · Fixed by #16463
Assignees
Labels
bug Something isn't working

Comments

@ayushdg
Copy link
Member

ayushdg commented Jul 31, 2024

Describe the bug
Ran into a bug where sorting list values in a series led to a few rows producing incorrectly sorted results.

Steps/Code to reproduce bug

import cudf

df = cudf.read_parquet("listtype.parquet")
sorted_df = df.a.list.sort_values()
sorted_df.to_pandas().apply(lambda x: sorted(x) != x).sum()
3

Effectively 3 rows returned incorrect results.

More specifically these 3 indices: [79888, 122495, 163667] have incorrectly sorted results.

Repro Data: data.tgz

Expected behavior
Correctly sorted values

Environment overview (please complete the following information)

  • Environment location: Bare-metal
  • Method of cuDF install: pip release 24.06
    • If method of install is [Docker], provide docker pull & docker run commands used

Environment details
Please run and paste the output of the cudf/print_env.sh script here, to gather any other relevant environment details

Additional context
Add any other context about the problem here.

@ayushdg ayushdg added the bug Something isn't working label Jul 31, 2024
@Matt711
Copy link
Contributor

Matt711 commented Aug 1, 2024

I was able to reproduce in 24.10.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
Status: Done
Development

Successfully merging a pull request may close this issue.

3 participants