Fix UCObjectStore.list_objects #3284

Merged on May 13, 2024 (13 commits)

dakinggg
Contributor

What does this PR do?

Previously, the Databricks SDK did not have ls built in, and we discovered a bug in our own listing code: it did not properly handle truncation of the ls results (truncated to 999 elements). This PR fixes that bug by switching to the built-in ls in the Databricks SDK.

Manual test: I ran the convert-to-MDS script before and after the change. Before, it returned 999 elements; after, it returned the full, correct number (1350 for the directory I was testing).
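
For readers unfamiliar with this class of bug, here is a minimal, self-contained sketch of how reading only the first page of a truncated listing drops entries, and how following pagination recovers them. It does not call the Databricks SDK and is not the code from this PR: `fake_list_page`, `next_page_token`, and the file names are hypothetical stand-ins. Only the 999 limit and the 1350-entry directory come from the description above.

```python
from typing import Dict, Iterator, List, Optional

PAGE_LIMIT = 999  # truncation limit reported in the PR description


def fake_list_page(entries: List[str], page_token: Optional[str] = None) -> Dict:
    """Hypothetical stand-in for one call to a paginated listing endpoint."""
    start = int(page_token) if page_token else 0
    end = start + PAGE_LIMIT
    page = {"contents": entries[start:end]}
    if end < len(entries):
        # Assumed field name; signals that more results remain.
        page["next_page_token"] = str(end)
    return page


def list_first_page_only(entries: List[str]) -> List[str]:
    """Buggy pattern: ignores the continuation token, so results are truncated."""
    return fake_list_page(entries)["contents"]


def list_all(entries: List[str]) -> Iterator[str]:
    """Fixed pattern: keeps requesting pages until no token is returned."""
    token = None
    while True:
        page = fake_list_page(entries, token)
        yield from page["contents"]
        token = page.get("next_page_token")
        if token is None:
            break


directory = [f"shard_{i:05d}.jsonl" for i in range(1350)]  # made-up file names
truncated = list_first_page_only(directory)
complete = list(list_all(directory))
print(len(truncated), len(complete))  # prints: 999 1350
```

Delegating to a built-in listing call, as this PR does, moves the pagination loop into the SDK so callers no longer have to handle it themselves.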

@dakinggg dakinggg requested a review from irenedea May 13, 2024 22:33
@dakinggg dakinggg marked this pull request as ready for review May 13, 2024 22:33
Contributor

@irenedea irenedea left a comment

lgtm! thanks for the fix!

composer/utils/object_store/uc_object_store.py (outdated, resolved)
dakinggg and others added 2 commits May 13, 2024 18:43
Co-authored-by: Irene Dea <deaairene@gmail.com>
@dakinggg dakinggg enabled auto-merge (squash) May 13, 2024 22:57
@dakinggg dakinggg merged commit 01eec3a into mosaicml:dev May 13, 2024
15 checks passed
j316chuck pushed a commit that referenced this pull request May 16, 2024
2 participants