Allow Importing Multiple Documents from a single GCS Path #214

Open
holtskinner opened this issue Dec 5, 2023 · 0 comments

Inspired by https://stackoverflow.com/a/77609221/6216983

The `from_gcs()` method can only create a single Wrapped Document from a single document output in GCS.

It could be simpler for users if this method could output a list of Wrapped Documents from a GCS path, similar to `from_batch_process_operation()` and `from_batch_process_metadata()`.

However, this would be a backwards-incompatible change, so an alternative is to create a new method for this functionality. (Not sure what the best name would be.)
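As a rough illustration of what such a new method might do first, here is a minimal sketch that partitions the blob names found under a GCS path into one group per document. It assumes each document's JSON shards sit in their own subdirectory, which is an assumption about the output layout rather than something stated above. The function name `group_shards_by_directory` and the sample paths are hypothetical and not part of the library; no GCS calls are made.

```python
from collections import defaultdict
from posixpath import dirname
from typing import Dict, List


def group_shards_by_directory(blob_names: List[str]) -> Dict[str, List[str]]:
    """Group JSON shard paths by parent directory (hypothetical helper).

    Assumes one directory per document; non-JSON blobs are ignored.
    """
    groups: Dict[str, List[str]] = defaultdict(list)
    for name in blob_names:
        if name.endswith(".json"):
            groups[dirname(name)].append(name)
    # Sort for deterministic output (lexicographic order of names).
    return {prefix: sorted(shards) for prefix, shards in sorted(groups.items())}


# Hypothetical blob names, as they might be listed under a single GCS path.
blob_names = [
    "output/123/0/doc-0.json",
    "output/123/0/doc-1.json",
    "output/123/1/other-0.json",
    "output/123/1/notes.txt",
]
grouped = group_shards_by_directory(blob_names)
```

Each resulting group could then be handed to the existing single-document loader, so the new method would return one Wrapped Document per directory while `from_gcs()` keeps its current behavior.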

holtskinner added a commit that referenced this issue Dec 12, 2023
… documents from a GCS directory.

Fixes #214

- Note: `from_gcs()` takes in a GCS directory, but it only works for a single sharded document from a single input document source.
- In a GA release, it would be a better practice to have `from_gcs()` take in any GCS directory and output a list of Wrapped Documents. But this would be a backwards-incompatible change now.
  - Not sure if it's possible/advisable to have two possible return types for `from_gcs()` and just have it return a list when there are multiple Wrapped Documents?
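On the question of two return types, the following minimal sketch shows what that would mean for callers. `WrappedDoc`, `load`, and `as_list` are hypothetical stand-ins, not the library's classes or methods; the point is only that a `Union` return forces every caller to check the type before using the result.

```python
from dataclasses import dataclass
from typing import List, Union


@dataclass
class WrappedDoc:
    """Hypothetical stand-in for a Wrapped Document."""
    prefix: str


def load(prefixes: List[str]) -> Union[WrappedDoc, List[WrappedDoc]]:
    """Return a single document for one prefix, otherwise a list."""
    docs = [WrappedDoc(p) for p in prefixes]
    return docs[0] if len(docs) == 1 else docs


def as_list(result: Union[WrappedDoc, List[WrappedDoc]]) -> List[WrappedDoc]:
    """Normalization that callers would need with a two-type return."""
    return result if isinstance(result, list) else [result]


single = load(["output/123/0"])
many = load(["output/123/0", "output/123/1"])
```

It is possible in Python, but the extra `isinstance` check at every call site is one argument for a separate method that always returns a list.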