-
Notifications
You must be signed in to change notification settings - Fork 589
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Export does not work #515
Comments
Hi,
cheers, Maarten |
Ahh I see. Is there a way to save/export a filtered dataset back to a CSV or any text file? |
We could do this, but we'll rely on pandas or arrow to do this. But I feel users should feel comfortable doing pandas_df = df.to_pandas_df()
pandas_df.to_csv('test.csv') We may end up wrapping everything in pandas, which does not scale, but this might be a case where it's convenient/common enough that it makes sense. What do you think? @JovanVeljanoski what do you think should we support an export to csv? |
Pandas Output to CSV/ascii should be avoided however, especially for bigger datasets, since in that case one is losing of the main strengths of Vaex. If the use-case is interoperability with other tools (java, etc..) arrow promises to bridge this gap. CSVs are still useful for reports (e.g. corporate environments), in which case small aggregations will be saved, for which the example by @maartenbreddels is quite suitable I would think. For completeness.. i guess we could support output to CSV (wrapper around pandas), maybe in the same efforts planned for improvement of i/o @maartenbreddels ? I could also improve the docstring of the |
I think with #516 we could do a very efficient csv exporter. Provided we also do a parallel/chunked csv reader (based on pandas or arrow). |
Since #708 the documentation page features an example about reading and exporting data from vaex. It also described the improvements we have made in data I/O recently |
Hi,
I was trying to export a filtered dataset in a csv document but the
export
function does not save the file. It runs without any error message but basically it does not do anything.export_hdf5
works well.Here is an example with the
titanic
dataset:Neither
test.txt
ortest_filtered.txt
was created and saved. Is there any way to enable some debug information to see what is going on and why the data does not get exported ?The text was updated successfully, but these errors were encountered: