-
Notifications
You must be signed in to change notification settings - Fork 651
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
FIX-#4099: Use mangled column names but keep the original when building frames from arrow #4767
Conversation
c30e156
to
05c109e
Compare
Codecov Report
@@ Coverage Diff @@
## master #4767 +/- ##
==========================================
- Coverage 85.37% 84.84% -0.54%
==========================================
Files 265 266 +1
Lines 19622 19908 +286
==========================================
+ Hits 16752 16890 +138
- Misses 2870 3018 +148
📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more |
@AndreyPavlenko thanks for contribution! Please also add a test and a release note into |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@AndreyPavlenko thanks for the changes, it should indeed fix the problem. Although, to be sure about this it's better to add a test that verifies it.
The fix looks general and may even be fixing the more general related issue (#3370), so I would suggest trying to add general testing for integer column names to see if we can close the #3370 with the fix, what do you think?
Sorry, but looking deeper, now I'm not sure that the fix is quite correct, because it changes the type of the column names. |
bb029ac
to
b080a4b
Compare
b080a4b
to
bdda101
Compare
bd0056d
to
b6e3617
Compare
…l when building from Arrow - Add F_ suffix to column names since integer names are not supported. - Keep the original column names in the from_arrow() method. Signed-off-by: Andrey Pavlenko <andrey.a.pavlenko@gmail.com>
b6e3617
to
3e45f2d
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM!
I think, the issue #3370 is also resolved by this fix. The test test_set_index() covers this case. |
@AndreyPavlenko, cool, closing that issue as well. |
What do these changes do?
The method Table.rename_columns() accepts a list of strings, but not numbers, thus, the columns must be converted to strings.
flake8 modin/ asv_bench/benchmarks scripts/doc_checker.py
black --check modin/ asv_bench/benchmarks scripts/doc_checker.py
git commit -s
dataframe.sum()
doesn`t work correctly at MODIN_STORAGE_FORMAT=omnisci #4099docs/development/architecture.rst
is up-to-date