Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] Not all granular features are getting generated #78

Open
neomatrix369 opened this issue Mar 13, 2023 · 0 comments
Open

[BUG] Not all granular features are getting generated #78

neomatrix369 opened this issue Mar 13, 2023 · 0 comments
Assignees
Labels
1. high-priority Important to fix the issue as soon as possible bug Something isn't working granular feature(s) Low-level/granular feature(s)

Comments

@neomatrix369
Copy link
Owner

neomatrix369 commented Mar 13, 2023

Describe the bug

After running the notebook(s) on Kaggle/local machine we can see that not all granular features are getting generated for e.g. these fields 'repeated_letters_count', 'repeated_digits_count', 'repeated_spaces_count', 'repeated_whitespaces_count',
'repeated_punctuations_count', 'english_characters_count', 'non_english_characters_count' in addition to the others are not part of the dataframe, either it's not detected or something else is amiss.

To Reproduce

Run the notebook on Kaggle i.e. https://www.kaggle.com/code/neomatrix369/nlp-profiler-simple-dataset and it fails at the cell that looks for repeat characters, etc...

Version information:

NLP Profiler Version 0.0.3 - issue is not relevant to environment or any other technical parameter.
The version on the master branch also behaves in the same manner.

Additional context

From the logs on https://www.kaggle.com/code/neomatrix369/nlp-profiler-simple-dataset#Installation-and-import-libraries/packages - the 0.0.3 version on PyPi worked in the past and for some time has not been working.

@neomatrix369 neomatrix369 added the bug Something isn't working label Mar 13, 2023
@neomatrix369 neomatrix369 added this to To do in NLP Profiler via automation Mar 13, 2023
@neomatrix369 neomatrix369 self-assigned this Mar 13, 2023
@neomatrix369 neomatrix369 added granular feature(s) Low-level/granular feature(s) 1. high-priority Important to fix the issue as soon as possible labels Mar 13, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
1. high-priority Important to fix the issue as soon as possible bug Something isn't working granular feature(s) Low-level/granular feature(s)
Projects
NLP Profiler
  
To do
Development

No branches or pull requests

1 participant