Set smart defaults with language detection #583

Open
3 tasks
ajdapretnar opened this issue Oct 2, 2020 · 0 comments

Is your feature request related to a problem? Please describe.
If the corpus is not in English, the language has to be set manually in several widgets. A smart default that propagates downstream and sets the language automatically (in Preprocess Text, Sentiment Analysis, Document Embedding) would be neat.

Describe the solution you'd like
Once #582 is implemented, we could save the detected language as a Corpus property and, if it is present, set the defaults in certain widgets to that language. If several languages are present, we could find smart ways of handling this (Document Embedding could embed each document with a model for its language, stopwords could be removed for each document separately, etc.).
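
A rough sketch of how this could look, not a proposal for the actual API. The `Corpus` class here is a hypothetical stand-in (not the real `orangecontrib.text` Corpus), and the `language` and `doc_languages` attributes, the helper names, and the `"en"` fallback are all assumptions made for illustration.

```python
from collections import defaultdict
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Set


@dataclass
class Corpus:
    """Hypothetical stand-in, not the real orangecontrib.text Corpus."""
    documents: List[str]
    # Assumed corpus-wide language code, e.g. "sl"; None when unknown or mixed.
    language: Optional[str] = None
    # Assumed per-document language codes, aligned with `documents`.
    doc_languages: List[Optional[str]] = field(default_factory=list)


def default_language(corpus: Corpus, supported: Set[str], fallback: str = "en") -> str:
    """Return the corpus language if the widget supports it, else the fallback."""
    if corpus.language in supported:
        return corpus.language
    return fallback


def group_by_language(corpus: Corpus) -> Dict[Optional[str], List[int]]:
    """Map each language to the indices of its documents."""
    groups = defaultdict(list)
    for index, lang in enumerate(corpus.doc_languages):
        groups[lang].append(index)
    return dict(groups)
```

A widget would call something like `default_language(corpus, supported)` only when the user has not picked a language explicitly, so a manual choice is never overridden. For mixed corpora, the grouping helper gives each widget the document indices per language, which is enough to pick a different embedding model or stopword list for each group.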

Describe alternatives you've considered
Setting it manually. 😆

Additional context
Widgets that should support this:

  • Preprocess Text
  • Document Embedding
  • Sentiment Analysis
  • ?