Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Can we train Magpie models with just one word? #158

Open
kaundinya5 opened this issue Sep 26, 2018 · 3 comments
Open

Can we train Magpie models with just one word? #158

kaundinya5 opened this issue Sep 26, 2018 · 3 comments

Comments

@kaundinya5
Copy link

I'm trying to use Magpie to classify different alphanumeric numbers into either policy number, account number. I changed the minimum number of words in word2vec to 1 to support this. After I trained the model, it always produces the same probability. Am I doing something wrong or do Magpie models always be trained with sentences and paragraphs?

@dorg-ekrolewicz
Copy link

dorg-ekrolewicz commented Sep 26, 2018 via email

@kaundinya5
Copy link
Author

Thank you for your response, I'm not trying to implement binary classification, I just started out with two labels, will be adding more labels once I get this working, I just wanted to try and see if Magpie could work on just two labels first.

Here are some example inputs:
Policy numbers: 594034, 02499357-3, 04187428-7, 04202703-0, 5572732, 06080677-0, 6498924502, 100014713, 100023672
Account numbers: AAAMOTO-01, ALDEINC-03 ,ALLAFOR-01, AMSMARK-01, AMERMOD-03

I have created the appropriate txt and lab files as well.

@jstypka
Copy link
Collaborator

jstypka commented Sep 26, 2018

@kaudinya5 Magpie is designed to classify free text documents that consist of many words. It's power relies on the fact that it can learn implicit relationships between those words and use them for classification. If your input is a unique alphanumeric character, it's unlikely the model will learn much.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants