-
Notifications
You must be signed in to change notification settings - Fork 446
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Can't get word embedding #37
Comments
Hi @happypanda5, |
Hi, @jhyuklee , |
I think you can try out, for example |
Hi, I am trying to get a word embedding vector for BioBERT, and compare it with the word embedding vector I get from BERT.
However, I haven't been successful in running BioBERT.
I have downloaded the weights from release v1.1-pubmed and after unzipping the weights into a folder, I run the following code
`out = open('prepoutput.json', 'w')
import os
os.system('python3 "/content/biobert/extract_features.py"
--input_file= "/content/biobert/sample_text.txt"
--vocab_file= "/content/biobert_v1.1_pubmed/vocab.txt"
--bert_config_file= "/content/biobert_v1.1_pubmed/bert_config.json"
--init_checkpoint= "/content/biobert_v1.1_pubmed/model.ckpt.index"
--output_file= "/content/prepoutput.json" ')`
The output is "256" and the file "preoutput.json" is empty.
Please guide me.
Unfortunately, my attempts at converting the weights from Pytorch wasn't successful either.
The text was updated successfully, but these errors were encountered: