Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md

README.md

BiDAF

Description

This model is a neural network for answering a query about a given context paragraph.

Model

Model	Download	Checksum	Download (with sample test data)	ONNX version	Opset version	Accuracy
BiDAF	41.5 MB	MD5	37.3 MB	1.4	ONNX 9, ONNX.ML 1	EM of 68.1 in SQuAD v1.1

Inference

Input to model

Tokenized strings of context paragraph and query.

Preprocessing steps

Tokenize words and chars in string for context and query. The tokenized words are in lower case, while chars are not. Chars of each word needs to be clamped or padded to list of length 16. Note NLTK is used in preprocess for word tokenize.

context_word: [seq, 1,] of string
context_char: [seq, 1, 1, 16] of string
query_word: [seq, 1,] of string
query_char: [seq, 1, 1, 16] of string

The following code shows how to preprocess input strings:

import numpy as np
import string
from nltk import word_tokenize

def preprocess(text):
   tokens = word_tokenize(text)
   # split into lower-case word tokens, in numpy array with shape of (seq, 1)
   words = np.asarray([w.lower() for w in tokens]).reshape(-1, 1)
   # split words into chars, in numpy array with shape of (seq, 1, 1, 16)
   chars = [[c for c in t][:16] for t in tokens]
   chars = [cs+['']*(16-len(cs)) for cs in chars]
   chars = np.asarray(chars).reshape(-1, 1, 1, 16)
   return words, chars

# input
context = 'A quick brown fox jumps over the lazy dog.'
query = 'What color is the fox?'
cw, cc = preprocess(context)
qw, qc = preprocess(query)

Output of model

The model has 2 outputs.

start_pos: the answer's start position (0-indexed) in context,
end_pos: the answer's inclusive end position (0-indexed) in context.

Postprocessing steps

Post processing and meaning of output

# assuming answer contains the np arrays for start_pos/end_pos
start = np.asscalar(answer[0])
end = np.asscalar(answer[1])
print([w.encode() for w in cw[start:end+1].reshape(-1)])

For this testcase, it would output

[b'brown'].

Dataset (Train and validation)

The model is trained with SQuAD v1.1.

Validation accuracy

Metric is Exact Matching (EM) of 68.1, computed over SQuAD v1.1 dev data.

Publication/Attribution

Minjoon Seo, Aniruddha Kembhavi, Ali Farhadi, Hannaneh Hajishirzi. Bidirectional Attention Flow for Machine Comprehension, paper

References

This model is converted from a CNTK model trained from this implementation.

License

MIT License

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bidirectional_attention_flow

bidirectional_attention_flow

README.md

BiDAF

Description

Model

Inference

Input to model

Preprocessing steps

Output of model

Postprocessing steps

Dataset (Train and validation)

Validation accuracy

Publication/Attribution

References

License

Files

bidirectional_attention_flow

Directory actions

More options

Directory actions

More options

Latest commit

History

bidirectional_attention_flow

Folders and files

parent directory

README.md

BiDAF

Description

Model

Inference

Input to model

Preprocessing steps

Output of model

Postprocessing steps

Dataset (Train and validation)

Validation accuracy

Publication/Attribution

References

License