Deep NLP Chatbot

A chatbot based on the recurrent seq2seq model, built in TensorFlow 2 and trained on movie dialogue from the Cornell Movie-Dialogs Corpus.

Details

My model uses a standard recurrent encoder-decoder architecture. I implemented both the encoder and the decoder as bidirectional RNNs with GRU cells, and I included a Bahdanau-style attention mechanism to improves memory of long sequences during decoding. At the evaluation stage, beam search is used to decode input sequences. A possible next step is to reimplement the training stage with beam search as well.

CMDC is organized such that collections of movie character lines form conversations. After parsing the data, I iterated through each conversation to produce question-answer pairs, which became the input and target data for training.

Read my blog post for more insights on this project.

References

I initially referenced tutorials from SuperDataScience and TensorFlow. As I branched out, I came across many blogs and papers to helped me learn about RNNs, attention, and beam search. I also got a glimpse of hierarchical encoders, transformers, and other state-of-the-art techniques.

Some extra helpful ones:

colah's blog on LSTMs

Lil'Log on attention

Beam search (Wiseman, Rush)

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
preprocessed		preprocessed
.gitignore		.gitignore
README.md		README.md
chatbot.py		chatbot.py
cmdc		cmdc
environment.yml		environment.yml
model.py		model.py
test.py		test.py
train.py		train.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Deep NLP Chatbot

Details

References

About

Releases

Packages

Languages

dlzou/dnlp-chatbot

Folders and files

Latest commit

History

Repository files navigation

Deep NLP Chatbot

Details

References

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages