
Fix recurrent block memory leak and output shape calculation #556

Conversation

@mpskowron (Contributor) commented Jan 24, 2021

Description

  1. The recurrent block implementation has a memory leak in the opInputs(...) function, line 240:
    parameterList.add(array.flatten());
    array.flatten() creates a new array in the model's NDManager, so that array lives as long as the model itself. The analogous problem is also present in the LSTM block (see the first sketch below).
  2. The recurrent block's getOutputShapes(...) function returns incorrect shapes: the block accepts NTC input, not the TNC layout that getOutputShapes(...) wrongly assumes (see the second sketch below).
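A minimal sketch of the leak pattern and one possible remedy, using DJL's NDManager ownership model. The class name and the sub-manager fix are illustrative assumptions, not the exact change made in this PR; the point is that an intermediate created by flatten() stays attached to the long-lived manager unless it is explicitly scoped elsewhere.

```java
import ai.djl.ndarray.NDArray;
import ai.djl.ndarray.NDList;
import ai.djl.ndarray.NDManager;
import ai.djl.ndarray.types.Shape;

public class FlattenLeakSketch {

    public static void main(String[] args) {
        // Long-lived manager standing in for the model's NDManager.
        try (NDManager modelManager = NDManager.newBaseManager()) {
            NDArray parameter = modelManager.create(new Shape(2, 3));

            // Leaky pattern: flatten() allocates a new NDArray owned by the
            // same manager as `parameter`, so the copy is only released when
            // the model's manager closes, even though it is needed for a
            // single forward pass.
            NDList leaky = new NDList();
            leaky.add(parameter.flatten());

            // Hypothetical remedy: scope the temporary to a sub-manager that
            // is closed as soon as the operation is done.
            try (NDManager scoped = modelManager.newSubManager()) {
                NDArray flat = parameter.flatten();
                flat.attach(scoped); // illustrative fix, not the PR's exact change
                NDList inputs = new NDList();
                inputs.add(flat);
                // ... run the op with `inputs`; `flat` is freed when
                // `scoped` closes at the end of this block.
            }
        }
    }
}
```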
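A second sketch of the shape issue, assuming a simplified output layout of (batch, time, stateSize * directions). The helper and its parameters are hypothetical stand-ins for the block's fields; it only illustrates why reading the input as TNC instead of NTC swaps the first two axes of the computed output shape.

```java
import ai.djl.ndarray.types.Shape;

final class RecurrentOutputShapeSketch {

    private RecurrentOutputShapeSketch() {}

    /**
     * Illustrative only: computes the output shape of a recurrent block for an
     * NTC-layout input (batch, time, channels). stateSize and numDirections are
     * hypothetical parameters standing in for the block's configuration.
     */
    static Shape outputShapeForNtc(Shape input, long stateSize, int numDirections) {
        long batch = input.get(0); // N: batch size is the first axis in NTC
        long time = input.get(1);  // T: sequence length is the second axis
        // A TNC assumption would read these two axes in the opposite order,
        // producing a transposed (and therefore wrong) output shape.
        return new Shape(batch, time, stateSize * numDirections);
    }

    public static void main(String[] args) {
        Shape ntcInput = new Shape(32, 10, 64); // batch=32, time=10, channels=64
        System.out.println(outputShapeForNtc(ntcInput, 128, 1)); // (32, 10, 128)
    }
}
```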

@stu1130 (Contributor) left a comment

Hi @mpskowron, thanks for your contribution. I am also refactoring the model to make it more generic for the later PyTorch & TensorFlow integration, so I am going to merge your PR and rebase onto it.

@mpskowron (Contributor, Author)

Sounds great! Thank you for the quick review.

@stu1130 stu1130 merged commit 0b4f3a5 into deepjavalibrary:master Jan 25, 2021
Lokiiiiii pushed a commit to Lokiiiiii/djl that referenced this pull request Oct 10, 2023