
want zoneout lstm supported #1867

Closed
breadbread1984 opened this issue May 21, 2020 · 12 comments
Comments

@breadbread1984

Describe the feature and the current behavior/state.

No zoneout LSTM is currently available in Addons.

Relevant information

Which API type would this fall under (layer, metric, optimizer, etc.)

tfa.layers

Who will benefit from this feature?

People implementing Tacotron 2 with the tf.keras API.

Any other info.

@failure-to-thrive
Contributor

I could handle it...

@breadbread1984
Author

Really appreciate it!

@failure-to-thrive
Contributor

So far so good! Any ideas on how to test it? Input values, random seed, expected output values?

@breadbread1984
Author

Speedy implementation! Given the same input and a zero initial state, it should output a tensor whose elements are either the same as the LSTM's or zero.

@failure-to-thrive
Contributor

Here it is! You can copy & paste the class into your code. An example of usage and tests are further down below. If everything works as intended, we could incorporate it into TFA.

import tensorflow as tf
from tensorflow.keras.layers import LSTMCell, RNN, LSTM

class ZoneoutLSTMCell(LSTMCell):
    def __init__(self, units, zoneout_h=0.0, zoneout_c=0.0, **kwargs):
        super().__init__(units, **kwargs)
        self.zoneout_h = zoneout_h  # zoneout rate for the hidden state
        self.zoneout_c = zoneout_c  # zoneout rate for the cell state

    def _zoneout(self, t, tm1, rate, training):
        # Bernoulli keep-mask: during training each unit keeps its previous
        # value tm1 with probability `rate`; at inference (training=False)
        # the threshold is 0, so the mask is all ones and the new value t
        # passes through unchanged.
        dt = tf.cast(tf.random.uniform(t.shape) >= rate * training, t.dtype)
        return dt * t + (1 - dt) * tm1

    def call(self, inputs, states, training=None):
        output, new_states = super().call(inputs, states, training)
        h = self._zoneout(new_states[0], states[0], self.zoneout_h, training)
        c = self._zoneout(new_states[1], states[1], self.zoneout_c, training)
        return h, [h, c]


x = tf.constant([[[1., 2, 3, 4, 5]]])
initial_state = [tf.constant([[11., 12, 13]]), tf.constant([[14., 15, 16]])]

tf.random.set_seed(0)
l0 = LSTM(3, return_state=True)
y0 = l0(x, initial_state=initial_state, training=True)
tf.print(y0)

tf.random.set_seed(0)
l = RNN(ZoneoutLSTMCell(3, zoneout_h=.3, zoneout_c=.5), return_state=True)
y = l(x, initial_state=initial_state, training=True)
tf.print(y)

@breadbread1984
Author

You need to save zoneout_h and zoneout_c in the layer's config dictionary.
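A minimal sketch of what that could look like: overriding get_config so the zoneout rates survive model serialization. The class is restated here so the snippet is self-contained; it assumes the ZoneoutLSTMCell from the code above.

```python
import tensorflow as tf
from tensorflow.keras.layers import LSTMCell


class ZoneoutLSTMCell(LSTMCell):
    def __init__(self, units, zoneout_h=0.0, zoneout_c=0.0, **kwargs):
        super().__init__(units, **kwargs)
        self.zoneout_h = zoneout_h
        self.zoneout_c = zoneout_c

    def get_config(self):
        # Merge the parent LSTMCell config with the zoneout-specific
        # fields so tf.keras can reconstruct the cell via from_config.
        config = super().get_config()
        config.update({"zoneout_h": self.zoneout_h, "zoneout_c": self.zoneout_c})
        return config
```

With this in place, `ZoneoutLSTMCell.from_config(cell.get_config())` round-trips the zoneout rates along with the standard LSTMCell settings.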

@breadbread1984
Author

I get unsupported operand type(s) for *: 'float' and 'NoneType' at the line "dt = tf.cast(tf.random.uniform(t.shape) >= rate * training, t.dtype)".

@failure-to-thrive
Contributor

> You need to save zoneout_h and zoneout_c in the layer's config dictionary.

Sure. The code above is just an algorithm implementation. If it is OK, we can move forward.

> I get unsupported operand type(s) for *: 'float' and 'NoneType' at the line "dt = tf.cast(tf.random.uniform(t.shape) >= rate * training, t.dtype)".

Could you describe the environment where it happened?

@breadbread1984
Author

Sorry, I was using the layer the wrong way. I can now run the code successfully.
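For anyone who hits the same error: it occurs when the cell is called without a `training` argument, so `rate * training` multiplies a float by `None`. One possible guard (my own suggestion, not part of the code above) is to normalize `training` before using it in the mask computation:

```python
import tensorflow as tf


def zoneout(t, tm1, rate, training=None):
    """Zoneout step that tolerates training=None by treating it as inference."""
    if training is None:
        training = False
    # At inference the keep-probability threshold becomes 0, so the mask is
    # all ones and the new state t passes through unchanged.
    dt = tf.cast(tf.random.uniform(tf.shape(t)) >= rate * float(training), t.dtype)
    return dt * t + (1.0 - dt) * tm1
```

Calling `zoneout(new_state, old_state, 0.5)` with no training flag then behaves like inference instead of raising a TypeError.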

@breadbread1984
Author

@failure-to-thrive I found a flaw in your implementation: the hidden state is computed from the LSTM's cell state, but in zoneout LSTM the hidden state should be computed from the zoned-out cell state.

@failure-to-thrive
Contributor

I've simply implemented the generalized form from the original academic paper. There are lots of variations, which are mentioned in the paper too, but I'm not sure whether all of them should be implemented within the scope of TFA. Any ideas?
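For context, my reading of the paper's generalized form: during training each unit keeps its previous value with probability given by the zoneout rate (a Bernoulli mask), and at test time the expected value of that mixture is used, analogous to how dropout rescales at inference. A minimal sketch of the two regimes (function names are mine, not from the code above):

```python
import tensorflow as tf


def zoneout_train(new, prev, rate, seed=None):
    # Training: each unit keeps its previous value with probability `rate`
    # (Bernoulli mask), otherwise takes the freshly computed value.
    mask = tf.cast(tf.random.uniform(tf.shape(new), seed=seed) < rate, new.dtype)
    return mask * prev + (1.0 - mask) * new


def zoneout_eval(new, prev, rate):
    # Inference: use the expected value of the training-time mixture.
    return rate * prev + (1.0 - rate) * new
```

Note that the implementation above simply passes the new state through at inference rather than taking this expectation, which is one of the variations the paper discusses.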

@seanpmorgan
Member

TensorFlow Addons is transitioning to a minimal maintenance and release mode. New features will not be added to this repository. For more information, please see our public messaging on this decision:
TensorFlow Addons Wind Down

Please consider sending feature requests / contributions to other repositories in the TF community with charters similar to TFA's:
Keras
Keras-CV
Keras-NLP
