Multivariate Time Series Transformer #130066
Unanswered
sergiofdez02 asked this question in Programming Help
Replies: 0 comments
The transformer class I've implemented for multivariate time series forecasting is giving me weird results when training: the validation loss is volatile and the training loss increases. Could this be caused by the design of the architecture? Here is the structure of the transformer, without the Encoder and Decoder layer definitions:
```python
import tensorflow as tf

class Transformer(tf.keras.Model):
    def __init__(self, num_blocks_enc, num_blocks_dec, d_model, d_ff,
                 num_heads, output_size, rate=0.1, **kwargs):
        super(Transformer, self).__init__(**kwargs)
        self.time2vec = Time2Vec(d_model)
        self.encoder = [EncoderLayer(d_model, d_ff, num_heads, rate)
                        for _ in range(num_blocks_enc)]
        self.decoder = [DecoderLayer(d_model, d_ff, num_heads, rate)
                        for _ in range(num_blocks_dec)]
        self.final_layer = tf.keras.layers.Dense(output_size, activation='relu')
        self.num_blocks_enc = num_blocks_enc
        self.num_blocks_dec = num_blocks_dec
        self.d_model = d_model
        self.d_ff = d_ff
        self.num_heads = num_heads
        self.output_size = output_size
        self.rate = rate
```
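One detail worth checking, independent of the encoder/decoder internals: the final `Dense` layer uses a `relu` activation, so the model can never output negative values. If the forecasting targets (e.g. after standardization) contain negatives, the loss on those samples has an irreducible floor, which can produce erratic loss curves. A minimal sketch of the effect, using synthetic NumPy data purely for illustration:

```python
import numpy as np

# Standardized time-series targets typically contain negative values.
y_true = np.array([-2.0, -1.0, 0.0, 1.0, 2.0])

# Perfect raw predictions, before any output activation.
raw_pred = y_true.copy()

# A ReLU on the final layer clips every negative prediction to zero.
relu_pred = np.maximum(raw_pred, 0.0)

mse_linear = np.mean((y_true - raw_pred) ** 2)   # 0.0 -> a linear output can fit
mse_relu = np.mean((y_true - relu_pred) ** 2)    # 1.0 -> error floor from clipping

print(mse_linear, mse_relu)
```

For unbounded regression targets, `activation=None` (a linear output) on the final layer is the usual choice.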