CRF layer v3.0 continued #1999

Merged
merged 21 commits into tensorflow:master on Jul 19, 2020

Conversation

jaspersjsun
Contributor

This is a continuation of PR #1733 by @gabrieldemarmiesse.

Several suggestions from code review were applied.

Following is the original comment.


With a subclassing approach, we get a nicer API and it's very flexible.

Works only with TF 2.2+

@howl-anderson for the review and the CLA

The plan is to show users how to do the subclassing for the CRF. We shouldn't provide an API to save them some code there, because it would become very complex to design a good API and to maintain it later on.

So the CRF layer is a public API, and for the CRF loss we provide a good tutorial on subclassing.

Quick tutorial right now:

import tensorflow as tf
from tensorflow_addons.layers.crf import CRF
from tensorflow_addons.text.crf import crf_log_likelihood

def unpack_data(data):
    if len(data) == 2:
        return data[0], data[1], None
    elif len(data) == 3:
        return data
    else:
        raise TypeError("Expected data to be a tuple of size 2 or 3.")


class ModelWithCRFLoss(tf.keras.Model):
    """Wrapper around the base model for custom training logic."""

    def __init__(self, base_model):
        super().__init__()
        self.base_model = base_model

    def call(self, inputs):
        return self.base_model(inputs)

    def compute_loss(self, x, y, sample_weight, training=False):
        y_pred = self(x, training=training)
        # The CRF layer outputs (decoded_sequence, potentials, sequence_length,
        # chain_kernel); the decoded sequence is not needed for the loss.
        _, potentials, sequence_length, chain_kernel = y_pred

        crf_loss = -crf_log_likelihood(potentials, y, sequence_length, chain_kernel)[0]

        if sample_weight is not None:
            crf_loss = crf_loss * sample_weight

        return tf.reduce_mean(crf_loss), sum(self.losses)

    def train_step(self, data):
        x, y, sample_weight = unpack_data(data)

        with tf.GradientTape() as tape:
            crf_loss, internal_losses = self.compute_loss(
                x, y, sample_weight, training=True
            )
            total_loss = crf_loss + internal_losses

        gradients = tape.gradient(total_loss, self.trainable_variables)
        self.optimizer.apply_gradients(zip(gradients, self.trainable_variables))

        return {"crf_loss": crf_loss, "internal_losses": internal_losses}

    def test_step(self, data):
        x, y, sample_weight = unpack_data(data)
        crf_loss, internal_losses = self.compute_loss(x, y, sample_weight)
        return {"crf_loss": crf_loss, "internal_losses": internal_losses}


# get_test_data() stands in for whatever produces the (features, labels) numpy arrays.
x_np, y_np = get_test_data()

x_input = tf.keras.layers.Input(shape=x_np.shape[1:])
crf_outputs = CRF(5)(x_input)
base_model = tf.keras.Model(x_input, crf_outputs)
model = ModelWithCRFLoss(base_model)

model.compile("adam")
model.fit(x=x_np, y=y_np)
model.evaluate(x_np, y_np)
model.predict(x_np)
model.save("my_model.tf")
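
As a side note, since unpack_data also accepts a 3-element tuple, per-sequence sample weights can be passed through fit in the usual Keras way and reach train_step as (x, y, sample_weight). A minimal sketch, assuming a hypothetical w_np array with one weight per example:

import numpy as np

# Hypothetical per-example weights; Keras packs (x, y, sample_weight) into a
# 3-tuple, which unpack_data() above splits back apart in train_step/test_step.
w_np = np.ones(x_np.shape[0], dtype="float32")

model.fit(x=x_np, y=y_np, sample_weight=w_np)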

@googlebot

Thanks for your pull request. It looks like this may be your first contribution to a Google open source project (if not, look below for help). Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

📝 Please visit https://cla.developers.google.com/ to sign.

Once you've signed (or fixed any issues), please reply here with @googlebot I signed it! and we'll verify it.


ℹ️ Googlers: Go here for more info.

@jaspersjsun
Contributor Author

@googlebot I signed it!

@googlebot

All (the pull request submitter and all commit authors) CLAs are signed, but one or more commits were authored or co-authored by someone other than the pull request submitter.

We need to confirm that all authors are ok with their commits being contributed to this project. Please have them confirm that by leaving a comment that contains only @googlebot I consent. in this pull request.

Note to project maintainer: There may be cases where the author cannot leave a comment, or the comment is not properly detected as consent. In those cases, you can manually confirm consent of the commit author(s), and set the cla label to yes (if enabled on your project).

ℹ️ Googlers: Go here for more info.

@gabrieldemarmiesse
Member

gabrieldemarmiesse commented Jul 14, 2020

@googlebot I consent.

@howl-anderson
Contributor

@googlebot I signed it!

@jaspersjsun
Contributor Author

jaspersjsun commented Jul 15, 2020

The wheels build broke on Ubuntu 18.04 with Python 3.5. There seem to be some problems with the Docker environment. Is there anything I need to do to pass those two checks?

Looks like #2002 fixed it.

@jaspersjsun
Contributor Author

@howl-anderson Thank you so much for your contribution. Would you mind replying with only @googlebot I consent. to grant consent to this PR?

@howl-anderson
Contributor

@googlebot I consent.

@googlebot

CLAs look good, thanks!

ℹ️ Googlers: Go here for more info.

@jaspersjsun
Contributor Author

@facaiy @seanpmorgan Hi guys, would you mind having a look at this PR whenever you are available? The history of adding a CRF layer goes back a year or more (#22, #314, #377, and many other issues and PRs). Having this published will help a lot. Much appreciated!

@WindQAQ self-requested a review July 19, 2020 00:52
Member

@WindQAQ left a comment


Thank you for the contribution to this long journey 👍

@WindQAQ merged commit 4f65776 into tensorflow:master Jul 19, 2020
@luozhouyang

Thank you so much for this contribution!

Can we use the tf.keras.layers.Layer.add_loss API inside the CRF layer to calculate the crf_loss, instead of using a wrapper model like ModelWithCRFLoss?

@howl-anderson
Contributor

@luozhouyang I think the pattern you describe is called the endpoint pattern. In that pattern, users need to pass the true labels as one of the model inputs, which I don't think makes for a user-friendly API. tensorflow/tensorflow#37818 is a promising solution to this, but it is still under review.
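
For reference, a minimal sketch of that endpoint pattern (not what this PR ships), reusing the imports and the x_np / y_np arrays from the tutorial above; the CRFLossEndpoint layer and the variable names here are hypothetical:

class CRFLossEndpoint(tf.keras.layers.Layer):
    """Registers the negative CRF log-likelihood via add_loss."""

    def call(self, inputs):
        potentials, labels, sequence_length, chain_kernel = inputs
        log_likelihood, _ = crf_log_likelihood(
            potentials, labels, sequence_length, chain_kernel
        )
        self.add_loss(-tf.reduce_mean(log_likelihood))
        return potentials


x_input = tf.keras.layers.Input(shape=x_np.shape[1:])
# The true labels have to be fed in as a second model input, which is the
# usability problem mentioned above.
y_input = tf.keras.layers.Input(shape=y_np.shape[1:], dtype=tf.int32)

decoded, potentials, seq_len, kernel = CRF(5)(x_input)
potentials = CRFLossEndpoint()([potentials, y_input, seq_len, kernel])

endpoint_model = tf.keras.Model([x_input, y_input], decoded)
endpoint_model.compile("adam")
endpoint_model.fit(x=[x_np, y_np], y=None)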

@eloukas

eloukas commented Nov 5, 2020

This approach can work, but it's incredibly slow, especially when we have a large number of classes/labels.
Is there any way you can incorporate Viterbi decoding into it?
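
If I'm reading the merged layer correctly, Viterbi decoding already happens inside the CRF layer: its first output is the decoded tag sequence, which the tutorial above discards as _ when computing the loss. A minimal sketch of reading it back out, assuming the model and x_np from the tutorial:

# The wrapper's call() forwards the base model's four outputs; the first one
# is the decoded (Viterbi) tag sequence computed inside the CRF layer.
decoded_tags, potentials, sequence_lengths, chain_kernel = model.predict(x_np)
print(decoded_tags.shape)  # (batch_size, max_seq_len)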

jrruijli pushed a commit to jrruijli/addons that referenced this pull request Dec 23, 2020
* Squash all.

* Cleanup for easier review.

* Calming the angry bazel.

* Fix the strange bug.

* Replaced one bug by another bug.

* Minor simplification.

* Fix unused parameter.

* Simplified the signature.

* Removing boilerplate

* Unused import.

* CRF layer v3.0

* Finish the conversion.

* Some renaming here and there.

* Added a test where some training is done after reloading the model.

* Apply suggestions from CR

* update ops in _compute_mask_[left|right]_boundary

Co-authored-by: howl-anderson <u1mail2me@gmail.com>
Co-authored-by: gabrieldemarmiesse <gabrieldemarmiesse@gmail.com>