Categorical kernel #345
Conversation
First implementation and demo notebook
Merge in main
This reverts commit a4e877f.
    jnp.eye(2), bijector=tfb.CorrelationCholesky()
)
inspace_vals: list = static_field(None)
name: str = "Dictionary Kernel"
Here the name is "Dictionary Kernel", yet the object is called CatKernel. Could you shed some light on this incongruity, please?
Dictionary Kernel (or DictKernel) was the old name. References to this old name will be fixed.
L = self.sdev.reshape(-1, 1) * self.cholesky_lower
return L @ L.T

def __call__(  # TODO not consistent with general kernel interface
This is true of the GraphKernel too. Not helpful, I know, but maybe there's an alternative abstraction that is more appropriate for non-Euclidean kernels.
This is because AbstractKernel.__call__(x, y) requires float arrays for x and y. Instead, maybe the right signature for this base class would be
def __call__(self, x: Num[Array, " D"], y: Num[Array, " D"]) -> ScalarFloat:
because then the categorical kernel could specialize to ScalarInt, if I'm not mistaken.
Yes. I agree this would be the more general signature.
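To make the signature discussion concrete, here is a minimal sketch of how a base class accepting generic numeric inputs would let a categorical kernel specialize to integer category indices. All class and attribute names here are illustrative, not the library's actual API.

```python
import jax.numpy as jnp

class AbstractKernel:
    """Illustrative base class: inputs are numeric, not necessarily float."""
    def __call__(self, x, y):
        raise NotImplementedError

class CategoricalKernel(AbstractKernel):
    def __init__(self, gram):
        # explicit (N, N) gram matrix over the N category values
        self.gram = jnp.asarray(gram)

    def __call__(self, x, y):
        # x and y are integer category indices rather than float vectors
        return self.gram[x, y]

k = CategoricalKernel(jnp.array([[1.0, 0.2], [0.2, 1.0]]))
value = k(0, 1)  # cross-covariance between categories 0 and 1
```

With a Num-typed base signature, this subclass override would be a legal narrowing rather than an interface violation.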
@property
def explicit_gram(self):
    L = self.sdev.reshape(-1, 1) * self.cholesky_lower
    return L @ L.T
How does this differ from the regular gram method?
It's actually a property. I implemented it because it can be handy to use this rather than the internal parametrization. Adding a docstring.
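A minimal numeric sketch of what the property computes: the gram matrix is assembled from per-category standard deviations and a unit-row-norm correlation Cholesky factor, so the diagonal recovers the variances. The values below are made up for illustration.

```python
import jax.numpy as jnp

# assumed parametrization, following the snippet above
sdev = jnp.array([1.0, 2.0])
cholesky_lower = jnp.array([[1.0, 0.0],
                            [0.5, 0.75 ** 0.5]])  # rows have unit norm

L = sdev.reshape(-1, 1) * cholesky_lower
explicit_gram = L @ L.T
# diagonal entries equal sdev**2; off-diagonals are the correlations
# scaled by the corresponding standard deviations
```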
    ValueError: If the number of diagonal variance parameters does not match the number of input space values.
    """

sdev: Float[Array, " N"] = param_field(jnp.ones((2,)), bijector=tfb.Softplus())
nit: elsewhere in the package we use stddev for standard deviation.
Changed the name.
return num_inspace_vals * (num_inspace_vals - 1) // 2

@classmethod
def gram_to_sdev_cholesky_lower(cls, gram: Float[Array, "N N"]) -> CatKernelParams:
Doesn't need to be a class method since cls is not used. Can be a static method.
yep, changed.
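For context, a hedged sketch of the inverse mapping such a static method could implement: splitting a gram matrix into standard deviations and a correlation Cholesky factor. The real method's name and return type (CatKernelParams) may differ from this illustration.

```python
import jax.numpy as jnp

def gram_to_sdev_cholesky_lower(gram):
    # standard deviations are the square roots of the diagonal variances
    sdev = jnp.sqrt(jnp.diag(gram))
    # normalize to a correlation matrix, then take its Cholesky factor
    corr = gram / jnp.outer(sdev, sdev)
    return sdev, jnp.linalg.cholesky(corr)

gram = jnp.array([[1.0, 1.0], [1.0, 4.0]])
sdev, L_corr = gram_to_sdev_cholesky_lower(gram)
# round trip: (sdev[:, None] * L_corr) @ (sdev[:, None] * L_corr).T == gram
```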
@thomaspinder The
If I get your thumbs up I'll merge.
cholesky_lower: Float[Array, "N N"] = param_field(
    jnp.eye(2), bijector=tfb.CorrelationCholesky()
)
I did not know this bijector existed. It makes the code neater. It's worth thinking about a slightly messier formulation, though:
It would be nice to be able to control the flexibility, kind of like when you specify the rank of W in the decomposition K = W W^T + kappa.
How much work would this be @ingmarschuster? If it's simple, then maybe let's add it to this PR. Otherwise, if you feel it's a good idea, then let's open an issue for it.
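An illustrative sketch (not from this PR) of the rank-controlled parametrization K = W W^T + diag(kappa) proposed above: the chosen rank bounds the flexibility of the off-diagonal structure, and kappa > 0 keeps K positive definite.

```python
import jax
import jax.numpy as jnp

def low_rank_gram(W, kappa):
    # K = W @ W.T + diag(kappa); rank of W controls flexibility
    return W @ W.T + jnp.diag(kappa)

num_categories, rank = 4, 2
W = jax.random.normal(jax.random.PRNGKey(0), (num_categories, rank))
kappa = 0.1 * jnp.ones(num_categories)
K = low_rank_gram(W, kappa)
# K is symmetric positive definite for any W whenever kappa > 0
```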
Left two comments @ingmarschuster. They don't need resolving or actioning, and the PR can now be merged.
Type of changes
Checklist
poetry run pre-commit run --all-files --show-diff-on-failure before committing.

Description
This implements a kernel with explicit gram values for categorical inputs (such as a string of characters). The parametrization works very well for gradient descent.
I'm not sure about formatting with the poetry command; it didn't work for me. I'm using the black formatter.
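To illustrate the intended usage described above, here is a hypothetical end-to-end sketch: categorical inputs (characters here) are mapped to integer indices, and kernel evaluations are lookups into the explicit gram matrix over the input space values. Names and values are illustrative, not the library's API.

```python
import jax.numpy as jnp

# hypothetical input space of category values and an explicit gram over it
inspace_vals = ["a", "b", "c"]
index = {v: i for i, v in enumerate(inspace_vals)}
gram = jnp.array([[1.0, 0.3, 0.1],
                  [0.3, 1.0, 0.2],
                  [0.1, 0.2, 1.0]])

def k(x, y):
    # kernel evaluation is a lookup into the explicit gram matrix
    return gram[index[x], index[y]]

similarity = k("a", "c")  # gram entry for the pair of categories
```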