Output nodes should be per subschema not per keyword #1249

gregsdennis · 2022-06-27T23:57:43Z

This is quite a change, so I'd be happy to walk anyone through it.

Summary

Following the idea from this thread and this thread, this PR changes the concept of the output unit from being based on individual output from keywords to being based on output from schemas. The benefits of this are in the thread, so I won't rehash them here. The primary difference is that errors and annotations are included in the output unit as properties (objects with keyword names for keys) rather than as nested units.

It also does a few other things:

removes the detailed format
renames the verbose format hierarchical (since there's only one now)
adds clarification in some of the language
adds passing examples to show how annotations are included

Doing this made the formats considerably more information-dense, which meant that the verbose examples now reasonably fit in the spec rather than needing to be defined in an external file.

All of this is pretty related, otherwise I would have broken it into multiple PRs.

gregsdennis · 2022-06-30T20:35:10Z

I have implemented this in the schema/experiment-new-output-format branch of json-everything

jdesrosiers · 2022-07-08T15:28:23Z

The draft-next branch has been merged and is now closed. The merge target for this PR has been changed to main. Here are the recommended steps to get your branch reabsed properly.

Make sure your remote for the json-schema-org/json-schema-spec repo is up-to-date. (Example: git fetch upstream).
Rebase your commits onto main. (Example: git rebase --onto upstream/main abcd123~1 (replace abcd123 with the commit hash of the first commit in your PR)).
Force push the rebased branch to your fork. (Example: git push --force origin my-branch).

…nd explanation of results

… and clarifications

karenetheridge · 2022-07-11T16:22:00Z

I am strongly against parts of this PR but I am unable to spend much time going through it again in detail until the end of the week.

handrews

I've got various nitpicks and wording ideas (some of which may belong somewhere other than this PR), but overall this looks good to me. I like how this gets rid of the duplication of the three location fields.

For the purposes of this PR I'm not worrying about whether "basic" becomes mandatory, or whether there are reasons for a more substantial difference between error and annotation output.

jsonschema-core.xml

gregsdennis · 2022-07-27T22:53:22Z

@karenetheridge I welcome your feedback, but it's been over two weeks since you posted.

handrews

Thanks for the update! While i made quite a few comments they're mostly tiny things or just an effort to standardize the terminology. Overall this is looking really good!

jsonschema-core.xml

handrews · 2022-08-03T22:38:13Z

jsonschema-core.xml

+                        All output units are included in this format.
+                    </t>
+                    <t>
+                        The location properties of the root output unit MAY be omitted.


What does this mean exactly? I would really rather not have to special-case any output unit, they should all always provide the same set of location information.

For the basic output, results from the root schema are also an item in the list, which means that there is no need to have location properties (evaluationPath, schemaLocation, & instanceLocation) in the root output unit (and only the root). Thus for basic it's just valid and nested at the root.

This allowance permits these properties to also be omitted from the root of the hierarchical format. The thinking behind this is that it aligns with basic, if that matters to implementors, but also that these location properties will always be

evaluationPath: "" (empty pointer) schemaLocation: "https://example.com/schema#" instanceLocation: "" (empty pointer)

Having them at the root isn't useful or really required.

This feels like a design smell from trying to make two things that aren't really the same pretend like they are the same.

jdesrosiers · 2022-08-04T21:11:08Z

I'm still not entirely sure how I feel about the approach. I think it will work fine. I think my only concern is that we would be making a drastic change without really knowing how it's going to work out. Not that what we have now is well proven either, I just don't want to be implementing a completely new output format every release. I don't know that there's really a solution to that without a crystal ball, but I wanted to mention it anyway.

gregsdennis · 2022-08-04T21:24:48Z

@jdesrosiers I understand and share that concern. However, as I mentioned in my blog post, the current design was not met with much acceptance and there were a lot of questions.

Secondly, I think this aligns better with the discussion we had around how annotations are passed on / blocked.

It really needs to change. I really think that we're a lot closer to a final form.

Also, having implemented both formats, I can tell you that this is so much simpler!

jdesrosiers · 2022-08-04T23:13:50Z

I completely agree that it needs an update. Doing nothing is not an option. My main concern is maintaining a bunch of different versions of the output format as we iterate and experiment to figure out what's going to work best. I'd feel a lot better about this if it were it's own spec and we actually treated it like a draft where each draft replaces the previous and has no backwards/forwards compatibility guarantees until things stabilize. That way I always only have to maintain the latest version.

gregsdennis · 2022-08-07T21:47:48Z

I agree with what you're saying. For now, I think we can still make improvements in-place and then extract the output separately later.

handrews · 2022-08-07T23:18:33Z

@jdesrosiers given that we quite likely won't even put out the next revision under the IETF process, I don't think we should worry too much about what is in which document at the moment.

gregsdennis · 2022-08-08T07:17:29Z

Let's continue the output-in-a-new-spec discussion in this thread. For now, it's all in Core. We'll open a new PR for further developments. I think this one is complete.

jdesrosiers · 2022-08-08T18:14:41Z

given that we quite likely won't even put out the next revision under the IETF process, I don't think we should worry too much about what is in which document at the moment.

I don't think what process we are using makes a difference, unless your point is that it doesn't make sense to split things out until we know what process we're going to use moving forward. That makes sense, but I wasn't suggesting it be split out now, just that it's an important consideration to keep in mind. If we're going to make such a big change we need to consider the effect on implementors and that very much includes discussion of what the life-cycle of this feature will be.

I think it's a relevant an very important to thing to be thinking about now, but it's not a blocker for this PR, just an important thing to call out and follow up on later.

jsonschema-core.xml

handrews

I'm in favor of merging this. The open points of discussion are either not things that I'd block the PR on anyway, or I'm confident that we'll continue talking about them. What's here right now is a great improvement.

jdesrosiers changed the base branch from draft-next to main July 8, 2022 15:27

jdesrosiers and others added 7 commits July 9, 2022 12:26

Move contains to "other" applicator section

91fea47

updated output structure description, example schema and instances, a…

a469d9b

…nd explanation of results

update wording for structures and associated examples

8a73d7b

removed 'detailed' format; added annotations examples; some rewording…

b4ff06e

… and clarifications

updated description of nested results

997bed9

use actual generated output for the examples

e1344cc

fix URIs and remove location props from root node on basic

742221a

gregsdennis force-pushed the output-nodes-should-be-per-subschema-not-per-keyword branch from 78b4cde to 742221a Compare July 9, 2022 01:31

Relequestual mentioned this pull request Jul 18, 2022

Open Community Working Meeting 2022-07-18 json-schema-org/community#200

Closed

2 tasks

This was referenced Jul 24, 2022

Experiment: new output format json-everything/json-everything#308

Merged

Fixing JSON Schema Output json-schema-org/blog#17

Merged

Add output formatting tests json-schema-org/JSON-Schema-Test-Suite#247

Closed

gregsdennis marked this pull request as draft July 25, 2022 23:51

gregsdennis mentioned this pull request Jul 26, 2022

Add keyword id to output unit #1065

Open

handrews reviewed Jul 26, 2022

View reviewed changes

edits suggested by @handrews

0d7b836

handrews reviewed Aug 3, 2022

View reviewed changes

gregsdennis added 2 commits August 4, 2022 15:47

using 'dereferenced evaluation structure

fe67355

more verbiage tweaks to output

509ed28

gregsdennis marked this pull request as ready for review August 4, 2022 19:53

correcting the number of output formats

b6799cd

update example locations for evaluationPath and schemaLocation

843a683

handrews reviewed Aug 14, 2022

View reviewed changes

jsonschema-core.xml Outdated Show resolved Hide resolved

add clarification around empty-string error messages

b6ea8a3

gregsdennis added the output label Aug 22, 2022

gregsdennis mentioned this pull request Aug 23, 2022

Support Error Code #1282

Closed

handrews approved these changes Aug 26, 2022

View reviewed changes

gregsdennis merged commit cd06bd5 into main Aug 28, 2022

gregsdennis deleted the output-nodes-should-be-per-subschema-not-per-keyword branch August 28, 2022 20:46

gregsdennis mentioned this pull request Sep 8, 2022

Update output schema to describe new structure #1285

Merged

Julian mentioned this pull request Oct 24, 2022

Support standard output formats python-jsonschema/jsonschema#1008

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Output nodes should be per subschema not per keyword #1249

Output nodes should be per subschema not per keyword #1249

gregsdennis commented Jun 27, 2022 •

edited

Loading

gregsdennis commented Jun 30, 2022

jdesrosiers commented Jul 8, 2022

karenetheridge commented Jul 11, 2022

handrews left a comment

gregsdennis commented Jul 27, 2022

handrews left a comment

handrews Aug 3, 2022

gregsdennis Aug 4, 2022 •

edited

Loading

handrews Aug 14, 2022

jdesrosiers commented Aug 4, 2022

gregsdennis commented Aug 4, 2022

jdesrosiers commented Aug 4, 2022

gregsdennis commented Aug 7, 2022

handrews commented Aug 7, 2022 •

edited

Loading

gregsdennis commented Aug 8, 2022

jdesrosiers commented Aug 8, 2022

handrews left a comment

Output nodes should be per subschema not per keyword #1249

Output nodes should be per subschema not per keyword #1249

Conversation

gregsdennis commented Jun 27, 2022 • edited Loading

Summary

gregsdennis commented Jun 30, 2022

jdesrosiers commented Jul 8, 2022

karenetheridge commented Jul 11, 2022

handrews left a comment

Choose a reason for hiding this comment

gregsdennis commented Jul 27, 2022

handrews left a comment

Choose a reason for hiding this comment

handrews Aug 3, 2022

Choose a reason for hiding this comment

gregsdennis Aug 4, 2022 • edited Loading

Choose a reason for hiding this comment

handrews Aug 14, 2022

Choose a reason for hiding this comment

jdesrosiers commented Aug 4, 2022

gregsdennis commented Aug 4, 2022

jdesrosiers commented Aug 4, 2022

gregsdennis commented Aug 7, 2022

handrews commented Aug 7, 2022 • edited Loading

gregsdennis commented Aug 8, 2022

jdesrosiers commented Aug 8, 2022

handrews left a comment

Choose a reason for hiding this comment

gregsdennis commented Jun 27, 2022 •

edited

Loading

gregsdennis Aug 4, 2022 •

edited

Loading

handrews commented Aug 7, 2022 •

edited

Loading