update HttpRequestKernel to use new parser; support variable expressions in request body #3122

jonsequitur · 2023-08-08T03:01:21Z

No description provided.

src/Microsoft.DotNet.Interactive.HttpRequestParser/HttpSyntaxNode.cs

shyamnamboodiripad · 2023-08-09T20:46:23Z

src/Microsoft.DotNet.Interactive.HttpRequest.Tests/Utility/AssertionExtensions.cs

+
+    public static AndWhichConstraint<ObjectAssertions, T> ContainSingle<T>(
+        this GenericCollectionAssertions<HttpSyntaxNodeOrToken> should)
+        where T : HttpSyntaxNode


Can we remove the overload on line 13 since HttpSyntaxNodeOrToken is a base type of HttpSyntaxNode (and since IReadOnlyList supports covariance)?
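
For reference, the covariance mentioned here can be sketched with stand-in types. `NodeOrToken` and `Node` below are hypothetical placeholders for `HttpSyntaxNodeOrToken` and `HttpSyntaxNode`, not the PR's classes. The sketch only shows that `IReadOnlyList<out T>` accepts a list of the derived type; whether that is enough to drop the overload also depends on how the assertion library's `Should()` extension resolves, which is not covered here.

```csharp
using System.Collections.Generic;

// Hypothetical stand-ins for HttpSyntaxNodeOrToken (base) and HttpSyntaxNode (derived).
abstract class NodeOrToken { }
class Node : NodeOrToken { }

static class CovarianceSketch
{
    // Declared against the base element type only.
    public static int CountItems(IReadOnlyList<NodeOrToken> items) => items.Count;

    public static int Demo()
    {
        IReadOnlyList<Node> nodes = new List<Node> { new Node(), new Node() };

        // No cast is needed: IReadOnlyList<out T> is covariant in T,
        // so IReadOnlyList<Node> converts to IReadOnlyList<NodeOrToken>.
        return CountItems(nodes);
    }
}
```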

src/Microsoft.DotNet.Interactive.HttpRequestParser/HttpSyntaxNodeOrToken.cs

shyamnamboodiripad · 2023-08-09T21:03:23Z

src/Microsoft.DotNet.Interactive.HttpRequestParser/HttpSyntaxNode.cs

+    /// <summary>
+    /// Gets the text of the current node, including trivia.
+    /// </summary>
+    public string FullText => SourceText.ToString(FullSpan);


It is a bit strange that FullSpan is available on the base HttpSyntaxNodeOrToken but FullText is not.

If we think that FullSpan and FullText are only meaningful for HttpSyntaxNode, perhaps we should also move FullSpan down to HttpSyntaxNode.

Alternatively, we can move FullText up to the base type so that a caller does not have to think about whether they are dealing with a node or a token when they want to get its full text.

This makes sense, but the meaning of the two is also different. On a token, Span really means the full span. FullSpan is used for things like FindNode and GrowSpan. Removing HttpSyntaxNodeOrToken.FullSpan results in needing to switch on the two types in a number of places.

I see... Should we also move FullText to the base type then so that the API is consistent?

src/Microsoft.DotNet.Interactive.HttpRequest.Tests/ParserTests.Comments.cs

src/Microsoft.DotNet.Interactive.HttpRequest.Tests/ParserTests.Method.cs

src/Microsoft.DotNet.Interactive.HttpRequest.Tests/ParserTests.Request.cs

src/Microsoft.DotNet.Interactive.HttpRequest.Tests/ParserTests.Trivia.cs

shyamnamboodiripad · 2023-08-10T17:14:58Z

src/Microsoft.DotNet.Interactive.HttpRequestParser/HttpRequestNode.cs

@@ -70,10 +74,9 @@ internal class HttpRequestNode : HttpSyntaxNode
    public HttpBindingResult<HttpRequestMessage> TryGetHttpRequestMessage(HttpBindingDelegate bind)
    {
        var request = new HttpRequestMessage();
-        var diagnostics = new List<Diagnostic>();
-        var success = true;
+        var diagnostics = new List<Diagnostic>(base.GetDiagnostics());


The binding methods for the child nodes (such as TryGetUri and TryGetBody) should probably also call base.GetDiagnostics() to include syntax errors (and return false if there are any diagnostics with severity Error).

I guess that could lead to some duplication at higher levels (unless we change the list to a hashset)...

Yeah, I deliberately kept these concerns separate.

Thinking about this more, it may be better for the binding API calls to return only binding errors. The syntax errors are already available on the node that the caller is trying to bind.

This may be more flexible from an API standpoint and would also keep the behavior consistent regardless of which construct is being bound... Otherwise, it forces callers to reason about which construct they are binding and whether or not they should fetch the syntax diagnostics separately.
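
A rough sketch of that separation, using simplified hypothetical types (`Diag`, `UrlNodeSketch`, and `DiagnosticsSketch` are illustrative names, not the PR's API): the node exposes its syntax diagnostics, the binding call returns only binding diagnostics, and the caller decides when to combine the two.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// Hypothetical, simplified diagnostic type.
sealed class Diag
{
    public Diag(string message, bool isError) { Message = message; IsError = isError; }
    public string Message { get; }
    public bool IsError { get; }
}

// Hypothetical node: syntax diagnostics are recorded at parse time.
sealed class UrlNodeSketch
{
    private readonly List<Diag> _syntaxDiagnostics = new List<Diag>();

    public UrlNodeSketch(string text)
    {
        Text = text;
        if (text.Contains(" "))
        {
            _syntaxDiagnostics.Add(new Diag("Unexpected whitespace in URL", true));
        }
    }

    public string Text { get; }

    // Syntax errors stay on the node.
    public IReadOnlyList<Diag> GetDiagnostics() => _syntaxDiagnostics;

    // Binding reports only binding-time problems.
    public IReadOnlyList<Diag> TryGetUri(out Uri uri)
    {
        var diagnostics = new List<Diag>();
        if (!Uri.TryCreate(Text, UriKind.Absolute, out uri))
        {
            diagnostics.Add(new Diag("Invalid URI", true));
        }
        return diagnostics;
    }
}

static class DiagnosticsSketch
{
    // The caller combines both sources when it needs the full picture.
    public static bool CanSend(UrlNodeSketch node)
    {
        var bindingDiagnostics = node.TryGetUri(out _);
        return !node.GetDiagnostics().Concat(bindingDiagnostics).Any(d => d.IsError);
    }
}
```

With this shape, no diagnostic is reported twice, at the cost of every caller having to remember to ask for both sets.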

src/Microsoft.DotNet.Interactive.HttpRequestParser/HttpRequestNode.cs

shyamnamboodiripad · 2023-08-10T17:22:37Z

src/Microsoft.DotNet.Interactive.HttpRequestParser/HttpRequestNode.cs

            diagnostics.AddRange(uriBindingResult.Diagnostics);
        }

-        if (success)
+        var headers =
+            HeadersNode?.HeaderNodes.Select(h => new KeyValuePair<string, string>(h.NameNode.Text, h.ValueNode.Text)).ToArray()


The headers could also contain expressions, but I guess we will flesh that out in future PRs. cc @bleaphar

src/Microsoft.DotNet.Interactive.HttpRequestParser/HttpRequestNode.cs

shyamnamboodiripad · 2023-08-10T17:37:04Z

src/Microsoft.DotNet.Interactive.HttpRequestParser/HttpRequestNode.cs

+                case "content-type":
+                    if (request.Content is null)
+                    {
+                        request.Content = new StringContent("");


Can this be done globally in the else clause for the if block on line 136 below instead (i.e. always set to empty string if a parsed body is empty)?

The null check for request.Content is also confusing since we have not touched request.Content above this line. Did you mean to check something else perhaps?

No, I meant to check this. Coverage indicates that request.Content is null 100% of the time here. But I realized there's a bug: this header can get overwritten when we set the Content again for the body below, so some refactoring is needed.

Ah does the System.Net.Http API try to deduce and set the content-type header implicitly when Content is set below? That is certainly not very intuitive 😄

I agree this needs some refactoring... I think we can delete this if block for now as it is dead code.

The code isn't dead code. 100% of tests enter the if block.

Ah does the System.Net.Http API try to deduce and set the content-type header implicitly when Content is set below?

It doesn't and there's no way it reliably could.
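
The overwrite mentioned in this thread can be illustrated with a small standalone sketch. It assumes only the public System.Net.Http API; `ContentTypeSketch` and the example URL are hypothetical. Content headers such as Content-Type are stored on the `HttpContent` object, so a value set on a placeholder content does not carry over when `request.Content` is assigned again.

```csharp
using System.Net.Http;
using System.Net.Http.Headers;
using System.Text;

static class ContentTypeSketch
{
    // Sets Content-Type on a placeholder, then replaces the content.
    public static string OverwrittenMediaType()
    {
        var request = new HttpRequestMessage(HttpMethod.Post, "https://example.com/");
        request.Content = new StringContent("");
        request.Content.Headers.ContentType = new MediaTypeHeaderValue("application/json");

        // The new content object carries its own headers; the value set above is gone.
        request.Content = new StringContent("{\"a\":1}");
        return request.Content.Headers.ContentType?.MediaType;
    }

    // Creates the content once, with the body and media type together.
    public static string PreservedMediaType()
    {
        var request = new HttpRequestMessage(HttpMethod.Post, "https://example.com/");
        request.Content = new StringContent("{\"a\":1}", Encoding.UTF8, "application/json");
        return request.Content.Headers.ContentType?.MediaType;
    }
}
```

One possible refactoring, under these assumptions, is to collect the header value first and create the content a single time once the body is known.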

shyamnamboodiripad · 2023-08-10T17:52:30Z

src/Microsoft.DotNet.Interactive.HttpRequestParser/HttpSyntaxNode.cs

+                .FirstOrDefault(n => n.IsSignificant);
+
+            var lastSignificantNodeOrToken = ChildNodesAndTokens
+                .LastOrDefault(n => n.IsSignificant);


I think we can improve this part by remembering the first and last significant children in fields as children are added inside Add below. That way we won't need to traverse all the child nodes here each time a child is added.
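
A minimal sketch of that idea, assuming children are always appended in source order. `ChildSketch` and `ParentSketch` are hypothetical types; only the `IsSignificant` name comes from the diff above.

```csharp
using System.Collections.Generic;

// Hypothetical minimal child with a position and a significance flag.
sealed class ChildSketch
{
    public ChildSketch(int start, int length, bool isSignificant)
    {
        Start = start;
        Length = length;
        IsSignificant = isSignificant;
    }

    public int Start { get; }
    public int Length { get; }
    public bool IsSignificant { get; }
    public int End => Start + Length;
}

sealed class ParentSketch
{
    private readonly List<ChildSketch> _children = new List<ChildSketch>();
    private ChildSketch _firstSignificant;
    private ChildSketch _lastSignificant;

    public void Add(ChildSketch child)
    {
        _children.Add(child);

        if (!child.IsSignificant)
        {
            return;
        }

        // Constant-time bookkeeping instead of rescanning all children.
        if (_firstSignificant == null)
        {
            _firstSignificant = child;
        }
        _lastSignificant = child;
    }

    // Span boundaries of the significant content; 0 when there is none.
    public int SpanStart => _firstSignificant == null ? 0 : _firstSignificant.Start;
    public int SpanEnd => _lastSignificant == null ? 0 : _lastSignificant.End;
}
```

Each `Add` then costs constant time, whereas calling `FirstOrDefault` and `LastOrDefault` on every addition rescans the child list.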

src/Microsoft.DotNet.Interactive.HttpRequestParser/HttpRequestParser.cs

src/Microsoft.DotNet.Interactive.HttpRequest/HttpRequestKernel.cs

shyamnamboodiripad · 2023-08-10T18:26:19Z

src/Microsoft.DotNet.Interactive.HttpRequest/HttpRequestKernel.cs

-        var parsedRequests = new List<ParsedHttpRequest>();
-
-        foreach (var (request, diagnostics) in InterpolateAndGetDiagnostics(requests))
+        foreach (var expressionNode in parseResult.SyntaxTree.RootNode.DescendantNodesAndTokensAndSelf().OfType<HttpExpressionNode>())


Why not just call parseResult.TryGetRequest() and get the diagnostics that way (and throw away the request in the success case)? It seems a bit strange to repeat the binding logic here, and keeping the set of diagnostics the same for both code paths may prove cumbersome.

For example, we may add more specific diagnostics for bad URIs / bad header declarations inside the TryGetUri / TryGetHeaders code path. Some of those diagnostics may be reported in URIs / headers with no embedded expressions (but would still be diagnostics that are reported at bind-time). It would be difficult to replicate the same diagnostics over here since this code only tries to bind embedded expressions (but does not bind individual parts of the syntax).

Some of those diagnostics may be reported in URIs / headers with no embedded expressions (but would still be diagnostics that are reported at bind-time).

The block before this (parseResult.GetDiagnostics()) is getting the syntactic diagnostics and this code is getting the symbolic diagnostics. My thinking was that keeping them as separate steps is useful in this code path (i.e. in the context of RequestDiagnostics) because the latter will sometimes include side effects such as UI prompts which we shouldn't trigger here.

What I meant is that there could be more to the symbolic diagnostics beyond variable resolution and this loop only seems to be handling variable resolution.

The TryGetRequest API (and other APIs it calls such as TryGetUri) would already be considering all such diagnostics - so my thinking was that it may be better to rely exclusively on that API and centralize all validation / binding logic over there.

What I meant is that there could be more to the symbolic diagnostics beyond variable resolution and this loop only seems to be handling variable resolution.

Do we have an example of this?

My concern about binding those types to a specific API call is that it's also not necessarily correct. TryGetHttpRequest isn't the only possible use of the diagnostics request, so using it here feels odd from a layering perspective.

One example may be if we add validation for supported version numbers for HttpVersionNode. Another may be if we need to add validation for specific values that are supported / unsupported for HeaderValueNode depending on the HeaderNameNode of the corresponding header...

I wonder if we should create a more general TryBind() API that performs the same binding steps as TryGetHttpRequest() without creating / returning the System.Net.Http.* objects. However, I feel some of these validations may require calling into System.Net.Http regardless, so we may not be saving much...

I would expect these to be directly handled by the parser, since they don't change often and they don't vary based on the host environment. These are closer to syntactic than to symbolic validations.

shyamnamboodiripad

@jonsequitur The changes mostly look great 👍🏾

I've added a few comments that may be worth a look. Some of those (such as a potential perf improvement for GrowSpan()) may be worth fixing up in follow up PRs.

update kernel to use new parser; support expressions in req body

104329e

jonsequitur requested review from shyamnamboodiripad and bleaphar August 8, 2023 03:01

FullText and FullSpan; exclude comments from Span

c8d8326

shyamnamboodiripad reviewed Aug 8, 2023

src/Microsoft.DotNet.Interactive.HttpRequestParser/HttpSyntaxNode.cs (outdated)

shyamnamboodiripad reviewed Aug 8, 2023

src/Microsoft.DotNet.Interactive.HttpRequestParser/HttpSyntaxNode.cs (outdated)

jonsequitur added 4 commits August 8, 2023 16:20

pre-calculate IsSignificant and Span

e92fd65

prevent kernel from sending requests when errors are present

24f4431

cleanup

c767844

remove ParsedHttpRequest type

193a1bf

jonsequitur marked this pull request as ready for review August 9, 2023 01:24

jonsequitur mentioned this pull request Aug 9, 2023

Add HTTP as cell type #3126

Closed

jonsequitur added 4 commits August 8, 2023 19:55

make SyntaxTree.RootNode non-nullable

8656e37

fix nullability warning

970deea

clean up tests to take kernel extension out of the loop

d919c76

improve variable sharing, add support for RequestValueInfos

ab3900f

jonsequitur enabled auto-merge (squash) August 9, 2023 18:29

shyamnamboodiripad reviewed Aug 9, 2023

src/Microsoft.DotNet.Interactive.HttpRequestParser/HttpSyntaxNodeOrToken.cs (outdated)

shyamnamboodiripad reviewed Aug 9, 2023

src/Microsoft.DotNet.Interactive.HttpRequest.Tests/ParserTests.Comments.cs (outdated)

shyamnamboodiripad reviewed Aug 9, 2023

src/Microsoft.DotNet.Interactive.HttpRequest.Tests/ParserTests.Comments.cs (outdated)

shyamnamboodiripad reviewed Aug 9, 2023

src/Microsoft.DotNet.Interactive.HttpRequest.Tests/ParserTests.Method.cs (outdated)

shyamnamboodiripad reviewed Aug 9, 2023

src/Microsoft.DotNet.Interactive.HttpRequest.Tests/ParserTests.Request.cs (outdated)

shyamnamboodiripad reviewed Aug 9, 2023

src/Microsoft.DotNet.Interactive.HttpRequest.Tests/ParserTests.Trivia.cs

jonsequitur added 2 commits August 9, 2023 15:04

PR comments and compiler warning fix

e410a89

update API baseline

2ba80e1

jonsequitur requested review from shyamnamboodiripad and colombod August 9, 2023 23:50

colombod approved these changes Aug 10, 2023

jonsequitur merged commit a554838 into dotnet:main Aug 10, 2023
4 checks passed