Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Efficient telemetry for requests that are rejected at Kestrel layer.

See original GitHub issue

When running on Kestrel, requests typically fall into two buckets:

Requests that pass through Kestrel and reach the application middleware.
Requests that are rejected within Kestrel for various reasons and never reach the application layer.

This is a feature request to enable efficient telemetry/logging for the 2nd category to increase observability into such failures.

Let’s take a look at IIS/Http.Sys’s approach to such failures:

In the case of Http.Sys, from https://learning.oreilly.com/library/view/internet-information-services/9780735624412/ch15s06.html, “HTTPERR log records all errors that are not handed off to a valid worker process, typically responses to clients, connection time-outs, and orphaned requests.” In other words, requests that are not serviced by the application/rejected at the Http.Sys layer are logged.

Schema of a typical Http.Sys error log looks like this : https://docs.microsoft.com/en-us/troubleshoot/aspnet/error-logging-http-apis#format-of-the-http-api-error-logs

Similarly, in IIS, it is possible to configure FailedRequestTracing to log requests that fail in IIS layer and never reach the application middleware : https://www.jeremymorgan.com/blog/iis-devops/failed-request-tracing-iis/

A concrete example of where such logging is useful:

When Kestrel receives an authority form request where the request start line and host header are mismatched, Kestrel throws the following exception and returns a 400.0 (Stacktrace shows "google.com because that’s the Host header I sent, point is valid for any mismatch). This request does not enter the application middleware and there is no observability for this error (unless Trace logging is turned on).

Microsoft.AspNetCore.Server.Kestrel.Core.BadHttpRequestException: Invalid Host header: 'google.com'
   at Microsoft.AspNetCore.Server.Kestrel.Core.KestrelBadHttpRequestException.Throw(RequestRejectionReason reason, String detail)
   at Microsoft.AspNetCore.Server.Kestrel.Core.Internal.Http.Http1Connection.ValidateNonOriginHostHeader(String hostText)
   at Microsoft.AspNetCore.Server.Kestrel.Core.Internal.Http.Http1Connection.EnsureHostHeaderExists()
   at Microsoft.AspNetCore.Server.Kestrel.Core.Internal.Http.Http1Connection.TryParseRequest(ReadResult result, Boolean& endConnection)
   at Microsoft.AspNetCore.Server.Kestrel.Core.Internal.Http.HttpProtocol.ProcessRequests[TContext](IHttpApplication`1 application)
   at Microsoft.AspNetCore.Server.Kestrel.Core.Internal.Http.HttpProtocol.ProcessRequestsAsync[TContext](IHttpApplication`1 application)

Specifically, this particular error is coming from this line : https://github.com/dotnet/aspnetcore/blob/be5ba495b4872fa055385463da655ae4c4318898/src/Servers/Kestrel/Core/src/Internal/Http/Http1Connection.cs#L606-607

Repro:

Download ASP.NET Core sample app and fire up Kestrel in Visual Studio.
Use any client (I used wrk2), send a authority form request to Kestrel with mismatched HostHeader and request start line as shown below.
Your client will observe a 400.0 Status Code, but on the Server side, since the request does not enter the application middleware, there is no logging of this error.

The above is just one example, there are other examples (please let me know if you’d want me to provide more examples) where requests are terminated/rejected within Kestrel code and not handed off to application middleware.

The ask is to have functionality, where requests that are not serviced by the application middleware are logged efficiently. This telemetry will be hugely beneficial to critical large-scale distributed services.

Issue Analytics

State:
Created 2 years ago
Reactions:15
Comments:9 (8 by maintainers)

Top GitHub Comments

1reaction

shirhatticommented, Jul 1, 2021

Based on our offline discussion:

Let’s look at emitting new events from KestrelEventSource for failure conditions. We’ll introduce a new DTO for logging that’ll have the fields based on @avparuch’s suggestion

TimeStamp, ClientIP, Client Port, ServerIP, Server Port, Server Name (from SNI), Host Header, Verb, Protocol, Uri, StatusCode, Reason.

Reason phrase is probably just the stringified version of the RequestRejectionReason enum
Fields in the DTO may be null (or have a default/sentinel value).
We’ll re-emit all the fields we emit in the ConnectionStart event.
We may have duplication between Events raised here and those observed by middleware. We’re not making any efforts to de-dupe.

Open questions:

Is adding an extra parameter to the WriteEvent call considered breaking? We’d rely on this behavior to add more info to logs in the future
How would the W3CLogger consume this? Via a custom EventListener?

Given that this doesn’t require new API surface, we can consider doing this in 6.0 after the preview7 release.

1reaction

halter73commented, Jun 30, 2021

Do you want an EventSource event(s), specific log event(s) (or category) or something else? Would we want events for anything that looks like requests but don’t get to middleware for whatever reason (e.g. invalid host header, unexpected character when parsing, etc…)? Do you want something specific for just invalid host headers?

We do currently have the ConnectionBadRequest log for all invalid requests on an otherwise healthy connection (Note: Middleware might still have run for a something logged as ConnectionBadRequest if there was an invalid request body. For anything earlier, middleware is not run.), and RequestProcessingError for I/O errors during request processing. Starting in .NET 6, both will be under the new Microsoft.AspNetCore.Server.Kestrel.BadRequests logger category.

We could consider trying to formalize Kestrel’s internal RequestRejectionReasons as part of this but I’m hesitant given we’ve gotten by without doing so this far. A lot of the reasons are specific to a certain versions of HTTP and not something you’d ever expect to see from non-buggy HTTP clients. We’ve found that logging a message describing the problem is sufficient for most use cases. I don’t know why any app-level logic would care about things like whether the client sent a BadChunkSuffix or BadChunkSizeData. This might be interesting for diagnostic purposes, but not much else.

In any case, what details would you want for these events? The less specific the event, the harder it will be to always provide details, but maybe some of the details could be optional.

Top Results From Across the Web

Request to make Kestrel BadHttpRequestException. ...

I'm implementing metrics telemetry for my web application and would like to have counters for different reasons we reject requests--whether ...

Need to pass state about rejected requests for W3C ...

It is currently impossible to implement W3C access logs as a middleware since certain requests are rejected at the server layer.

Networking Telemetry in .NET

Time to process a request in ASP.NET Core running Kestrel. This is currently the most accurate way to measure the duration of a...

NET Observability with OpenTelemetry

When you run an application, you want to know how well the app is performing and to detect potential problems before they become...

The Case for C# and .NET

Fair question, devs who do not want to share telemetry for the dotnet core SDK simply need to set an environment variable.

Troubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.

Start Free

Top Related Reddit Thread

No results found

Top Related Tweet

No results found

Top Related Dev.to Post

No results found

Efficient telemetry for requests that are rejected at Kestrel layer.

Issue Analytics

Top GitHub Comments

Top Results From Across the Web

Top Related Medium Post

Top Related StackOverflow Question

Troubleshoot Live Code

Top Related Reddit Thread

Top Related Hackernoon Post

Top Related Tweet

Top Related Dev.to Post

Top Related Hashnode Post

Authorization failure with resource - lack of documentation/warning/implementation

Update ASP.NET Core C# project templates to use new minimal hosting APIs in .NET 6