Document process for new custom metrics
See original GitHub issueCustom metrics are a powerful way to extract insights from pages directly from the JS runtime context. This is an especially useful time to analyze things that depend on the DOM, for example, like counting HTML element and attribute usage. These are things that can be approximated using regular expressions in the response bodies, but it becomes extremely costly considering that the latest desktop and mobile response_bodies tables are over 11 TB combined.
I’d like to use this issue to discuss what the criteria should be for the custom metrics. What are the limitations of doing more work in custom metrics? Should we be actively cleaning up unused custom metrics? Should we have a policy for allowing one-time custom metrics by request?
cc @bkardell
Issue Analytics
- State:
- Created 4 years ago
- Comments:7 (5 by maintainers)

Top Related StackOverflow Question
Can you combine a bunch of the checks into a single collection script so we’re not running dozens of scripts? It can return JSON and it should be passed through. I’d be surprised if even dozens of metrics added more than a few seconds to the crawl and should be a non-issue but be mindful of any that look particularly expensive (exponential based on size of DOM for example).
On Sat, May 25, 2019 at 4:56 PM Rick Viscomi notifications@github.com wrote:
Fixed by https://github.com/HTTPArchive/legacy.httparchive.org/blob/master/custom_metrics/README.md