chore: add April 2026 data #263
|
Thanks so much @akalyuzhnyi! Wow, everything got so much worse though. @rviscomi or @paulirish, any chance you know of a major change to HTTPArchive hardware or test runs in April? Or is this a bug on our end that we should investigate further?
|
Nothing changed with the HTTP Archive that I'm aware of on the collection side. The instance types, configuration, and images haven't been touched in close to a year, and the only agent updates have been around updating feature names and Wappalyzer detections.
Can you explain this more? Not sure what I'm looking at to see that level of change.
|
Thanks very much all! I'll look more deeply into this on our end then. Initial findings...
So pretty much all scripts are duplicated 4x in this PR, but looking at specific Lighthouse reports from the April sample on HTTPArchive, nothing obvious jumps out (no duplicated URLs, no URLs that aren't part of the page, etc.). Either something in the report structure changed or we've gone awry in our querying on this one.
patrickhulce left a comment
@akalyuzhnyi I think something went wrong with the querying process here. Do your intermediate third_party_web tables have duplicate script entries?
I can't verify these results against the actual HTTPArchive data in BigQuery.
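For what it's worth, a rough sketch of the kind of duplicate check I have in mind, assuming the intermediate rows can be exported as dicts. The `page` and `url` keys and the sample rows are hypothetical, not the actual table schema:

```python
from collections import Counter


def find_duplicate_scripts(rows):
    """Return {(page, script_url): count} for pairs that appear more than once.

    Assumes each row is a dict with hypothetical "page" and "url" keys.
    """
    counts = Counter((row["page"], row["url"]) for row in rows)
    return {key: n for key, n in counts.items() if n > 1}


# Hypothetical sample: one script repeated 4x on a page, one appearing once.
rows = [
    {"page": "https://example.com/", "url": "https://cdn.example.net/a.js"},
    {"page": "https://example.com/", "url": "https://cdn.example.net/a.js"},
    {"page": "https://example.com/", "url": "https://cdn.example.net/a.js"},
    {"page": "https://example.com/", "url": "https://cdn.example.net/a.js"},
    {"page": "https://example.com/", "url": "https://cdn.example.net/b.js"},
]

duplicates = find_duplicate_scripts(rows)
for (page, url), n in duplicates.items():
    print(f"{url} appears {n}x on {page}")
```

If every script comes back with the same multiplier, that would point at a join or unnest fanning out rows rather than at the underlying reports.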
Hello @patrickhulce
I'm updating the data for April 2026. Could you do a review, please?