report: use entity classification to filter third-parties #14697

alexnj · 2023-01-19T20:44:45Z

Report changes split from #14622. This PR rewires third-parties filter checkbox on reports to use LHR.entities entity classification. For legacy reports, it will continue falling back to the origin string match based filter.

Change PR base to main after core: add entity classification of origins to the LHR #14622 lands.

…tonly-extn

connorjclark

FYI, you can find audits that will display the filter by searching for the selector .lh-3p-filter:not([hidden]) in the elements panel. There are fewer than before, so I had to do this to find some relevant audits.

Looking at bootup-time in the sample reports, something unexpected was that "Unattributable" is filtered out when the checkbox is unchecked. We shouldn't consider that to be third-party, I think... Maybe just hardcode to ignore "Unattributable" in _getThirdPartyRows?

report/renderer/report-ui-features.js

Co-authored-by: Connor Clark <cjamcl@google.com>

alexnj · 2023-01-27T20:22:12Z

We shouldn't consider that to be third-party, I think.

This sounds like a bug. Unattributable shouldn't get classified as 3p.

adamraine

LGTM

Co-authored-by: Connor Clark <cjamcl@google.com>

brendankenny

late comments, sorry, maybe good for incorporation into #14655

report/test/renderer/report-utils-test.js

brendankenny · 2023-01-27T22:04:53Z

report/test/renderer/report-utils-test.js

+        // Avoid injecting entity names into audits that would would
+        // make the diff at the end of this test difficult.
+        delete clonedSampleResult.entities;
+


this is a good reason for the split of augmentation vs back compat in #14701

brendankenny · 2023-01-27T22:11:29Z

report/renderer/report-utils.js

+    if (!entityClassification) return;
+    if (audit.details?.type !== 'opportunity' && audit.details?.type !== 'table') {


Intent might be clearer if these checks are moved out into prepareReportResult, so as you're reading that method you know you only need to step into classifyEntities if it's a table or opportunity (right now you have to trust the comment isn't out of date).

In here, the params could then be narrowed, dropping the undefined on entityClassification, and audit could become details of LH.Audit.Details.Opportunity|LH.Audit.Details.Table

These are to keep TS happy in the rest of the function, as we're passing the full audit here. Is there a better way to do it?

I mean something like

diff --git a/report/renderer/report-utils.js b/report/renderer/report-utils.js index a38b52d2b..739cc6210 100644 --- a/report/renderer/report-utils.js +++ b/report/renderer/report-utils.js @@ -91,7 +91,11 @@ class ReportUtils { } // Attach table/opportunity items with entity information. - ReportUtils.classifyEntities(result.entities, audit); + if (result.entities && audit.details) { + if (audit.details.type === 'opportunity' || audit.details.type === 'table') { + ReportUtils.classifyEntities(result.entities, audit.details); + } + } // TODO: convert printf-style displayValue. // Added: #5099, v3 @@ -224,17 +228,12 @@ class ReportUtils { /** * Mark TableItems/OpportunityItems with entity names. - * @param {LH.Result.Entities|undefined} entityClassification - * @param {import('../../types/lhr/audit-result').Result} audit + * @param {LH.Result.Entities} entityClassification + * @param {LH.FormattedIcu<LH.Audit.Details.Opportunity|LH.Audit.Details.Table>} details */ - static classifyEntities(entityClassification, audit) { - if (!entityClassification) return; - if (audit.details?.type !== 'opportunity' && audit.details?.type !== 'table') { - return; - } - + static classifyEntities(entityClassification, details) { // If details.items are already marked with entity attribute during an audit, nothing to do here. - const {items, headings} = audit.details; + const {items, headings} = details; if (!items.length || items.some(item => item.entity)) return; // Identify a URL-locator function that we could call against each item to get its URL.

So I was passing audit.details or more specific params earlier, I think. But I now think it would be beneficial to pass the full audit to get the id as well, in case if any audit needs special handling in future.

report/renderer/report-utils.js

brendankenny · 2023-01-27T22:21:35Z

report/renderer/report-utils.js

+      // Return a function that extracts item.url.
+      return (item) => {
+        const url = item[urlKey];
+        if (typeof url === 'string') return url;


technically table items can override their valueType themselves, so this would just opt them out even if they could still be classified from item[urlKey].value. Maybe a corner case worth dealing with after #14655, though (not sure if any audit actually does this with url)

Not that I've seen in any of our audits. Why would we allow that, vs. introducing a new valueType?

brendankenny · 2023-01-27T22:25:38Z

report/renderer/report-ui-features.js

+        // We rely on entity-classification for new LHRs that support it.
+        if (!rowEl.dataset.entity || rowEl.dataset.entity === firstPartyEntityName) continue;
+      } else {
+        // Without 10.0's entity classification, fallback to the older root domain-based filtering.


Not sure if this was already discussed, but maybe after a certain amount of time (circa 11.0?) we should drop this code? We want to render old LHRs, but I'm not sure 3p filtering of old LHRs is that important after a while

We haven't discussed that. Should we have a tracker issue labeled 11.0?

alexnj · 2023-01-27T22:39:44Z

Added a couple of points there that could use clarity, @brendankenny. I'll add these changes into #14655.

alexnj added 30 commits October 20, 2022 15:14

First cut of entity classification computed artifact.

7833354

some cleanup on the types

0a154eb

Refactor resource-summary audit with third party classification

3212aa6

Refactor unused-javascript to output entity and is-3p flag

c71a8ab

Refactor 3p audits to depend on computed entity classification.

14163bd

Expose entity classification to LH report via a hidden audit.

46d706c

Refine groupBy feature to be more concise.

b5caa4f

Replace domains with homepage

8689903

Revert the LHR grouping changes, after discussing the design with team.

5b67f2e

Add name based lookup to entity classification audit result.

0abc053

Refactor third-party filter to base on entity-classification

170adcb

Attach entity classification to ByteEfficiencyAudit.

5e18f75

Classify bootup-time (Reduce JavaScript execution time) audit

f0674c7

Classify long-tasks (Avoid long main-thread tasks) audit

69725b7

Classify uses-long-cache-ttl audit.

ab5d847

Mark sub-items with entity as well

04bfff8

Fix the regression caused to third-party-summary audit

e8c1c7e

Classify uses-rel-preconnect audit.

0a87d78

Classify all ViolationAudit derived audits.

d953829

Classify no-unload-listeners audit

aea71bf

Classify total-byte-weight audit

cb16f02

Merge remote-tracking branch 'origin/main' into entity-based-3p

1e73c13

Updated components/CSS

e79f44b

Merge remote-tracking branch 'origin/main' into entity-based-3p

6f80513

Some cleanup

d299ae8

Classify valid-source-maps audit

f462dc8

Classify legacy-javascript audit

9189494

Classify all audits that depend on makeOpportunityDetails call

f1d9893

Merge remote-tracking branch 'origin/main' into entity-based-3p

1ad67ac

Explicitly name the lookup tables.

0230896

Update/cleanup comments

2cc6b44

vercel bot deployed to Preview January 25, 2023 22:15 View deployment

alexnj added 2 commits January 25, 2023 16:48

Review changes + Refactor of third-party filter + tests

6f392b8

delete core/util.cjs

2466ec6

vercel bot deployed to Preview January 26, 2023 00:51 View deployment

alexnj added 2 commits January 25, 2023 16:54

Missed entity classification

8a6c2cc

Merge remote-tracking branch 'origin/main' into entity-based-3p-repor…

c2625a4

…tonly-extn

vercel bot deployed to Preview January 26, 2023 00:55 View deployment

vercel bot deployed to Preview January 26, 2023 01:03 View deployment

alexnj requested a review from adamraine January 26, 2023 01:16

Merge remote-tracking branch 'origin/main' into entity-based-3p-repor…

f3a16d3

…tonly-extn

vercel bot deployed to Preview January 27, 2023 00:09 View deployment

connorjclark reviewed Jan 27, 2023

View reviewed changes

report/renderer/report-ui-features.js Outdated Show resolved Hide resolved

report/renderer/report-ui-features.js Outdated Show resolved Hide resolved

Update report/renderer/report-ui-features.js

41aa773

Co-authored-by: Connor Clark <cjamcl@google.com>

vercel bot deployed to Preview January 27, 2023 20:18 View deployment

attempt bugfix: unclassified row items aren't 3ps

e850f7e

vercel bot deployed to Preview January 27, 2023 20:40 View deployment

add testcase: non-classifable urls belong in 1st party.

1265edb

vercel bot deployed to Preview January 27, 2023 21:21 View deployment

connorjclark changed the title ~~report: use entity-classification to filter third-parties~~ report: use entity classification to filter third-parties Jan 27, 2023

adamraine approved these changes Jan 27, 2023

View reviewed changes

Update report/renderer/report-ui-features.js

c69cb3d

Co-authored-by: Connor Clark <cjamcl@google.com>

vercel bot deployed to Preview January 27, 2023 21:50 View deployment

connorjclark approved these changes Jan 27, 2023

View reviewed changes

alexnj merged commit abf268b into main Jan 27, 2023

alexnj deleted the entity-based-3p-reportonly-extn branch January 27, 2023 22:23

brendankenny reviewed Jan 27, 2023

View reviewed changes

alexnj added a commit that referenced this pull request Jan 31, 2023

Review comments from #14697

f4287fc

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

report: use entity classification to filter third-parties #14697

report: use entity classification to filter third-parties #14697

alexnj commented Jan 19, 2023 •

edited

Loading

connorjclark left a comment •

edited

Loading

alexnj commented Jan 27, 2023

adamraine left a comment

brendankenny left a comment

brendankenny Jan 27, 2023

brendankenny Jan 27, 2023 •

edited

Loading

alexnj Jan 27, 2023

brendankenny Jan 27, 2023

alexnj Jan 27, 2023

brendankenny Jan 27, 2023

alexnj Jan 27, 2023

brendankenny Jan 27, 2023

alexnj Jan 27, 2023

alexnj commented Jan 27, 2023

		if (!entityClassification) return;
		if (audit.details?.type !== 'opportunity' && audit.details?.type !== 'table') {

report: use entity classification to filter third-parties #14697

report: use entity classification to filter third-parties #14697

Conversation

alexnj commented Jan 19, 2023 • edited Loading

connorjclark left a comment • edited Loading

Choose a reason for hiding this comment

alexnj commented Jan 27, 2023

adamraine left a comment

Choose a reason for hiding this comment

brendankenny left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

brendankenny Jan 27, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

alexnj commented Jan 27, 2023

alexnj commented Jan 19, 2023 •

edited

Loading

connorjclark left a comment •

edited

Loading

brendankenny Jan 27, 2023 •

edited

Loading