fix(DENG-10248): update shredder to support usage reporting derived #8589

kik-kik · 2025-12-11T16:18:15Z

fix(DENG-10248): update shredder to support usage reporting derived

This change is to ensure usage_reporting derived datasets get shredded.

…ts get shredded

kik-kik · 2025-12-11T16:18:46Z

tests/shredder/test_config.py

I'm not actually sure this test_config does anything or if these changes are even needed / useful.

The derived tables need to be added to the FakeClient in this file. This should be tested since it's easy to get wrong (as it is now)

I just took a closer look at the tests and the setup isn't trivial so I can update the tests if you don't want to do it

BenWu · 2025-12-11T17:29:04Z

bigquery_etl/shredder/config.py

+                field=(USAGE_PROFILE_ID,) * len(sources[table.dataset_id]),
+            ): sources[table.dataset_id]
+            for table in glean_derived_tables
+            if any(field.name == USAGE_PROFILE_ID for field in table.schema)


Event though it's unlikely, I suggest adding a check for client_id as well to ensure it doesn't duplicate any table added above.

Suggested change

if any(field.name == USAGE_PROFILE_ID for field in table.schema)

if any(field.name == USAGE_PROFILE_ID for field in table.schema)

and all(field.name != CLIENT_ID for field in table.schema)

BenWu · 2025-12-11T17:40:41Z

bigquery_etl/shredder/config.py

+                field=(USAGE_PROFILE_ID,) * len(sources[table.dataset_id]),
+            ): sources[table.dataset_id]


This would use the deletion request ping. The derived datasets needs to be added to usage_reporting_sources above

Suggested change

field=(USAGE_PROFILE_ID,) * len(sources[table.dataset_id]),

): sources[table.dataset_id]

field=(USAGE_PROFILE_ID,) * len(usage_reporting_sources[table.dataset_id]),

): usage_reporting_sources[table.dataset_id]

BenWu · 2025-12-11T17:48:16Z

tests/shredder/test_config.py

The derived tables need to be added to the FakeClient in this file. This should be tested since it's easy to get wrong (as it is now)

kik-kik added 2 commits December 10, 2025 17:17

feat: update shredder config to ensure usage_reporting derived datase…

7fc25ad

…ts get shredded

feat: add usage_reporting to shredder test_config.py

d12ec8a

kik-kik self-assigned this Dec 11, 2025

kik-kik requested a review from a team as a code owner December 11, 2025 16:18

kik-kik added the bug Something isn't working label Dec 11, 2025

kik-kik commented Dec 11, 2025

View reviewed changes

BenWu reviewed Dec 11, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(DENG-10248): update shredder to support usage reporting derived #8589

fix(DENG-10248): update shredder to support usage reporting derived #8589

Uh oh!

kik-kik commented Dec 11, 2025

Uh oh!

kik-kik Dec 11, 2025

Uh oh!

BenWu Dec 11, 2025

Uh oh!

BenWu Dec 11, 2025

Uh oh!

BenWu Dec 11, 2025

Uh oh!

BenWu Dec 11, 2025

Uh oh!

BenWu Dec 11, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	if any(field.name == USAGE_PROFILE_ID for field in table.schema)
	if any(field.name == USAGE_PROFILE_ID for field in table.schema)
	and all(field.name != CLIENT_ID for field in table.schema)

		field=(USAGE_PROFILE_ID,) * len(sources[table.dataset_id]),
		): sources[table.dataset_id]

fix(DENG-10248): update shredder to support usage reporting derived #8589

Are you sure you want to change the base?

fix(DENG-10248): update shredder to support usage reporting derived #8589

Uh oh!

Conversation

kik-kik commented Dec 11, 2025

fix(DENG-10248): update shredder to support usage reporting derived

Uh oh!

kik-kik Dec 11, 2025

Choose a reason for hiding this comment

Uh oh!

BenWu Dec 11, 2025

Choose a reason for hiding this comment

Uh oh!

BenWu Dec 11, 2025

Choose a reason for hiding this comment

Uh oh!

BenWu Dec 11, 2025

Choose a reason for hiding this comment

Uh oh!

BenWu Dec 11, 2025

Choose a reason for hiding this comment

Uh oh!

BenWu Dec 11, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants