Standardize notebook table-relationship documentation cells
This commit is contained in:
@@ -1486,29 +1486,23 @@
|
||||
"source": [
|
||||
"## Tables Created by This Notebook and Their Relationships\n",
|
||||
"\n",
|
||||
"This notebook creates and/or maintains five PostgreSQL tables in the `public` schema:\n",
|
||||
"\n",
|
||||
"### Tables Created / Maintained\n",
|
||||
"1. `public.fcc_bdc_as_of`\n",
|
||||
"- One row per FCC BDC release date and data type.\n",
|
||||
"- Primary metadata table used to track versioning (`as_of_date`) for downstream loads.\n",
|
||||
"- Release/version metadata by `as_of_date`.\n",
|
||||
"\n",
|
||||
"2. `public.fcc_bdc_files`\n",
|
||||
"- One row per file discovered/downloaded for a release.\n",
|
||||
"- Linked to releases via `as_of_date` and used as file-level lineage/provenance.\n",
|
||||
"- File-level lineage records for each FCC BDC release.\n",
|
||||
"\n",
|
||||
"3. `public.fcc_bdc_broadband_by_datacenter`\n",
|
||||
"- Fact table keyed by `(master_id, as_of_date)` for per-data-center broadband availability metrics.\n",
|
||||
"- Includes scalar broadband fields and summary JSON payloads.\n",
|
||||
"- `master_id` aligns with `public.master_data_centers.master_id`.\n",
|
||||
"- Per-data-center broadband fact table keyed by `(master_id, as_of_date)`.\n",
|
||||
"\n",
|
||||
"4. `public.fcc_bdc_broadband_summary`\n",
|
||||
"- Aggregated summary metrics by release (`as_of_date`) used for QA and reporting.\n",
|
||||
"- Release-level aggregate summary metrics.\n",
|
||||
"\n",
|
||||
"5. `public.fcc_bdc_provider_summary`\n",
|
||||
"- Provider catalog/aggregation table by release (`as_of_date`) with provider class rollups.\n",
|
||||
"\n",
|
||||
"### Relationship Summary\n",
|
||||
"- Release-level provider catalog and provider-class summary metrics.\n",
|
||||
"\n",
|
||||
"### Key Relationships\n",
|
||||
"- `public.fcc_bdc_as_of (as_of_date)`\n",
|
||||
" - 1-to-many -> `public.fcc_bdc_files (as_of_date)`\n",
|
||||
" - 1-to-many -> `public.fcc_bdc_broadband_by_datacenter (as_of_date)`\n",
|
||||
@@ -1518,7 +1512,9 @@
|
||||
"- `public.master_data_centers (master_id)`\n",
|
||||
" - 1-to-many over time -> `public.fcc_bdc_broadband_by_datacenter (master_id, as_of_date)`\n",
|
||||
"\n",
|
||||
"In short: **release metadata (`as_of` + `files`) supports reproducible loads, while per-DC broadband facts and release-level/provider-level summaries support analysis.**"
|
||||
"### Rerun Notes\n",
|
||||
"- The notebook is designed for repeat refreshes as new FCC releases arrive.\n",
|
||||
"- Use `as_of_date` as the version key when comparing snapshots over time."
|
||||
]
|
||||
}
|
||||
],
|
||||
|
||||
Reference in New Issue
Block a user