Standardize notebook table-relationship documentation cells

2026-05-22 14:21:51 -07:00
parent c95f22fcdb
commit 03239ad007
9 changed files with 147 additions and 191 deletions

@@ -1486,29 +1486,23 @@
 "source": [
 "## Tables Created by This Notebook and Their Relationships\n",
 "\n",
-"This notebook creates and/or maintains five PostgreSQL tables in the `public` schema:\n",
-"\n",
 "### Tables Created / Maintained\n",
 "1. `public.fcc_bdc_as_of`\n",
-"- One row per FCC BDC release date and data type.\n",
-"- Primary metadata table used to track versioning (`as_of_date`) for downstream loads.\n",
+"- Release/version metadata by `as_of_date`.\n",
 "\n",
 "2. `public.fcc_bdc_files`\n",
-"- One row per file discovered/downloaded for a release.\n",
-"- Linked to releases via `as_of_date` and used as file-level lineage/provenance.\n",
+"- File-level lineage records for each FCC BDC release.\n",
 "\n",
 "3. `public.fcc_bdc_broadband_by_datacenter`\n",
-"- Fact table keyed by `(master_id, as_of_date)` for per-data-center broadband availability metrics.\n",
-"- Includes scalar broadband fields and summary JSON payloads.\n",
-"- `master_id` aligns with `public.master_data_centers.master_id`.\n",
+"- Per-data-center broadband fact table keyed by `(master_id, as_of_date)`.\n",
 "\n",
 "4. `public.fcc_bdc_broadband_summary`\n",
-"- Aggregated summary metrics by release (`as_of_date`) used for QA and reporting.\n",
+"- Release-level aggregate summary metrics.\n",
 "\n",
 "5. `public.fcc_bdc_provider_summary`\n",
-"- Provider catalog/aggregation table by release (`as_of_date`) with provider class rollups.\n",
-"\n",
-"### Relationship Summary\n",
+"- Release-level provider catalog and provider-class summary metrics.\n",
+"\n",
+"### Key Relationships\n",
 "- `public.fcc_bdc_as_of (as_of_date)`\n",
 " - 1-to-many -> `public.fcc_bdc_files (as_of_date)`\n",
 " - 1-to-many -> `public.fcc_bdc_broadband_by_datacenter (as_of_date)`\n",
@@ -1518,7 +1512,9 @@
 "- `public.master_data_centers (master_id)`\n",
 " - 1-to-many over time -> `public.fcc_bdc_broadband_by_datacenter (master_id, as_of_date)`\n",
 "\n",
-"In short: **release metadata (`as_of` + `files`) supports reproducible loads, while per-DC broadband facts and release-level/provider-level summaries support analysis.**"
+"### Rerun Notes\n",
+"- The notebook is designed for repeat refreshes as new FCC releases arrive.\n",
+"- Use `as_of_date` as the version key when comparing snapshots over time."
 ]
 }
 ],
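
The relationships and rerun note documented in this cell can be sketched as a small, self-contained Python example. This is only an illustration under stated assumptions, not the notebook's actual code: it uses an in-memory SQLite database as a stand-in for PostgreSQL, drops the `public.` schema prefix, treats `as_of_date` alone as the key of `fcc_bdc_as_of`, and invents a `provider_count` metric column, the helper names `build_demo_db` and `compare_snapshots`, and all sample rows.

```python
import sqlite3


def build_demo_db():
    """Create a simplified, hypothetical copy of three of the documented tables."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE fcc_bdc_as_of (as_of_date TEXT PRIMARY KEY);
        CREATE TABLE master_data_centers (master_id INTEGER PRIMARY KEY);
        CREATE TABLE fcc_bdc_broadband_by_datacenter (
            master_id INTEGER REFERENCES master_data_centers (master_id),
            as_of_date TEXT REFERENCES fcc_bdc_as_of (as_of_date),
            provider_count INTEGER,  -- hypothetical metric column
            PRIMARY KEY (master_id, as_of_date)
        );
        """
    )
    # Made-up demo data: two releases, two data centers.
    conn.executemany(
        "INSERT INTO fcc_bdc_as_of VALUES (?)",
        [("2024-06-30",), ("2024-12-31",)],
    )
    conn.executemany(
        "INSERT INTO master_data_centers VALUES (?)",
        [(1,), (2,)],
    )
    conn.executemany(
        "INSERT INTO fcc_bdc_broadband_by_datacenter VALUES (?, ?, ?)",
        [
            (1, "2024-06-30", 3),
            (1, "2024-12-31", 5),
            (2, "2024-06-30", 4),
            (2, "2024-12-31", 4),
        ],
    )
    return conn


def compare_snapshots(conn, old_date, new_date):
    """Return (master_id, old_value, new_value, delta) for two as_of_date snapshots."""
    rows = conn.execute(
        """
        SELECT o.master_id, o.provider_count, n.provider_count
        FROM fcc_bdc_broadband_by_datacenter AS o
        JOIN fcc_bdc_broadband_by_datacenter AS n
          ON n.master_id = o.master_id
        WHERE o.as_of_date = ? AND n.as_of_date = ?
        ORDER BY o.master_id
        """,
        (old_date, new_date),
    ).fetchall()
    return [(mid, old, new, new - old) for mid, old, new in rows]


if __name__ == "__main__":
    demo = build_demo_db()
    for row in compare_snapshots(demo, "2024-06-30", "2024-12-31"):
        print(row)
```

The self-join on `master_id` with one `as_of_date` per side mirrors the documented "1-to-many over time" relationship: each data center has one fact row per release, so pairing two releases yields one comparison row per data center present in both.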