About This Project
Overview
This analytics platform is built on real CMS Medicare Part D public use files, not synthetic data. It demonstrates the full data architecture, engineering, governance, and BI delivery patterns used in pharmacy and healthcare analytics.
The platform processes 26.8 million prescription drug event records, filters to pharmacy-relevant therapeutic areas, and delivers governed analytics across four business unit domains.
Architecture
Data Sources
| Source | Rows | Description |
|---|---|---|
| Part D Spending by Drug | 60,478 | Drug-level spending, 2019-2023 (5 years unpivoted) |
| Prescribers by Provider | 676,186 | Per-prescriber summary, 11 target specialties |
| Prescribers by Drug | 5,451,672 | Prescriber-drug detail, filtered to target therapeutic areas |
dbt Project
23 models across 4 layers, 48 tests (47 pass, 1 warn), 1 SCD Type 2 snapshot.
Mart domains map to business units:
| Domain | Models | Key KPIs |
|---|---|---|
| Commercial | 3 | Spending trends, therapeutic area mix, Pareto analysis |
| Customer Success | 3 | Engagement tiers, specialty adoption, retention cohorts |
| Pharmacy Operations | 4 | Cost/claim, fill volumes, day supply, brand/generic ratio |
| Enterprise Customers | 3 | Manufacturer performance, TA KPIs, geographic distribution |
| Executive | 2 | KPI scorecard, cross-BU summary |
| Data Quality | 2 | Completeness scorecard, anomaly detection |
Governance
- KPI Dictionary: 16 metrics with formula, grain, exclusions, and owner
- RBAC Model: 6 roles with Snowflake, Power BI, BigQuery, and Databricks RLS
- PII Handling Policy: HIPAA adaptation guide, retention schedule
BI Tool Integration
- LookML model with explores, views, measures, and access filters
- Tableau connection spec with published data sources and extract schedule
- Evidence dashboard (this site) querying dbt marts directly
Source Code
github.com/nicholasjh-work/pharma-ops-analytics
Built by Nicholas Hidalgo | LinkedIn | analytics@nicholashidalgo.com
