System Overview
Architecture Overview
Section titled “Architecture Overview”The Blue Compass RV data pipeline is designed to centralize data ownership, ensure comprehensive data collection, and enable rapid analysis and personalization.
High-Level Data Flow
Section titled “High-Level Data Flow”The pipeline follows a linear flow from data generation to activation:
- Event Tracking: User interactions are captured via Cloudflare, Tag Manager, and Rudderstack.
- Extract & Load: Data is ingested from various sources (Ads, CRM, etc.) using Airbyte and stored in the data lake.
- Transform & Store: Raw data is processed and modeled in Databricks, creating a clean and reliable source of truth.
- Visualize & Activate: Modeled data is consumed by Power BI for reporting and Braze for customer engagement.
Detailed System Map
Section titled “Detailed System Map”This detailed view illustrates the specific components and their interactions:
- Sources: DigitalOcean (API Proxy), Airbyte (Ad Networks), and Cloudflare/Rudderstack (Web Events).
- Storage: Data is staged in ADLS/S3 buckets before being processed.
- Processing: Databricks handles the heavy lifting, moving data through Raw, Cleaned, and Production catalogs.
- Identity: Rudderstack resolves user identities across devices and sessions.
- Destinations: The final output powers business intelligence dashboards and marketing automation platforms.