P2 Data Room & Analytics Engine
Enterprise-grade loan portfolio analytics infrastructure implemented in Excel, serving as a complete blueprint for future database and data warehouse migration.
Core Functions
- β Monthly DR Tape generation for investor reporting
- β Real-time loan status classification (Performing/Collections/DQ60+)
- β Automated UPB reconciliation with source data
- β 30/360 interest accrual calculations
- β Days Past Due (DPD) tracking & delinquency analysis
- β Historical metrics & trend analysis (12 months)
- β Multi-servicer data aggregation (Rogue + PorchPass)
Technical Achievements
- β Single-input period changes (Config β All Sheets)
- β Automated validation flag generation
- β Cross-platform data matching (INDEX/MATCH)
- β Rule-based status determination engine
- β Exception handling & error detection
- β Complete audit trail via formula transparency
- β Scalable architecture (400 β 10,000+ loans)
π₯ Data Sources
- β’ Madison Portal (Servicer)
- β’ Rogue Transaction History
- β’ PorchPass Transaction History
- β’ P2 Origination Data
- β’ 17 New Fields Supplemental
β‘ Processing Engine
- β’ Trial Balance Processor
- β’ Loan Sale Data Calculator
- β’ Historical Metrics Aggregator
- β’ Validation Rules Engine
- β’ Calendar Analysis Module
π€ Output Layers
- β’ MAIC Analytics Dashboard
- β’ Servicing Dashboard
- β’ QA Dashboard
- β’ Visual Dashboard (Charts)
- β’ Loan Detail Viewer
System Metrics & Statistics
Comprehensive breakdown of the data infrastructure's scale, complexity, and processing capabilities.
| Module (Sheet) | Dimensions | Formulas | Values | Layer Type |
|---|---|---|---|---|
| Trial Balance_Nov2025 | 400 Γ 68 | 22,742 | 2,454 | PROCESSING |
| Loan Sale Data | 432 Γ 74 | 14,334 | 8,839 | PROCESSING |
| Rogue | 3,274 Γ 17 | 13,068 | 36,359 | SOURCE |
| PorchPass | 2,768 Γ 17 | 6,811 | 6,402 | SOURCE |
| P2 Originated Serviced | 399 Γ 76 | 797 | 25,571 | SOURCE |
| Historical_Metrics | 132 Γ 23 | 349 | 351 | PROCESSING |
| MAIC Analytics | 709 Γ 34 | 532 | 434 | OUTPUT |
| Config | 166 Γ 32 | 29 | 336 | CONFIG |
Systems Architecture
High-level architectural overview of the Data Room & Analytics Engine, mapping Excel components to enterprise data platform patterns.
Enterprise Data Platform Architecture (Excel Proxy)
π Key Architectural Patterns
- Separation of Concerns: Config, Source, Processing, Output layers
- Single Source of Truth: Config sheet controls all calculations
- Referential Integrity: INDEX/MATCH cross-sheet linking
- Immutable Inputs: Source sheets preserve raw data
- Computed Outputs: All derived values via formulas
β‘ Enterprise Platform Equivalents
| Config Sheet | β | Parameter Store / Config DB |
| Source Sheets | β | Data Lake / Raw Zone |
| Trial Balance | β | ETL / dbt Transformations |
| Historical_Metrics | β | Data Warehouse / OLAP |
| Dashboards | β | BI Tools (Tableau/Looker) |
Data Flow Pipeline Architecture
End-to-end data lifecycle from external sources through validation to final outputs.
- Madison Portal CSV export
- Rogue servicer transaction data
- PorchPass internal records
- Supplemental loan attributes
- Raw data paste to source sheets
- Schema preservation
- Timestamp tracking
- Source identification
- INDEX/MATCH data linking
- Transaction aggregation
- Interest calculations (30/360)
- DPD computation
- UPB reconciliation
- Date validation rules
- Status matching
- Flag generation
- DR Tape generation
- MAIC reporting
- Executive dashboards
- QA exception reports
Engine Components
Detailed breakdown of all 14 interconnected modules organized by functional layer.
π₯ Source Data Layer
π Processing Layer
π€ Output & Presentation Layer
Data Lineage & Dependencies
Complete mapping of how data flows between modules, enabling impact analysis and change management.
Data Dependency Graph
6 consumers
5 consumers
5 consumers
2 consumers
1 consumer
14,334 formulas
22,742 formulas
| Source Module | Target Module | Reference Count | Relationship |
|---|---|---|---|
| Loan Sale Data | Trial Balance | 9,974 | Primary data linkage |
| Rogue | Loan Sale Data | 3,980 | Transaction aggregation |
| PorchPass | Loan Sale Data | 3,980 | Transaction aggregation |
| Config | Trial Balance | 1,596 | Parameter injection |
| Historical_Metrics | Visual_Dashboard | 99 | Chart data source |
Entity Relationship Diagram
Logical data model representing the core entities, their attributes, and relationships within the system.
π Key Relationships
LOAN 1:N TRANSACTIONEach loan has many transactions
LOAN 1:N MONTHLY_SNAPSHOTMonthly DR Tape records per loan
MONTHLY_SNAPSHOT 1:N VALIDATION_FLAGFlags generated per snapshot
LOAN 1:1 PROPERTYCollateral property details
Validation Pipeline Architecture
Comprehensive data quality, control, and assurance framework with automated flag generation.
- Prior month DR Tape
- Madison export import
- Servicer transaction load
- Schema validation
- Referential integrity
- Range validations
- UPB reconciliation
- Interest calculations
- Status determination
- UPB_MISMATCH
- BAD_DUE_DATE
- STATUS_MISMATCH
- Clean DR Tape data
- Flag summary reports
- QA dashboards
| Validation Rule | Logic | Flag If | Action Required |
|---|---|---|---|
| UPB Reconciliation | End UPB = Beg UPB - Principal | |Diff| > $0.01 | Check Madison for curtailments |
| Due Date Validation | Payment Due must be 1st or 15th | Day β 1 or 15 | Check Madison, correct dates |
| Interest Accrual | AUTO vs MANUAL interest | |Diff| > $1.00 | Recalculate 30/360 |
| Loan Status Match | AUTO vs MANUAL status | Statuses β | Review DPD calculation |
| Payment Completeness | Actual vs Expected payment | Diff exists | Check escrow/partial |
| BOM vs Prior EOM | Beg UPB = Prior End UPB | |Diff| > $0.01 | Investigate changes |
Business Rules Engine
Complete documentation of servicing business rules, calculations, and status determination logic.
| Performing | DPD 0-29 days |
| Collections | DPD 30-59 days |
| DQ60+ | DPD 60+ days |
| Paid Off | UPB = $0 |
Method: 30/360 day count convention
Formula: Interest = Principal Γ (Rate/100) / 360 Γ Days
Accrual Days: Based on last payment due date
| Parameter | Value | Type | Description |
|---|---|---|---|
| Report_Month | November | Text | Current reporting period |
| Report_Year | 2025 | Number | Reporting year |
| Report_EOM | 2025-11-30 | Date | End of month for calculations |
| Grace_Period_Days | 14 | Number | Days before late fee |
| Collections_Threshold | 30 | Number | DPD for Collections status |
| DQ60_Threshold | 60 | Number | DPD for DQ60+ status |
| Interest_Calc_Method | 30/360 | Text | Interest convention |
| Valid_Due_Days | 1, 15 | Text | Valid payment due days |
SQL Schema Proxy
Database-ready schema definitions derived from the Excel structure for enterprise migration.
π Excel β SQL Mapping
| INDEX/MATCH | β | JOIN + WHERE |
| SUMIFS | β | SUM() + GROUP BY |
| COUNTIFS | β | COUNT() + FILTER |
| IFERROR | β | COALESCE / NULLIF |
| IFS | β | CASE WHEN |
ποΈ Table Counts
| loans | ~398 rows | Master data |
| transactions | ~6,000 rows | Payment history |
| monthly_snapshots | ~4,800 rows | 12 months Γ 400 |
| validation_flags | ~200 rows | Exceptions |
Enterprise Blueprint
Comprehensive migration roadmap from Excel-based system to enterprise data platform.
π Current State (Excel)
- β 14 interconnected worksheets
- β 60,481 formula calculations
- β Manual data ingestion (copy/paste)
- β Single-file architecture
- β Local execution only
- β Limited concurrent access
- β Manual backup/versioning
π Target State (Enterprise)
- β PostgreSQL + Data Warehouse
- β dbt transformations
- β Automated ETL pipelines
- β Microservices architecture
- β Cloud-native deployment
- β Multi-user concurrent access
- β REST/GraphQL API layer
- Schema migration to PostgreSQL
- Initial data load scripts
- Excel β DB validation
- dbt transformation layer
- Automated ETL pipelines
- Data warehouse setup
- BI dashboard migration
- API development
- User acceptance testing
- Production cutover
- Excel deprecation
- Documentation complete
Value Stream Map
End-to-end process flow comparing current and future state operations.
π Current State Value Stream
π Future State Value Stream
π Key Value Delivered
Technical Achievements
- β 154,307 total data points managed
- β 60,481 automated calculations
- β 12 months historical coverage
- β <72 hours system build time
- β <5 minutes monthly execution
Business Impact
- β $210K+ annual cost avoidance
- β Eliminated manual calculation errors
- β Real-time delinquency visibility
- β Board-ready investor reports
- β Scalable to 10,000+ loans
System Summary
This Data Room & Analytics Engine represents an enterprise-grade loan portfolio analytics infrastructure built in Excel. It serves as both a fully functional production system and a complete architectural blueprint for future database and data warehouse migration.