Study notes Β· Data Modeling
Banking Data Models β 12 Approaches, 4 Eras
I've been reading my way through how banking data has been modeled β from Inmon's 1992 enterprise data warehouse to 2024's source-centric lakehouse with semantic Gold. This post is my notes: 12 approaches, with diagrams, trade-offs, and a banking-domain example for each.
May 2026 Β· ~35 min read Β· learning in public
Heads up β this is a learning exercise, not a war story. I haven't shipped all 12 of these. I worked through papers, vendor docs, and case studies, and these are the notes I wish I had when I started reading.
The map: how data has been modeled in banking from 1992 to 2026. Twelve approaches, four eras. Each one gets origin, core idea, a diagram, pros and cons, and a note on what it's still good for today.
The four eras: Classical (top-down EDW + dimensional + canonical CDM), Reference Standards (BIAN + ISO 20022 + FIBO), Modern Lakehouse (Vault + Medallion + Mesh), and AI-Era (Activity Schema + Source-centric + Hybrid). Each era solved a different set of constraints; matching your constraints to one is the only way to pick well.
At the end there's a comparison matrix scoring all 12 on dev velocity, AI readiness, maintainability, extensibility, regulatory fit, and adoption β plus an open-ended closing on what I'm still trying to figure out.
The Era of "Build a CDM"
Storage was expensive, compute happened on a single machine, integration ran on ESBs with rigid contracts, and schema evolution was a project not a PR. The dominant question: "where do we put the canonical?". Three answers emerged.
Inmon's CIF β Top-Down 3NF Enterprise Data Warehouse
Philosophy
Rationale
Inmon's Corporate Information Factory was the foundational pattern of enterprise data warehousing. The proposition: build one corporate-wide third-normal-form (3NF) data warehouse first β every entity, every attribute, every relationship β and then create dependent data marts (often dimensional) on top for specific reporting needs.
β Strengths
- Single source of truth, integrity guaranteed by 3NF
- Marts derive consistently β no semantic drift
- Audit + lineage natural (one canonical place)
- Regulatory reporting (BCBS 239) lineage clean
β Weaknesses
- Heavy upfront β 12-24 months before first business value
- 3NF is consumer-unfriendly β every query needs joins
- Schema changes ripple through marts
- Doesn't fit "ship a use-case in a sprint" expectations
Banking exampleMost large banks 1995β2010 built CIF-style EDWs on Teradata or Oracle (Citi, HSBC, Wells Fargo are commonly cited). Local-market banks that started warehousing 2008β2015 typically followed the same template with localization for regulatory reporting.
When to use todayGreenfield: rarely. The upfront cost no longer matches expectations. Migration: still relevant when the existing EDW is 3NF and you're modernizing layer-by-layer. Conceptual: the discipline (single-source-of-truth) is alive in MDM hubs and contracts.
Kimball's Dimensional β Bottom-Up Star Schemas with Conformed Dimensions
Philosophy
Rationale
Kimball's response to Inmon: instead of modeling everything upfront, build one fact table per business process (sales, transactions, applications) surrounded by conformed dimensions (customer, product, date) that are reused across processes. Star schemas optimized for OLAP queries.
β Strengths
- Fast delivery β first mart in 4-8 weeks
- Query-friendly β joins are simple, BI tools love it
- Conformed dimensions enforce reuse
- Business users can self-serve once dimensions are clean
β Weaknesses
- Cross-process queries can be awkward (need conformed dims discipline)
- Granularity decisions are hard to reverse
- Doesn't preserve full transactional history without bridge tables
- Star schemas don't naturally support graph/event queries
Banking examplePower BI / Tableau / SAS / SAP BO report layers in virtually every bank. Even when the silver layer is 3NF or Vault, the consumption layer is usually dimensional β the discipline survives every storage shift.
When to use todayFor BI reporting layer on top of any silver. The dimensional discipline (conformed dim, granularity, slowly-changing) is timeless even though the underlying storage is now lakehouse parquet, not Teradata cubes.
IBM BDW / FSLDM β Banking-Specific 3NF Canonical
Philosophy
Rationale
If Inmon's CIF says "build a 3NF EDW", IBM BDW (and its siblings Teradata FSLDM and Oracle FSDM) say "here is the 3NF β banking edition, off-the-shelf". A pre-defined logical data model with ~10 super-types (Party, Arrangement, Account, Event, Position, Product, Asset, Channel, Resource, Location, Classification) covering every banking concept. Decades of industry consensus baked in.
β Strengths
- Off-the-shelf banking semantics β no need to invent
- Vendor-supported (IBM, Teradata, Oracle ship reference doc)
- Regulator-friendly β well-known structure for audit
- Cross-product semantics already harmonized
β Weaknesses
- Heavy β 80-200 entities even at minimum
- Consumer-unfriendly β every query joins 5-7 tables
- Slow to onboard new sources (must map into canonical)
- Schema evolution painful β change ripples across all consumers
Banking exampleHSBC, Citi, Standard Chartered, ING β commonly cited as having built FSLDM-style canonical warehouses 2000β2015. Mid-tier banks in many emerging markets adopted the same templates as their warehouse foundations because vendor support + regulator familiarity removed two big risks at once.
When to use todayGreenfield: rarely. The 18-month canonical-modeling phase no longer matches dev velocity expectations. Migration: still relevant when modernizing an existing FSLDM warehouse β you can keep canonical at silver and add Gold on top (Modern A pattern). Conceptual: the entity vocabulary (Party, Arrangement, Event) is still useful as glossary even if not as physical schema.
The Era of "Standardize the Wire, Not the Warehouse"
Once SOA matured and cross-bank integration grew, the question shifted from "what does our warehouse look like" to "what does the wire between systems look like". Three industry standards emerged β none are warehouse schemas, all are interoperability layers. Banks need to understand them because data lands in the warehouse shaped by these standards.
BIAN β Banking Industry Architecture Network Β· Service Domain Reference
Philosophy
Rationale
BIAN is not a data model β it's a service-oriented reference architecture. It decomposes a bank into ~300 service domains (Customer Reference, Customer Position, Loan, Payment Order, Card Capture, Fraud Resolution, β¦), each with defined responsibilities, control records, and operations. When two systems integrate, BIAN gives them a shared vocabulary for what services should exist and what data they exchange.
β Strengths
- Vendor-neutral β Temenos, FIS, Finacle, Oracle all map to BIAN
- Service-oriented β perfect for microservice boundaries
- Industry consensus β pre-built taxonomy of bank capabilities
- Pairs well with API-first design (REST / GraphQL contracts)
β Weaknesses
- Not a data model β doesn't tell you how to store data
- Heavy taxonomy β 300 domains is too many for small banks
- Implementation varies β domain naming is consistent but data format is not
- Doesn't replace the warehouse design question
Banking exampleING used BIAN to redesign its IT estate around microservices. Deutsche Bank's API marketplace exposes services aligned to BIAN domains. Many vendors (Temenos, Finacle) advertise "BIAN-aligned" products, meaning their APIs map to BIAN service domains.
When to use todayFor API design and microservice boundaries β BIAN is the modern banking equivalent of "REST best practice". For warehouse modeling β pair it with another approach (FSLDM canonical, source-centric silver, etc.). BIAN tells you what services exist; you still choose how data is stored.
ISO 20022 β Universal Message Standard for Payments
Philosophy
Rationale
ISO 20022 is an XML/JSON-based message standard for financial messaging β payments, securities, FX, cards, trade. It defines a common business modeling methodology and a standard library of message types (e.g. pacs.008 = customer credit transfer, camt.053 = bank-to-customer statement, pain.001 = customer-initiated payment). By Nov 2025, all SWIFT cross-border payments must use ISO 20022 β every bank in the world is affected.
β Strengths
- Global standard β replaces SWIFT MT, used in SEPA, CHAPS, Fedwire, ISO 20022 RTP
- Rich structured data β replaces free-text MT103 fields
- Machine-readable XML/JSON
- Pre-built domain vocabulary β Party, Account, Amount, Settlement
β Weaknesses
- Verbose XML β payloads 5-10x SWIFT MT
- Not a warehouse schema β must flatten / shred for analytics
- Migration cost is real β every payment system, every reconciliation tool
- Different message types for different flows β need sub-domain expertise
Banking exampleSWIFT's CBPR+ initiative mandates ISO 20022 for cross-border payments by Nov 2025. Several national real-time payment systems (SEPA, CHAPS, Fedwire RTP, and a number of country-level RTGS) are aligned to ISO 20022. Every bank running cross-border treasury must onboard ISO 20022 by 2025 β no opt-out.
When to use todayMandatory for payment systems by 2025. Use the ISO 20022 message vocabulary as input to your data warehouse (don't store raw XML β shred into structured silver tables). Don't use ISO 20022 schemas as warehouse schemas β too verbose, too message-shaped.
FIBO β Financial Industry Business Ontology
Philosophy
Rationale
FIBO is an OWL/RDF ontology β formal, machine-readable definitions of financial concepts (Party, Account, Loan, Derivative, Security, Bond, Counterpartyβ¦) and their relationships, expressed in description logic. Where FSLDM gives you a relational schema, FIBO gives you a concept graph with formal semantics.
β Strengths
- Formal semantics β disambiguates "what is a Loan"
- Machine-readable β reasoners can infer relationships
- Regulatory adoption β ECB, FRB use FIBO for definitions
- Aligns with knowledge graph + LLM agent retrieval
β Weaknesses
- Heavy tooling β needs OWL reasoners, RDF stores
- SPARQL queries unfamiliar to most data engineers
- Performance β graph queries don't scale like SQL on lakehouse
- Mostly definition layer β not where you store actual data
Banking exampleEuropean Central Bank uses FIBO for regulatory data dictionaries. Some banks (Wells Fargo, JPMorgan) maintain FIBO-aligned glossaries linked to physical schemas. BCBS 239 implementations frequently reference FIBO for traceable definitions.
When to use todayAs a glossary backbone β define your business terms by reference to FIBO concepts. As a knowledge graph layer β when you want LLM agents to reason over definitions. Don't use FIBO as your physical warehouse schema; use it as the semantic anchor on top of whatever physical model you choose.
The Era of "Cheap Storage Changes Everything"
When S3 cost $0.023/GB/month and Spark/Delta turned the data lake into a queryable warehouse, three new patterns emerged. None of them assume canonical 3NF β they assume cheap storage, parallel compute, and that schema can change with a PR.
Data Vault 2.0 β Hub / Link / Satellite
Philosophy
Rationale
Data Vault 2.0 splits warehouse modeling into three table types: Hub (one row per business key β e.g., customer_id), Link (relationships between hubs β e.g., customer β account), and Satellite (attributes and history, append-only). Every change is a new satellite row, never an update. Sources stay separate: sat_customer_t24, sat_customer_crm sit side by side under one hub_customer.
β Strengths
- Append-only β perfect audit trail for regulators
- Source-aware β multiple satellites per hub, sources never collide
- Parallel loadable β every hub/link/sat can load independently
- Late-binding semantics β conform at PIT view, not at silver
β Weaknesses
- Consumer queries are very complex β every read needs PIT joins
- 3-5x more tables than 3NF for the same domain
- Tooling immature outside specialist vendors (Wherescape, Vaultspeed)
- Steep learning curve β most engineers don't think in HLS
Banking exampleMany European banks (Rabobank, Nordea, ABN AMRO) use Data Vault 2.0 for regulatory warehouses where audit trail is paramount. Several large insurance groups also adopted it. In Vietnam, less common β the learning curve and tooling cost are high.
When to use todayWhen audit / lineage / regulatory traceability is the dominant requirement (BCBS 239, IFRS 9 with full history). When you need parallel ingestion from many sources without schema collisions. Pair with a Gold layer for consumer queries β Vault is not a consumer interface.
Lakehouse Medallion β Bronze / Silver / Gold
Philosophy
Rationale
The Medallion is a layering convention, not a specific schema. Bronze = raw / 1:1 with sources. Silver = cleaned, conformed, sometimes joined. Gold = business-aggregated, query-ready. The genius of Medallion is that it doesn't prescribe what goes in silver or gold β it just gives you a vocabulary to organize your transformations. You can run FSLDM inside Medallion, or Vault, or source-centric, or all three.
β Strengths
- Universal vocabulary β every team understands bronze/silver/gold
- Storage-cheap β parquet/Delta on S3 fits any scale
- Tool-friendly β dbt, Spark, Databricks all native
- Combines with any specific approach (FSLDM, Vault, source-centric)
β Weaknesses
- Doesn't answer "what goes in silver" β still need a model
- Without discipline, gold balloons (every team builds their own)
- Layer boundaries fuzzy in practice
- Marketing-heavy β sometimes treated as a complete answer when it's just a container
Banking exampleMost modern banking lakehouses adopt Medallion: Capital One on Snowflake, JPMorgan on Databricks, and Goldman Sachs' internal lakehouse have all been publicly discussed. Banks setting up new Enterprise Data Platforms from 2023 onward almost uniformly use Medallion as the layering convention.
When to use todayAlways β as a layering convention. But pair it with a specific schema choice (FSLDM canonical for migration, source-centric for greenfield, Vault for audit-heavy). Medallion alone is not a complete data model.
Data Mesh β Domain-Owned Data Products
Philosophy
Rationale
Data Mesh is an organizational pattern: instead of a central data team owning all data, each domain (Customer, Loan, Card, Payment, Risk) owns its own data products. Each domain publishes data with contracts, SLAs, ownership. A central "platform" team provides self-serve infrastructure (lakehouse, catalog, contracts framework). Federated computational governance enforces interop without central modeling.
β Strengths
- Scales to large orgs (50+ domain teams) without central bottleneck
- Domain expertise embedded in data products
- Clear ownership β every product has a team accountable
- No central canonical model needed β contracts handle interop
β Weaknesses
- Heavy governance overhead β federated β free
- Requires mature platform team (often 10+ engineers)
- Tooling immature β contracts framework, federated governance still evolving
- Overkill for small/medium banks with 5-10 teams
Banking exampleJPMorgan Chase, ING, Roche (pharma but instructive) implemented mesh principles. The most successful banking implementations combine mesh organizationally with lakehouse infrastructure underneath. Few small/mid-tier banks adopt full mesh β the org structure isn't there.
When to use todayFor tier-1 banks with 50+ data teams and dedicated platform engineering. For mid-tier banks: borrow concepts (data-as-product, contracts) but skip the full federated governance. For small banks: not yet β too heavy.
The Era of "AI Is The Fourth Consumer"
After BI, ML, and APIs, LLM agents become the fourth class of consumer. Schemas have to be self-documenting, columns have to make sense without joins, and the semantic layer has to be machine-readable. Three patterns that bake these constraints in.
Activity Schema β Behavioral Event Stream
Philosophy
Rationale
Activity Schema collapses analytics into one wide event-stream table. Every business event (logged in, viewed account, deposited, applied for loan, paid, called support) becomes a row with customer_id, activity_name, ts, β¦. Metrics are computed via temporal SQL β first/last/count/before/after/between. Radically simple model; works because cheap storage and columnar engines handle the high cardinality.
β Strengths
- Schema evolution = add new activity_name (no DDL)
- Behavioral analytics natively β funnels, retention, cohorts
- AI/ML-friendly β features = temporal aggregations
- One mental model for all consumers
β Weaknesses
- Bad for snapshot/balance reporting (no native "as-of" view)
- JSON feature column requires shred/cast for typed access
- High cardinality activity table can grow huge
- Doesn't replace need for dimensional/relational data for some use cases
Banking exampleRevolut, Chime, Monzo, and other fintech-style neobanks reportedly favor activity-schema-style stores for behavioral analytics, growth marketing, and ML feature pipelines. Less common in traditional banks because regulatory reporting wants positions / balances, not events.
When to use todayFor behavioral analytics, growth metrics, ML feature engineering. As a complement to a relational/dimensional layer, not a replacement. Excellent for AI agents that need to reason about customer journeys.
Source-Centric Silver + Semantic Gold
Philosophy
Rationale
The synthesis I kept seeing recommended for greenfield 2026 builds. Silver mirrors each source 1:1 (silver_t24_*, silver_cards_*, silver_los_*) β append-only, SCD2, audit-ready. Gold is where semantic conformance happens: customer_360 joins all silver sources via the MDM hub at write time. The "common language" lives in glossary + semantic layer + MDM hub β three metadata artifacts, not a physical canonical schema.
β Strengths
- New source onboards in days β just add silver_X_* mirror
- Schema evolution isolated to one silver bucket
- AI-agent friendly β Gold is one semantic layer, columns self-document
- Glossary + semantic layer = "common language" without rigid schema
β Weaknesses
- Cross-source query at silver = impossible (must go to Gold)
- Gold builds compute-heavy β every snapshot rebuilds joins
- Pattern is new β fewer reference implementations to copy
- Semantic conformance bugs are real β late binding is harder to test
Banking exampleModern fintech-influenced banks (BBVA, Capital One have been publicly discussed; a growing number of digital banks fit the same profile) gravitate toward this pattern β they treat T24 / Cards / LOS as separate silver islands and build Gold per use case. The pattern fits naturally with dbt + Spark + Databricks.
When to use todayFor greenfield banking lakehouses in 2024+ where dev velocity + AI-readiness + extensibility are the priorities, and the org is OK paying compute at Gold for the simplicity at silver. Not the right fit if you need cross-source queries with sub-second latency at silver.
Hybrid AI-Native β Lakehouse + Feature Store + Vector Index + Knowledge Graph
Philosophy
Rationale
The most ambitious modern pattern: combine specialized stores for each workload, unified by a semantic layer. Lakehouse for batch analytics + Gold tables. Feature Store (Feast / Tecton / Databricks FS) for ML features with online + offline parity. Vector Index (FAISS / Pinecone / pgvector) for RAG over banking documents. Knowledge Graph (Neo4j / RDF) for entity resolution and reasoning over relationships. A central semantic layer + glossary unifies them.
β Strengths
- Best store for each workload β no compromises
- AI agent capabilities at the ceiling β RAG + reasoning + numbers
- Future-proof β supports new AI workload classes
- Each store has best-in-class tooling
β Weaknesses
- Operational complexity β 4 systems to keep in sync
- Data sync challenges β embedding refresh, graph rebuild, FS backfill
- Skills required across multiple specialized domains
- High cost β both engineering + infrastructure
Banking exampleJPMorgan, Goldman Sachs, BNP Paribas building this stack for fraud + anti-money-laundering + customer 360 chatbot. Cap One / Wells Fargo with mature ML platforms gravitating toward Feature Store + Lakehouse. Knowledge Graph for AML at HSBC.
When to use todayFor tier-1 banks with mature data + ML platform teams and AI as a strategic priority. Not for mid/small banks β the operational burden won't pay back. Start with Lakehouse + Feature Store; add Vector Index when you have RAG use cases; add Knowledge Graph last (most complex).
All 12 Approaches Side-by-Side
Six dimensions that matter for choosing a banking data model in 2026. Scores are subjective + based on observed adoption β calibrate to your context.
| Approach | Era | Dev velocity | AI readiness | Maintainability | Extensibility | Audit / regulatory | Banking adoption (VN) | Best-fit context |
|---|---|---|---|---|---|---|---|---|
| 01 Β· Inmon CIF | 1992 | Low | Low | Mid | Low | High | Legacy | Existing 3NF EDW migration |
| 02 Β· Kimball Star | 1996 | High | Mid | High | Mid | Mid | Universal | BI / reporting layer (always) |
| 03 Β· IBM BDW / FSLDM | 2000s | Low | Low | Low | Low | High | Tier-1 legacy | Migration; vocabulary reference |
| 04 Β· BIAN | 2008+ | Mid | Mid | High | High | High | Growing | API design, microservice boundaries |
| 05 Β· ISO 20022 | 2004+ | Mid | Mid | Mid | High | High | Mandatory | Payment messaging (must-have by 2025) |
| 06 Β· FIBO | 2010+ | Low | High | Mid | High | High | Rare | Glossary backbone, knowledge graph |
| 07 Β· Data Vault 2.0 | 2013+ | Mid | Mid | Mid | High | High | Rare in VN | Audit-heavy regulatory warehouse |
| 08 Β· Lakehouse Medallion | 2020+ | High | High | High | High | Mid | Default | Layering convention (always) |
| 09 Β· Data Mesh | 2019+ | Mid | Mid | High | High | Mid | Tier-1 only | Tier-1 with 50+ data teams |
| 10 Β· Activity Schema | 2020+ | High | High | High | High | Low | Fintechs | Behavioral analytics, ML features |
| 11 Β· Source-centric + Semantic Gold | 2024+ | High | High | High | High | High | Emerging | β Greenfield 2024+ recommended |
| 12 Β· Hybrid AI-Native | 2024+ | Mid | Highest | Low | High | High | Tier-1 frontier | Tier-1 with strategic AI mandate |
An open ending, not a verdict
Honestly? I'm still trying to figure out which of these is actually right for the next ten years. Twelve approaches sound like a lot of choice, but each one was a child of its era β its constraints, its hardware, its idea of who the consumer would be. The next era is being shaped right now by AI agents that ask different questions, expect different shapes, and care about different things than BI dashboards ever did.
The patterns that look strongest to me on paper β source-centric silver, semantic gold, lakehouse as the substrate, FIBO/BIAN/ISO as the shared language β feel like a reasonable bet for today. But "reasonable bet" is a long way from "this is the answer". RAG, agentic retrieval, feature stores wired into reasoning, knowledge graphs that aren't just nice diagrams β every one of these is moving fast enough that the model I sketch in 2026 may look quaint in 2028.
So this post ends where I am right now: still reading, still asking around, still unsure. If you've built one of these for real and have opinions on what survives the AI-native shift β or what I've completely missed β I'd love to hear it. I'll keep updating this as I learn.
Citations
These are the books, papers, and reference docs I worked through. Treat this as a reading list β most of the framing in this post traces back to one of these.
Era 1 Β· Classical
- Inmon, W. H. (2005). Building the Data Warehouse (4th ed.). Wiley. β The original CIF formulation.
- Kimball, R. & Ross, M. (2013). The Data Warehouse Toolkit: The Definitive Guide to Dimensional Modeling (3rd ed.). Wiley.
- IBM (n.d.). IBM Banking and Financial Markets Data Warehouse β General Information Manual. IBM Redbooks (various editions).
- Teradata. Financial Services Logical Data Model (FSLDM) β vendor reference documentation.
- Oracle. Financial Services Data Model (FSDM) β Oracle Financial Services reference docs.
Era 2 Β· Reference Standards
- BIAN. BIAN Service Landscape (latest version). bian.org/servicelandscape
- ISO. ISO 20022 Universal Financial Industry Message Scheme. iso20022.org
- SWIFT. CBPR+ (Cross-Border Payments and Reporting Plus) migration guidance. swift.com/standards/iso-20022
- EDM Council. Financial Industry Business Ontology (FIBO) Specification. spec.edmcouncil.org/fibo
- Basel Committee on Banking Supervision (2013). BCBS 239: Principles for effective risk data aggregation and risk reporting. Bank for International Settlements.
Era 3 Β· Modern Lakehouse
- Linstedt, D. & Olschimke, M. (2015). Building a Scalable Data Warehouse with Data Vault 2.0. Morgan Kaufmann.
- Databricks. The Medallion Lakehouse Architecture. databricks.com/glossary/medallion-architecture
- Armbrust, M., Ghodsi, A., Xin, R. & Zaharia, M. (2021). Lakehouse: A New Generation of Open Platforms that Unify Data Warehousing and Advanced Analytics. CIDR 2021.
- Dehghani, Z. (2022). Data Mesh: Delivering Data-Driven Value at Scale. O'Reilly Media.
- Dehghani, Z. (2019). How to Move Beyond a Monolithic Data Lake to a Distributed Data Mesh. martinfowler.com. martinfowler.com/articles/data-monolith-to-mesh.html
Era 4 Β· AI-Era
- Elsamadisi, A. (2020). The Activity Schema: A New Way to Model Data. Narrator blog & activityschema.com
- Feast contributors. Feast: Open Source Feature Store for Machine Learning. feast.dev
- Tecton. What is a Feature Store? tecton.ai/blog/what-is-a-feature-store
- Lewis, P., et al. (2020). Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks. NeurIPS 2020. (RAG paper.)
- Hogan, A., et al. (2021). Knowledge Graphs. ACM Computing Surveys, Vol. 54, No. 4. β Comprehensive survey.
- dbt Labs. dbt Semantic Layer documentation. docs.getdbt.com/docs/use-dbt-semantic-layer/dbt-sl
Cross-cutting & Practitioner Blogs
- Fowler, M. Patterns of Enterprise Application Architecture (2002) β for ESB / integration context.
- Hohpe, G. & Woolf, B. (2003). Enterprise Integration Patterns. Addison-Wesley.
- Capital One Tech. Data Lakehouse on Snowflake β public engineering blog posts (various authors, capitalone.com/tech).
- JPMorgan Chase. Fusion data platform β public talks and blog posts on their Databricks lakehouse.
- Goldman Sachs Engineering. Data Lake / Legend platform β engineering.goldmansachs.com.
- Microsoft Industry Cloud for Financial Services β reference architectures.
If you spot a missing or misattributed source, let me know β I'd like the citation list to be accurate.