DataHub Documentation
Welcome to the customer documentation for DataHub, the data governance and AI platform.
These pages are written for administrators, data stewards, and power users. If you are an engineer working on DataHub itself, the architectural decision records live in docs/adr/ of the source repository and are surfaced in-app at /internal-docs (development builds only).
Where to start
| If you want to… | Start here |
|---|---|
| Connect Databricks with per-user identity, Unity Catalog RLS, and per-user audit trails | Databricks: per-user OAuth (oauth_u2m) |
| Define a metric whose semantic model spans more than one physical table | Metrics: defining and joining tables |
How this site is organised
- Integrations — how to wire DataHub into your data stack (Databricks, …) and what you can expect from each connector.
- More sections will appear here as the product grows. The left-hand navigation is the source of truth.
Conventions
- A "When to choose this" section sits at the top of every feature page. Read it first to decide whether the feature applies to you at all.
- Setup sections separate one-time admin work from per-user steps so you can split the work across teams.
- Limitations are explicit. If a feature does not cover a case (e.g. embedded dashboards do not honour per-user RLS), it is called out before troubleshooting, not buried in it.
- Audit & compliance tables answer the security questions a CISO will ask before approving a rollout.
Feedback
If a page is missing, wrong, or unclear, tell your DataHub contact. The source Markdown lives in the product repo, and the customer-facing docs are updated continuously alongside the product.