Why ChartBook?#
Data science teams face a common challenge: analytics work is scattered across notebooks, scripts, and shared drives. Documentation is an afterthought. When someone needs to find existing analysis or understand how a dataset was created, they spend hours searching or rebuild from scratch.
ChartBook solves this by providing a centralized platform for cataloging and documenting data science work.
The Backstage Analogy#
If you’re familiar with Backstage, Spotify’s open-source developer platform, ChartBook serves a similar purpose for data science teams.
Backstage (Software Teams) |
ChartBook (Data Science Teams) |
|---|---|
Software catalog |
Pipeline catalog |
Service documentation |
Chart and dataframe documentation |
API references |
Dataset references |
Tech docs |
Analytics documentation |
Templates and scaffolding |
Pipeline templates |
Where Backstage helps software engineers discover microservices and APIs, ChartBook helps data scientists discover pipelines, datasets, and analytics.
Key Benefits#
Discovery#
Find existing work before starting new analysis. chartbook builds searchable documentation sites where team members can browse pipelines, charts, and datasets.
Documentation#
Documentation is generated from your pipeline configuration, not maintained separately. Define your charts and dataframes in a TOML file, and ChartBook builds a complete documentation website.
Governance#
Track data sources, licenses, and access permissions for every dataset. Know where your data comes from, who can access it, and what restrictions apply.
Reproducibility#
Each pipeline is self-contained with its code, configuration, and outputs. Version control your pipelines and regenerate documentation as work evolves.
Programmatic Access#
Load any cataloged dataset directly into your analysis:
from chartbook import data
df = data.load(pipeline="sales", dataframe="quarterly_summary")
Who It’s For#
Data science teams building recurring analytics and reports.
Analytics engineers maintaining data pipelines and transformations.
Research teams producing charts and datasets that need to be discoverable.
Organizations that need governance and traceability for their analytics work.
Project Types#
ChartBook supports two project types:
Pipeline — A single analytics pipeline with its own charts, dataframes, and documentation (
type = "pipeline")Catalog — A collection of multiple pipelines aggregated into a unified documentation site (
type = "catalog")
Start with a pipeline to document a single project, then create a catalog when you want to aggregate multiple pipelines into a searchable portal.
How It Works#
Organize your work into pipelines — Each pipeline contains code, data, and outputs
Configure with TOML — Define charts, dataframes, and metadata in
chartbook.tomlBuild documentation — Run
chartbook buildto generate an HTML siteShare and discover — Host the site internally for your team to browse
See the Getting Started guide to create your first pipeline.