Data Source: Representation of a given dataset, includes a connection to the source of the data, a metadata analysis and a profiling of its data.

Connector: Connection to a given data source. Does not copy data.

Data Catalog: Collection of Data Sources for a given project.

Data profile report: Dashboard containing visualizations of a given dataset, including but not limited to a univariate and multivariate analysis, correlation analysis, alerts, and more.

Lab: Development environment (IDE). The ones currently available are JupyterLab, VSCode and H2O Flow.

Synthesizers: Component that generates new data.

Anonymizer: Component that masks a given field.

Pipeline: Scalable flow that can be versioned and can be customized to run recurrently. Consists of one or many components, connected between themselves forming a parallel or sequential flow.

GUI: Graphical User Interface.

SDK: Software Development Kit.

API: Application Programming Interface.

RDBMS: Relational Database Management System.