Data Source: Representation of a given dataset. It includes a connection to the source of the data, a metadata analysis, and a profile of the data.
Connector: Connection to a given data source. Does not copy data.
Data Catalog: Collection of Data Sources for a given project.
Data profile report: Dashboard containing visualizations of a given dataset, including but not limited to univariate and multivariate analysis, correlation analysis, and alerts.
Lab: Development environment (IDE). The options currently available are JupyterLab, VS Code, and H2O Flow.
Synthesizer: Component that generates new data.
Anonymizer: Component that masks a given field.
Pipeline: Scalable flow that can be versioned and scheduled to run recurrently. Consists of one or more components connected to one another in a parallel or sequential flow.
GUI: Graphical User Interface.
SDK: Software Development Kit.
API: Application Programming Interface.
RDBMS: Relational Database Management System.
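
As an illustration of the Pipeline entry above, the following minimal sketch shows how components connected by dependencies can be grouped into stages, where stages run sequentially and components within a stage could run in parallel. It is not the platform's SDK: the component names (load, profile, anonymize, synthesize, report) and the helper execution_stages are hypothetical, and only the Python standard library is used.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+


def execution_stages(dependencies):
    """Group components into ordered stages.

    `dependencies` maps each component name to the set of components it
    depends on. Components in the same stage do not depend on each other,
    so they could run in parallel; the stages themselves run in sequence.
    """
    sorter = TopologicalSorter(dependencies)
    sorter.prepare()
    stages = []
    while sorter.is_active():
        ready = sorted(sorter.get_ready())  # sorted only for stable output
        stages.append(ready)
        sorter.done(*ready)
    return stages


# Hypothetical pipeline: each key lists the components it depends on.
pipeline = {
    "profile": {"load"},
    "anonymize": {"load"},
    "synthesize": {"anonymize"},
    "report": {"profile", "synthesize"},
}

stages = execution_stages(pipeline)
for number, stage in enumerate(stages, start=1):
    print(f"stage {number}: {', '.join(stage)}")
```

In this sketch, anonymize and profile land in the same stage because both depend only on load, which corresponds to the parallel part of the flow; the remaining components run one after another.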