7. Create Observation Tables
What is an Observation Table?
An Observation Table is a structured collection of historical data points that acts as the foundation for training datasets. By adding features, you can create Feature Tables that can be used to train and validate Machine Learning models.
Each data point represents a specific historical moment for a particular entity and may also include target values. Observation Tables are often utilized across experiments within the same use case, even if selected features and models vary.
How to create an Observation Table?
You can either upload an Observation Table from a parquet or csv file or create one from a Source Table or an existing observation table.
This guide explains how to configure Observation Tables from a Source Table, derive Observation Table from existing one and link them to our Credit Default context and use case.
We will first create two Observation Tables:
- Applications with Credit Default target: Credit Default Observations with information whether the loan default the next 6 months.
- 50K applications: Credit Default Observations for EDA .
Then we will derive three new Observation Tables:
- Applications up to Dec 2024: Credit Default Observations for training up to Dec 2024.
- Applications Q1 2025: Credit Default Observations for validation (Q1 2025).
- Applications Preview: 50 Credit Default Observations for Feature PREVIEW.
Step 1: Navigate to Observation Table Catalog¶
From the menu, navigate to the 'Formulate' section:
-
Select the Observation Table catalog.
-
Select the Use Case or Context for which you want to create observation tables.
Step 2: Create Observation Tables from a Source Table¶
- Click
.
-
Select 'Derive from Source Table' tab and click
-
In the Source Table listing, select OBSERVATIONS_WITH_TARGET under the DEMO_DATASETS database and the CREDIT_DEFAULT schema and click
.
-
Set the table as follows and click
to save the table:
- Name: "Applications with Credit Default target"
- Description: "Credit Default Observations with information whether the loan default the next 6 months."
- Purpose: Training
- Sample Rows: 0 (to disable sampling)
- Sampling Date Range: leave it blank as we will do time partitioning later
-
Columns to Include:
- Original Column Name: POINT_IN_TIME --> New Column Name: POINT_IN_TIME
- Original Column Name: SK_ID_CURR --> New Column Name: SK_ID_CURR
- Original Column Name: Loan_Default --> New Column Name: Loan_Default (as Target)
-
Select OBSERVATION_EDA_TABLE and click
.
-
Set the table as follows and click
to save the table:
- Name: "50K applications"
- Description: "Credit Default Observations for EDA."
- Purpose: EDA
- Sample Rows: 0 (to disable sampling)
- Sampling Date Range: leave it blank as we will do time partitioning later
-
Columns to Include:
- Original Column Name: POINT_IN_TIME --> New Column Name: POINT_IN_TIME
- Original Column Name: SK_ID_CURR --> New Column Name: SK_ID_CURR
- Original Column Name: Loan_Default --> New Column Name: Loan_Default (as Target)
Step 3: Derive Training and Validation tables¶
-
Navigate to the Observation Table catalog under the Formulate section:
-
Select the
Applications with Credit Default target
table and go theAbout
tab. Scroll down and click
-
Create Applications up to Dec 2024 and click
.
- Name: "Applications up to Dec 2024"
- Description: "Credit Default Observations for training up to Dec 2024."
- Purpose: Training
- Sample Rows: 0
- Sampling Date Range: April 1, 2019 - Jan 1, 2025
-
Select again the
Applications with Credit Default target
table and create Applications Q1 2025:- Name: "Applications Q1 2025"
- Description: "Credit Default Observations for validation (Q1 2025)"
- Purpose: Validation-Test
- Sample Rows: 0
- Sampling Date Range: January 1, 2025 - April 1, 2025
Step 4: Derive Preview table (Optional)¶
-
From the Observation Table catalog, select the
50K Applications
table and go theAbout
tab. Scroll down and click -
Create Applications Preview and click
.
- Name: "Applications Preview"
- Description: "Credit Default Observations for preview."
- Purpose: Preview
- Sample Rows: 50
Step 5: Check Observation Tables¶
Check successful registration by reviewing the Observation Table Catalog.
Step 6: Check the Use Case¶
-
Navigate to the Use Case Catalog (you may to refresh the browser) and select the "Loan Default by client" use case.
-
In the 'About' tab, review the linked Observation Tables.