7. Create Observation Tables
An Observation Table is a structured collection of historical data points that acts as the foundation for training datasets. By adding features, you can create Feature Tables that can be used to train and validate Machine Learning models.
Each data point represents a specific historical moment for a particular entity and may also include target values. Observation Tables are often utilized across experiments within the same use case, even if selected features and models vary.
This guide explains how to upload and configure Observation Tables in FeatureByte and link them to your context and use cases.
Step 1: Navigate to Observation Table Catalog¶
From the menu, navigate to the 'Formulate' section and select the Observation Table catalog.
Step 2: Upload Observation Table¶
- Click
- Select 'Upload file' tab and configure the table:
- Name: "In_Store_Customer_2023_10K"
- Purpose: "EDA" as we will use the table to run EDA analysis
- Primary Entity: "customer"
- Target: Leave blank
- Context: Leave blank
- CSV/Parquet: "In-Store Customer_2023_10K_sample.parquet" that you can download here.
Step 3: Link Observation Table to a Context¶
- Select the uploaded table from the catalog.
- Go to the 'About' tab and choose "In-Store Customer" from the Context dropdown menu.
Step 4: Create Observation Table with Target Values¶
Option A: Using the SDK Target¶
- Ensure you have created the target "CUSTOMER_More_than_1_purchase_next_7d" using the FeatureByte SDK.
- Open the 'About' tab, scroll down and click
- Select the target and new the new table, e.g., "Pre_Purchase_Customer_Activity_next_week_2023_10K".
Option B: Upload Observation Table with Target Values¶
If you don't have access to the FeatureByte SDK, follow these steps.
- Click
- Upload an Observation Table with pre-populated target values:
- Name: "Pre_Purchase_Customer_Activity_next_week_2023_10K_M"
- Purpose: "EDA" as we will use the table to run EDA analysis
- Primary Entity: "customer"
- Target: "CUSTOMER_More_than_1_purchase_next_7d_M"
- CSV/Parquet: "Pre_Purchase_Customer_Activity_next_week_2023_M.parquet" that you can download here.
Step 5: Check Observation Tables¶
Confirm successful registration by reviewing the Observation Table Catalog.
Step 6: Preview Observation Table¶
- Locate your observation table.
- Go to the 'Preview' tab and ensure the target column appears.
Step 7: Set Observation Table as the Default EDA Table for the Use Case¶
The Default EDA Table is used for feature analysis in the use case.
If you made the target using the SDK, follow these steps to set the Default EDA Table.
-
Navigate to the Use Case Catalog and select "Customer Activity Next Week before a purchase".
-
Set "Pre_Purchase_Customer_Activity_next_week_2023_10K" as the Default EDA Table.
Ensure, once completed, your setup resembles the image below.
If you uploaded an Observation Table with pre-populated target values, follow these steps to set the Default EDA Table.
- Select the uploaded table "Pre_Purchase_Customer_Activity_next_week_2023_10K_M" and link it to the "In-Store Customer" Context.
- Navigate to the Use Case Catalog and select "Customer Activity Next Week before a purchase (using the descriptive target)".
- Set "Pre_Purchase_Customer_Activity_next_week_2023_10K_M" as the Default EDA Table.