Skip to content

7. Create Observation Tables

An Observation Table is essentially a collection of historical data points that serve as a foundation for learning. Think of it as the backbone of a training dataset. Its primary role is to process and compute features, which then form the training data for Machine Learning models. For a given use case, the same Observation Table is often employed in multiple experiments. However, the specific features chosen and the Machine Learning models applied may vary between these experiments.

Step 1: Navigate to Observation Table Catalog

From the menu, go to the 'Formulate' section and access the Observation Table catalog.

Empty Observation Table Catalog

Step 2: Upload Observation Table

Click 'Create New Table'.

Image

Select 'Upload file'.

Our Observation Table can be set as follows:

  • Name: "In_Store_Customer_2023_10K"
  • Purpose: "EDA" as we will use the table to run EDA analysis
  • Primary Entity: "customer"
  • CSV/Parquet: "In-Store Customer_2023_10K_sample.parquet" that you can download here.

Name

  1. Select the table from the catalog.
  2. Go to the 'About' tab.
  3. Choose "In-Store Customer" in the Context dropdown menu.

Name

Step 4: Create Observation Table with Target Values

If you have successfully created the "CUSTOMER_More_than_1_purchase_next_7d" target using the FeatureByte SDK, follow these steps:

  1. In the same 'About' tab, scroll down and click "Compute New Observation Table With Target"
  2. Choose the specific target you intend to use
  3. Assign a name to your newly created table. For example, you can name it "Pre_Purchase_Customer_Activity_next_week_2023_10K".

Name

Name

If you haven't created the target using the SDK, no worries. We will upload an observation table with the pre-populated target.

Step 4 (manual): Upload Observation Table with Target Values

If you don't have access to the FeatureByte SDK, follow these steps.

Click 'Create New Table'.

Image

The Observation Table with Target can be set as follows:

  • Name: "Pre_Purchase_Customer_Activity_next_week_2023_10K_M"
  • Purpose: "EDA" as we will use the table to run EDA analysis
  • Primary Entity: "customer"
  • Target: "CUSTOMER_More_than_1_purchase_next_7d_M"
  • CSV/Parquet: "Pre_Purchase_Customer_Activity_next_week_2023_M.parquet" that you can download here.

Name

Step 5: Check Observation Tables

Verify the registration by checking the Observation Table Catalog.

Name

Step 6: Preview Observation Table

If you made the target using the SDK, follow these steps to preview the table. If not, follow them for "Pre_Purchase_Customer_Activity_next_week_2023_10K_manual_version":

  1. Find and select 'Pre_Purchase_Customer_Activity_next_week_2023_10K' in the catalog.
  2. Click on the 'Preview' tab.
  3. Check the target appears in the table.

Name

Step 7: Set Observation Table as the Default EDA Table for the Use Case

The Default EDA Table will be used to run EDA Analysis for the features generated for the use case.

If you made the target using the SDK

Follow these steps to set the Default EDA Table.

Navigate back to the 'About' tab.

Ensure table is associated with the use case of the target.

Name

Scroll down and click the Use Case hyperlink to navigate to the relevant Use Case.

Name

This should open a new tab in your browser. Scroll down and edit the Default EDA Table for the use case. If none is selected yet, it should be blank.

Name

Select "Pre_Purchase_Customer_Activity_next_week_2023_10K".

Name

Once completed, your setup should resemble the image below.

Name

If you created the target using the UI

Follow these steps to set the Default EDA Table.

Select "Pre_Purchase_Customer_Activity_next_week_2023_10K_M" and link the Observation Table to the "In-Store Customer" Context.

Name

Navigate to the Use Case Catalog and select "Customer Activity Next Week before a purchase (using the descriptive target)".

Set "Pre_Purchase_Customer_Activity_next_week_2023_10K_M" as the Default EDA Table.

Name

Verify the setting by checking the Observation Table Catalog. This may need a browser refresh.

Name