9. Create New Feature Lists and Models
In the previous tutorials, Ideation created two feature lists that are now available in the Catalog.
In this tutorial, you will learn several ways to create new Feature Lists:
- using the Feature List Builder
- using the Regularized SHAP-based Feature List Simplification
- deriving them from a model’s Feature Importance
The latter two methods support Key-Based Feature extraction from dictionary-style features, which can improve interpretability when working with nested or high-dimensional structures.
You’ll also learn how to train new models using these feature lists, or materialize features to train models outside FeatureByte.
Step 1: Review Existing Feature Lists¶
-
Navigate to the
Feature ListsCatalog under theExperimentsection of the menu. Confirm that the two feature lists Ideation previously created are listed.
-
Click on the feature list suggested during ideation.

-
Go to the
Featurestab to review the features in the list.
-
Open the
Themestab to identify signal types missing from the feature list. Once reviewed, close the window.
Step 2: Create a New Feature List Using the Feature List Builder¶
-
Add the feature list suggested by ideation to the Feature List Builder by clicking
.

-
Review the Builder’s suggestions by clicking
in the
section at the bottom of the Feature List Builder.

-
Click on
for the CLIENT/BUREAU/MOST FREQUENT theme to explore associated features. Review EDA results and add the feature using the
.

-
Review the feature list by clicking
and save it using
. Name it “1 + Ideated Features.”

-
Verify the new feature list appears in the Catalog.

Step 3: Use the Regularized SHAP-based Feature List Simplification¶
This technique produces a simple and interpretable Feature List through a two-step process:
- Template Model Training Train an XGBoost or LightGBM template model on nested training data to generate SHAP values.
- Regularized Linear Model Training Train a regularized linear model on nested validation data using the SHAP values from Step 1 as inputs. The regularization encourages sparsity, naturally reducing the feature set.
-
Click
for the feature list suggested by ideation.
-
Select
Applications up to Sept 2024as training table and theNCTsDE_LGB_classificationas the model template. Then, click
.
-
Once the task finishes (this may take some time), verify the new Feature List appears in the Catalog.

-
Click on the Feature List to access its details. Search for "when" to list Key-Based Features.

Step 4: Create a New Feature List from a Model¶
-
Navigate to
Leaderboardunder theExperimentmenu and configure the following:- Observation Table: Applications Q4 2024
- Type: Validation
- Metric: AUC

-
Click on the best-performing model and open its Feature Importance tab.

-
Select the Per Feature Key Panel. It may list close to 900 Feature Keys! Click
.
Set the Importance Threshold Percentage to 0.90.
This will generate new features derived from dictionary features and create a feature list composed of the top features and keys for the model.

-
Return to the Feature Lists Catalog to verify the new list appears. This should have led to more features than with the Regularized SHAP-based Feature List Simplification.

Step 5: Train New Models with the new Feature Lists¶
-
Click
to train a new model with the simplified feature List.

Configure your model as follows:
- Metric: area_under_curve
- Training Observation Table: Applications up to Sept 2024
- Validation Observation Table: Applications Q4 2024
- Model Template: NCTsDE_LGB_classification
You can review and edit parameters by clicking on them.

-
Click
. Do the same to train a new model with the feature List derived from the feature key importance.
-
Navigate to Tasks under the Manage menu to track model training progress.

-
Once training completes, verify the new model appears on the leaderboard.

Step 6: (Optional) Compute a Feature Table¶
If you want to train a model outside FeatureByte, you can compute a Feature Table using the same feature list.
-
Return to the Feature List Catalog.
-
Click
.

-
Select the Observation Table:
Applications with Credit Default targetand confirm by clicking
.

-
Once materialization completes, navigate to the
Feature TablesCatalog underExperimentto confirm creation.