featurebyte.FeatureList.preview¶
Description¶
Materializes a FeatureList using a small observation set of up to 50 rows. Unlike compute_historical_features, this method does not store partial aggregations (tiles) to speed up future computation. Instead, it computes the features on the fly, and should be used only for small observation sets for debugging or prototyping unsaved features.
The small observation set should combine historical points-in-time and key values of the primary entity from the feature list. Associated serving entities can also be utilized.
Parameters¶
- observation_set: DataFrame
Observation set DataFrame which combines historical points-in-time and values of the feature primary entity or its descendant (serving entities). The column containing the point-in-time values should be namedPOINT_IN_TIME
, while the columns representing entity values should be named using accepted serving names for the entity.
Returns¶
- Union[DataFrame, NoneType]
Materialized feature values. The returned DataFrame will have the same number of rows, and include all columns from the observation set.
Note: POINT_IN_TIME
values will be converted to UTC time.
Examples¶
Create a feature list with two features.
>>> features = fb.FeatureList([
... catalog.get_feature("InvoiceCount_60days"),
... catalog.get_feature("InvoiceAmountAvg_60days"),
... ], name="My new feature list")
Prepare observation set with POINT_IN_TIME and serving names columns.
>>> observation_set = pd.DataFrame({
... "POINT_IN_TIME": ["2022-06-01 00:00:00", "2022-06-02 00:00:00"],
... "GROCERYCUSTOMERGUID": [
... "a2828c3b-036c-4e2e-9bd6-30c9ee9a20e3",
... "ac479f28-e0ff-41a4-8e60-8678e670e80b",
... ],
... })
Preview the feature list with a small observation set.