featurebyte.SparkDetails¶
class SparkDetails(
*,
host: str="127.0.0.1",
port: int=10000,
http_path: str="cliservice",
use_http_transport: bool=False,
use_ssl: bool=False,
storage_type: StorageType,
storage_url: str,
storage_path: str,
catalog_name: str,
schema_name: str
)
Description¶
Model for details used to connect to a Spark data source.
Parameters¶
- host: str
  default: "127.0.0.1"
  The server where your Spark cluster is hosted.
- port: int
  default: 10000
  The port your Spark cluster is hosted on.
- http_path: str
  default: "cliservice"
  Spark compute resource URL.
- use_http_transport: bool
  default: False
  Whether to use HTTP as the transport layer. When False, Thrift is used.
- use_ssl: bool
  default: False
  Whether to use SSL. Only applicable if use_http_transport is set to True.
- storage_type: StorageType
  Storage type of the location where the feature store will be persisted.
- storage_url: str
  URL of the location where custom UDFs will be uploaded.
- storage_path: str
  Path from which data will be read. Note that this technically points to the same location as storage_url. However, the format that the warehouse accepts differs between the read path and the write path, so two separate fields are required.
- catalog_name: str
  The name of the catalog to use for creation of output tables.
- schema_name: str
  The name of the schema to use for creation of output tables.
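
Example¶
The parameters above can be put together as in the following minimal sketch. It is an illustration under stated assumptions, not an official usage example: the storage locations, catalog name, and schema name are hypothetical placeholders, "file" is assumed to be an accepted StorageType value, and transport_warnings and build_spark_details are hypothetical helpers written for this sketch (they are not part of featurebyte). The import of SparkDetails is done lazily so the sketch can be loaded even where featurebyte is not installed.

```python
from typing import Any, Dict, List

# Placeholder values (assumptions for illustration only); replace them with
# the settings of your own Spark cluster and storage location.
SPARK_KWARGS: Dict[str, Any] = {
    "host": "127.0.0.1",          # documented default
    "port": 10000,                # documented default
    "http_path": "cliservice",    # documented default
    "use_http_transport": False,  # documented default (Thrift transport)
    "use_ssl": False,             # documented default
    "storage_type": "file",       # assumed to be a valid StorageType value
    "storage_url": "/data/staging/featurebyte",          # hypothetical write location
    "storage_path": "file:///data/staging/featurebyte",  # hypothetical read location
    "catalog_name": "spark_catalog",                     # hypothetical
    "schema_name": "playground",                         # hypothetical
}


def transport_warnings(kwargs: Dict[str, Any]) -> List[str]:
    """Hypothetical helper: flag settings that the parameter notes say do not apply."""
    warnings: List[str] = []
    # Per the parameter notes, use_ssl is only applicable with HTTP transport.
    if kwargs.get("use_ssl", False) and not kwargs.get("use_http_transport", False):
        warnings.append("use_ssl only applies when use_http_transport is True")
    return warnings


def build_spark_details(kwargs: Dict[str, Any]):
    """Construct SparkDetails from keyword arguments (requires featurebyte)."""
    # Imported here so that the rest of the sketch loads without featurebyte.
    from featurebyte import SparkDetails

    return SparkDetails(**kwargs)
```

With featurebyte installed, build_spark_details(SPARK_KWARGS) would pass every field by keyword, which matches the keyword-only signature shown at the top of this page. Note that storage_url and storage_path are given different formats here only to illustrate that the two fields may differ; the formats your warehouse accepts may not match these placeholders.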