We end up with the following attributes for the parent and child components:
| | | | | --- | --- | --- |Parent attributes | Name | Type | Description | | Employee_id | Categorical | 78 levels | | order_date | Datetime | 1996-1998 | | required_date | Datetime | 1996-1998 | | shipped_date | Datetime | 1996-1998 | | ship_via | Categorical | 3 levels | | freight | Numerical | [0.02 - 1007] | | ship_region | Categorical | 19 levels | | ship_country | Categorical | 21 levels | | customer_city | Categorical | 70 levels | | customer_region | Categorical | 19 levels | | customer_country | Categorical | 21 levels | | order_lenght | Numerical | [1 - 22] | | | | | | --- | --- | --- |Table 3: Child attributes Name Type Description product_id Categorical 77 levels supplier_id Categorical 29 levels category_id Categorical 8 levels unit_price Numerical [2 - 263.5] quantity_per_unit Numerical [0 - 70]
All data is converted into a one-hot encoding, including datetime and numerical types (previously binned into 20 levels).