Exploratory data analysis to build customer profiles for each treadmill model — helping Aerofit's sales team recommend the right product to the right buyer.
Aerofit sells three treadmill models — KP281 (entry-level), KP481 (mid-range), and KP781 (premium). The fitness retail company wants to understand who buys what — so their sales team can make data-driven recommendations during purchase consultations.
| Feature | Type | Description |
|---|---|---|
| Product | Categorical | KP281 / KP481 / KP781 |
| Age | Numerical | Customer age in years |
| Gender | Categorical | Male / Female |
| Education | Numerical | Years of education |
| MaritalStatus | Categorical | Single / Partnered |
| Usage | Numerical | Planned weekly treadmill uses |
| Fitness | Numerical (1-5) | Self-rated fitness level |
| Income | Numerical | Annual income in USD |
| Miles | Numerical | Expected weekly miles |
# Conditional probability: P(Gender | Product) pd.crosstab(df['Product'], df['Gender'], normalize='index') # Mean stats per product df.groupby('Product')[['Age','Income','Fitness','Usage','Miles']].mean() # Output: # Age Income Fitness Usage Miles # KP281 28.5 46,400 3.0 3.3 82 # KP481 28.9 48,900 3.1 3.5 87 # KP781 29.1 58,500 4.2 4.8 166