Skip to content

Clustering/preprocessing/ritwik

This notebook includes exploratory data analysis and preprocessing steps to prepare a health and lifestyle dataset for future clustering. Key updates in this MR:

  • Visualized distributions of numeric and categorical features
  • Removed target variable (NObeyesdad) for unsupervised learning
  • Encoded categorical variables using Label and One-Hot Encoding
  • Applied StandardScaler to normalize all features
  • Dataset is now cleaned and scaled, ready for clustering

No clustering or modeling has been applied yet — this MR only covers the EDA and data preparation phase.

Merge request reports

Loading