Skip to content

Customer segmentation using unsupervised and supervised machine learning.

Notifications You must be signed in to change notification settings

Rachneet/Customer-Segmentation

Repository files navigation

Customer Segmentation for Arvato Financial Services

Introduction

Targeting potential customers is an essential task for companies. It helps boost their revenue and tailor their needs to cater to the right customers. Moreover, it helps them understand why particular segments of people do not use their services. In this project, we will study ways of preprocessing a high dimensional dataset nd prepare it for analysis with machine learning algorithms. We will use the power of machine learning to segment customers from a mail-order campaign, understand their demographics, and predict potential future customers.

Libraries Utilized

  1. Numpy
  2. Pandas
  3. Scikit-learn
  4. Matplotlib
  5. Plotly
  6. XGBoost
  7. Optuna
  8. Tqdm

File Descriptions

  1. Arvato Project Workbook.ipynb: The main jupyter notebook with the ETL pipeline, modelling and evaluation of data.
  2. helper_function.py: This file contains all the data cleaning functions used in the main notebook.
  3. preprocess.py: This file comprises the complete preprocessing function along with the functions that extract important features for our work.
  4. hyperparameter_tuning.py: This file contains the functions to tune the parameters of the machine learning model.
  5. visualizations.py: This file contains all the functions to plot the visualizations using plotly.

Medium Blogpost

For detailed explanation of the project, feel free to read the blog on Medium here.

Kaggle Competition

This project is a part of the kaggle competition. The competition can be accessed here. Although, the data for this project is not available publicly and can be accessed only through Udacity's Data Scientist Nanodegree.

Current leaderboard position: leaderboard

License, Acknowledgements and Copyright

We would like to thank Arvato Financial Services for providing us this data. Special thanks to Udacity to provide us with an interesting problem. All this work can be used without any restrictions.

About

Customer segmentation using unsupervised and supervised machine learning.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published