Name		Name	Last commit message	Last commit date
parent directory ..
1 Attribute Selection.pdf		1 Attribute Selection.pdf
2 DataSet Creation.pdf		2 DataSet Creation.pdf
3 Clustering.pdf		3 Clustering.pdf
4 Recommended Actions.pdf		4 Recommended Actions.pdf
Clustering.ipynb		Clustering.ipynb
README.md		README.md

README.md

Clustering

Our goal is to segment our user base so that we can get a high-level/abstract understanding of the different types of users we have.
This provides us managable bites/goals for us to market to our users.

Steps:

Attribute Selection:
Select which attributes to use for segmenting our user base.
Dataset Creating:
Join and Wrangle flamingo-data tables to get the attributes we decided on in the previous step.
Clustering:
Segmenting our created Dataset into 3 Clusters.
Note: The Values were all Normalised/Scaled before clustering for better understanding of results.
Conclusion:
Provides Recommendations to increase Revenue.

Working

This Jupyter Notebook contains all the working done above, step by step.
From reading files, to wrangling and joining, to normalising dataset and finally Clustering.

Note: This Notebook requries a Scala Kernel to run AND also needs the Apache Spark Libraries to be available to said Kernel.
Apache Toree recommended.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Clustering

Clustering

README.md

Clustering

Steps:

Working

Files

Clustering

Directory actions

More options

Directory actions

More options

Latest commit

History

Clustering

Folders and files

parent directory

README.md

Clustering

Steps:

Working