Final Project by Akhil Nair for STA6704 - Data Mining Methodologies II at UCF for Spring 2024. Breakdown:
- Objective: Predict customer churn using various features related to customer demographics, services, and account information.
- Dataset: Contains customer information such as gender, tenure, contract type, payment method, monthly charges, and churn status.
- Preprocessing: Categorical variables are encoded, and irrelevant columns like customerID are removed.
- Feature Selection: Techniques like Mutual Information and Recursive Feature Elimination (RFE) are used to identify the most important features for model training.
- Modeling: Logistic Regression is employed to rank features based on their importance for predicting customer churn, aiming to improve prediction accuracy.