The purpose of this project is to create a model using car characteristics to predict the The Manufacturer Suggested Retail Price (MSRP) in the United States.
In the case of car prices prediction, companies could use this model to determine the prices of new cars that they produce which will help them to set the most accurate prices for their cars based on the market value. As a result, optimal prices for cars could be set leading to better growth and outcomes for car manufacturers respectively. Based on existing data, the aim is to use machine learning algorithms to develop models for predicting car prices.
The price of a car depends on a lot of factors like the goodwill of the brand of the car, features of the car, horsepower and the mileage it gives and many more. Here are the list of features that can be used to predict The Manufacturer Suggested Retail Price (MSRP).
- MSRP: Target Variable
Feature | Descriptions |
---|---|
Make: | Name of the car Manufacturers |
Model: | Model of the car |
Year: | Year of Manufactur |
Engine Fuel Type: | Fuel type used in the car |
Engine HP: | The horsepower an engine produces in pounds |
Engine Cylinder: | Number of Cylinders in the Engine |
Transmission Type: | The Type of Transmission |
Driven_Wheels: | Types of drivetrain |
Number of Doors: | Number of Doors |
Market Category: | Market Category of the car |
Vehicle Size: | Vehicle size (Compact, Midsize or Large) |
Vehicle Style: | Vehicle Style of the car |
highway MPG: | Highway Mileage |
city mpg: | City Mileage |
Popularity: | Popularity of the car |
MSRP: | The Manufacturer Suggested Retail Price |
- Python: 3.9.12
- Pandas: 1.4.3
- Seaborn: 0.11.2
- sklearn: 1.1.2.
- Numpy: 1.21.5
This data contains 1000s of automobiles (each record represents an actual car) and their attributes. The target variable of interest is MSRP (manufacturer suggested price). The data was provided as a part of our CIS: 508 Assignment.