Machine Learning course 2021-2022.
Masters' Degree in Applied Mathematics,
Sapienza University of Rome.
Exam date: June 13th 2022
The provided dataset
consists of daily MSFT financial data from October 2013 to
February 2022 1.
These data are initially pre-processed, then used to define some prediction models.
Given day
Our target is to predict the highest price
reached by MSFT stock in each of these 10 days,
given information on day
See this notebook
for detailed analysis and results.
Some fetures of the dataset are highly correlated
to each other, and so redundant, since they don't
provide the model with any additional useful
information.
However, new input features (technical indicators)
can be created from these correlated features.
Here
the following indicators have been implemented:
-
Simple moving average.
-
MACD (Moving Average Convergence/Divergence).
-
Average true range
-
Average typical price.
-
CCI (Commodity Channel Index)
After feature selection and data transformation are performed, the following models are trained and evaluated by spitting the dataset for crossvalidation:
- Linear regressor
- Multilayer perceptron
- Regression trees
- Random forest
Footnotes
-
Data have been downloaded from https://www.marketwatch.com/investing/stock/msft/download-data. ↩