This repository contains the implementation of a comprehensive information integration system for high-frequency trading (HFT) in stock markets. Our approach integrates nested decision execution framework for HFT operations and LLM for extracting and analyzing financial sentiments from news and social media.
This project leverages the Qlib framework for high-frequency trading operations and integrates advanced sentiment analysis using FinLLAMA. By incorporating real-time sentiment data from news and social media, we aim to enhance the decision-making process and profitability in volatile stock markets.
To set up the environment, you need to install the required dependencies for both Qlib and FinLLAMA. Ensure you have Python 3.8.x.
- Clone the repository:
git clone https://github.com/Flemington7/SimonsAgent.git
- Install the required Python packages: (TBD)
(Will be updated with the required packages for both FinLLAMA and Qlib, now you can install the packages from the specific directories)
pip install -r requirements.txt
Tip
If you are using a Windows machine, you can install the version of the required packages using the following command:
pip install -r requirements-win.txt
(Will be updated with the required data preparation steps for both FinLLAMA and Qlib)
-
Sentiment Data: Prepare the sentiment analysis data by processing news and social media texts using FinLLAMA. Ensure that the data is in a format compatible with the finetuning process.
-
Financial Data: Download and prepare the financial data as required by Qlib. Follow the Qlib documentation for setting up data. As mentioned above, we will integrate the sentiment data with the financial data, and convert it into a format that suitable for backtesting.
Evaluate the performance of the trading strategies and the impact of sentiment analysis on trading decisions. Compare the results with and without sentiment data integration and the benchmark performance.
Set up and run backtests using Qlib with the integrated sentiment data:
python SimonsHFT/workflow.py backtest
W. Ye designed the workflow, deployed and fine-tuned the language models, developed the trading strategies, performed data analysis and wrote the manuscript.
H. Li conducted the related work review, forecast model selection, backtesting and wrote the poster.
J. Li conducted interference of LLM and integrated the sentiment analysis with traditional price-volume features.
All authors contributed to the design of the study.
This work was supported by Jiachen Wang, Wentao Ye, Rui Chen and Hanyu Wei, who provided valuable computational resources.
We also thank the Qlib project in Microsoft Research Asia for providing the open-source quantitative investment platform and the Llama project in Meta AI for providing the pre-trained language models.
This project is licensed under the terms of the Apache License 2.0. See the LICENSE file for details.