Chrome Extension
WeChat Mini Program
Use on ChatGLM

EarnHFT: Efficient Hierarchical Reinforcement Learning for High Frequency Trading

THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 13(2024)

Nanyang Technol Univ

Cited 23|Views78
Abstract
High-frequency trading (HFT) is using computer algorithms to make trading decisions in short time scales (e.g., second-level), which is widely used in the Cryptocurrency (Crypto) market, (e.g., Bitcoin). Reinforcement learning (RL) in financial research has shown stellar performance on many quantitative trading tasks. However, most methods focus on low-frequency trading, e.g., day-level, which cannot be directly applied to HFT because of two challenges. First, RL for HFT involves dealing with extremely long trajectories (e.g., 2.4 million steps per month), which is hard to optimize and evaluate. Second, the dramatic price fluctuations and market trend changes of Crypto make existing algorithms fail to maintain satisfactory performances. To tackle these challenges, we propose an Efficient hieArchical Reinforcement learNing method for High Frequency Trading (EarnHFT), a novel three-stage hierarchical RL framework for HFT. In stage I, we compute a Q-teacher, i.e., the optimal action value based on dynamic programming, for enhancing the performance and training efficiency of second level RL agents. In stage II, we construct a pool of diverse RL agents for different market trends, distinguished by return rates, where hundreds of RL agents are trained with different preferences of return rates and only a tiny fraction of them will be selected into the pool based on their profitability. In stage III, we train a minute-level router which dynamically picks a second-level agent from the pool to achieve stable performance across different markets. Through extensive experiments in various market trends on Crypto markets in a high-fidelity simulation trading environment, we demonstrate that EarnHFT significantly outperforms 6 state-of-art baselines in 6 popular financial criteria, exceeding the runner-up by 30% in profitability.
More
Translated text
Key words
Bid Rotation
PDF
Bibtex
AI Read Science
Video&Figures
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Data Disclaimer
The page data are from open Internet sources, cooperative publishers and automatic analysis results through AI technology. We do not make any commitments and guarantees for the validity, accuracy, correctness, reliability, completeness and timeliness of the page data. If you have any questions, please contact us by email: report@aminer.cn
Chat Paper

要点】:本文提出了一种高效分层强化学习方法EarnHFT,用于解决高频交易中长轨迹优化和评估困难以及数字货币市场价格波动带来的挑战,创新点在于采用了三级分层框架,提高了训练效率和交易性能。

方法】:EarnHFT方法包括三个阶段:首先,计算Q-教师,即基于动态规划的最优动作价值,以提高第二层RL代理的性能和训练效率;其次,构建具有不同市场趋势的多样化RL代理池,通过收益率区分市场趋势,根据盈利能力筛选RL代理;最后,训练分钟级路由器,动态选择代理池中的第二层代理,以实现不同市场间的稳定性能。

实验】:在多种市场趋势的高保真模拟交易环境中,对数字货币市场进行实验验证,EarnHFT在六项流行的金融指标上显著优于六个最先进基线,盈利能力方面比第二名高出30%。