metaorder
When AI Trading Agents Compete: Adverse Selection of Meta-Orders by Reinforcement Learning-Based Market Making
Jafree, Ali Raza, Jain, Konark, Firoozye, Nick
We investigate the mechanisms by which medium-frequency trading agents are adversely selected by opportunistic high-frequency traders. We use reinforcement learning (RL) within a Hawkes Limit Order Book (LOB) model in order to replicate the behaviours of high-frequency market makers. In contrast to the classical models with exogenous price impact assumptions, the Hawkes model accounts for endogenous price impact and other key properties of the market (Jain et al. 2024a). Given the real-world impracticalities of the market maker updating strategies for every event in the LOB, we formulate the high-frequency market making agent via an impulse control reinforcement learning framework (Jain et al. 2025). The RL used in the simulation utilises Proximal Policy Optimisation (PPO) and self-imitation learning. To replicate the adverse selection phenomenon, we test the RL agent trading against a medium frequency trader (MFT) executing a meta-order and demonstrate that, with training against the MFT meta-order execution agent, the RL market making agent learns to capitalise on the price drift induced by the meta-order. Recent empirical studies have shown that medium-frequency traders are increasingly subject to adverse selection by high-frequency trading agents. As high-frequency trading continues to proliferate across financial markets, the slippage costs incurred by medium-frequency traders are likely to increase over time. However, we do not observe that increased profits for the market making RL agent necessarily cause significantly increased slippages for the MFT agent.
- Europe > United Kingdom > England > Greater London > London (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)
Why is the estimation of metaorder impact with public market data so challenging?
Naviglio, Manuel, Bormetti, Giacomo, Campigli, Francesco, Rodikov, German, Lillo, Fabrizio
Transaction cost analysis is a fundamental aspect of financial trading and market impact is the main source of costs for medium and large sized investors [1]. Thus, estimating the potential impact and cost of a trading decision is important to assess its profitability. This is particularly true and challenging for metaorders, i.e. sequences of orders and trades executed gradually over a long time period and following a single investment decision. In fact, while there is a vast literature on estimating and modeling impact of individual trades (or orders) from public data, it is less clear if and how such models can be used to estimate the expected price trajectory of a metaorder and the associated impact cost. To this end, the industrial practice is to estimate market impact and the associated cost of a metaorder by using data on actual metaorder execution (for academic researches using this approach, see, for example, [2-5]). However this approach presents some pitfalls.
- Europe > Italy > Tuscany > Pisa Province > Pisa (0.04)
- Europe > Italy > Emilia-Romagna > Metropolitan City of Bologna > Bologna (0.04)
- Asia > Japan > Honshū > Chūbu > Ishikawa Prefecture > Kanazawa (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)