'Future state in Deep Q learning in stock trading
In Q-learning, Q(S,a)is updated by the Bellman equation. Q(S,a) = r + max(Q(S',a')) where S' is the future state. Let's say S denotes the price of the stock, does it mean we are using future information (S') in the Q-learning process? After the training process and bringing into implementation, how is it possible to apply the strategy without knowing the future price (the future state)?
Sources
This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.
Source: Stack Overflow
| Solution | Source |
|---|
