Is the basic actor-critic approach from the Sutton & Barto book only for continuous action spaces?

I went through some literature and tried to implement the basic actor-critic network from Sutton & Barto, with an entropy term and the TD error, on a large discrete action space. But it is not performing well.
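To make the setup concrete, here is a minimal sketch of the kind of update I mean, assuming a tabular softmax policy over the discrete actions instead of a neural network. The function name `actor_critic_step`, the learning rates, and the entropy weight `beta` are illustrative assumptions, not values from the book, and the sketch leaves out the book's discount accumulator for brevity.

```python
import numpy as np

def softmax(logits):
    # Subtract the max for numerical stability.
    z = logits - logits.max()
    e = np.exp(z)
    return e / e.sum()

def actor_critic_step(theta, v, s, a, r, s_next, done,
                      gamma=0.99, alpha_actor=0.1, alpha_critic=0.1, beta=0.01):
    """One-step actor-critic update with an entropy bonus (illustrative sketch).

    theta: (n_states, n_actions) policy logits, updated in place.
    v:     (n_states,) state-value estimates, updated in place.
    Returns the TD error for this transition.
    """
    pi = softmax(theta[s])

    # TD error: delta = r + gamma * V(s') - V(s), with V(terminal) = 0.
    target = r + (0.0 if done else gamma * v[s_next])
    delta = target - v[s]

    # Critic: move V(s) toward the TD target.
    v[s] += alpha_critic * delta

    # Gradient of log pi(a|s) w.r.t. the logits: onehot(a) - pi.
    grad_logp = -pi.copy()
    grad_logp[a] += 1.0

    # Gradient of the entropy H = -sum_k pi_k log pi_k w.r.t. the logits:
    # dH/dtheta_k = -pi_k * (log pi_k + H).
    log_pi = np.log(pi + 1e-12)
    entropy = -(pi * log_pi).sum()
    grad_entropy = -pi * (log_pi + entropy)

    # Actor: policy-gradient step weighted by the TD error, plus entropy bonus.
    theta[s] += alpha_actor * (delta * grad_logp + beta * grad_entropy)
    return delta

# Example transition: 2 states, 3 discrete actions.
theta = np.zeros((2, 3))
v = np.zeros(2)
delta = actor_critic_step(theta, v, s=0, a=1, r=1.0, s_next=1, done=False)
```

The softmax policy in this sketch is defined directly over a finite set of actions, which is the discrete setup I am asking about.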

So I checked some forums, and they said that for discrete action spaces I need to implement an adaptation, e.g. Soft Actor-Critic (SAC). Is that true? I couldn't find a scientific paper that says anything like that.

Thank you for your help



Sources

This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.

Source: Stack Overflow
