Category "stable-baselines"

model.learn(total_timesteps=500000) not causing model improvement in a custom open ai gym environment

I am trying to follow along a tutorial made by a popular youtuber about custom openai gym environments, but unable to replicate his results. I initially setup m

I'd like to get the episodic rewards in csv format in stable baselines 3

I want to retrieve the data after every episode, I've read the documentation that you can use, stable_baselines3.common.monitor.ResultsWriter but I don't know h

Error running model trained on cloud on local machine

I trained a reinforcement learning model using a GPU instance from Lambda Labs cloud service. The library I used for training is stable_baselines3. When I try

RL + optimization: how to do it better?

I am learning about how to optimize using reinforcement learning. I have chosen the problem of maximum matching in a bipartite graph as I can easily compute the

LSTM based policy in stable baselines3 model

I am trying to make a PPO model using the stable-baselines3 library. I want to use a policy network with an LSTM layer in it. However, I can't find such a possi

Why does ep_rew_mean decrease over time?

In order to learn about reinforcement learning for optimization I have written some code to try to find the maximum cardinality matching in a graph. Not only d