All, I am trying to train a distributed model using Horovod on Azure Machine Learning Service as shown below.

estimator = TensorFlow(source_directory=script_fo
So I've got an Azure Machine Learning pipeline here that consists of a number of PythonScriptStep tasks - pretty basic really. Some of these script steps fail i