Category "azure-machine-learning-studio"

Horovod Timeline and MPI Tracing in Azure Machine Learning Workspace (MPI Configuration)

All, I am trying to train a distributed model using Horovod on Azure Machine Learning Service as shown below. estimator = TensorFlow(source_directory=script_fo

Azure Machine Learning pipeline: How to retry upon failure?

So I've got an Azure Machine Learning pipeline here that consists of a number of PythonScriptStep tasks - pretty basic really. Some of these script steps fail i