Category "distributed-computing"

Horovod Timeline and MPI Tracing in Azure Machine Learning Workspace (MPI Configuration)

All, I am trying to train a distributed model using Horovod on Azure Machine Learning Service as shown below. estimator = TensorFlow(source_directory=script_fo

Error using MultiWorkerMirroredStrategy to train object detection research model ssd_mobilenet_v1_fpn_640x640_coco17_tpu-8

I'm trying to train the research model ssd_mobilenet_v1_fpn_640x640_coco17_tpu-8 using the MultiWorkerMirroredStrategy (by setting --num_workers=2 in the invocation

JMeter doesn't show results when executed on slave, both GUI and CLI

Introduction: I'm new to JMeter, so I'm working on basic tasks, but I have a mentor. We are working with a distributed architecture, one master and one slave in