Running cupy.histogram along a given axis

I want to run cupy.histogram() in parallel on a 2D tensor of shape (1000, 10), computing a separate histogram over the 10 features of each instance. I want to avoid a for-loop. Is there a way to do this? Any help would be appreciated.
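One possible loop-free approach, offered as a sketch rather than a definitive answer: compute each value's bin index directly, shift every row into its own block of bin slots, and count everything with a single bincount call. The helper name `rowwise_histogram` and its `xp` parameter are made up for this illustration. It assumes all rows share the same equal-width bins over a known range [lo, hi]. The demo runs on NumPy so it works without a GPU; since CuPy provides `floor`, `clip`, `arange` and `bincount` as well, passing `xp=cupy` with a CuPy array should behave the same way, though that is untested here.

```python
import numpy as np


def rowwise_histogram(data, bins, lo, hi, xp=np):
    """Histogram each row of a 2D array over `bins` equal-width bins on [lo, hi].

    `xp` is the array module to use (numpy by default; cupy is assumed to work).
    """
    n_rows = data.shape[0]
    # Map every value to its bin index in one vectorized step.
    idx = xp.floor((data - lo) / (hi - lo) * bins).astype(xp.int64)
    # Values equal to `hi` (or outside the range) are clipped into the edge bins.
    idx = xp.clip(idx, 0, bins - 1)
    # Shift each row into its own block of `bins` slots so rows never collide.
    idx = idx + xp.arange(n_rows).reshape(-1, 1) * bins
    # A single bincount over the flattened indices counts all rows at once.
    counts = xp.bincount(idx.ravel(), minlength=n_rows * bins)
    return counts.reshape(n_rows, bins)


rng = np.random.default_rng(0)
x = rng.random((1000, 10))  # 1000 instances, 10 features each
hist = rowwise_histogram(x, bins=5, lo=0.0, hi=1.0)
print(hist.shape)  # (1000, 5)
```

Note that this differs from cupy.histogram in one respect: values outside [lo, hi] are counted in the first or last bin instead of being dropped, so mask them out beforehand if that matters. Values that fall extremely close to a bin edge may also land in a neighbouring bin because of floating-point rounding.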



Sources

This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.

Source: Stack Overflow

Solution Source