'How to detect when person holds rope at the bottom part in a video, by applying computer vision methods?

There is a video, where a person holds a rope at a specific moment. What I would like to do is to understand when this person actually holding bottom rope automatically. Can we do it with computer vision models?

enter image description here



Solution 1:[1]

Yes, it is possible

You should check out YOLO algorithm

Sources

This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.

Source: Stack Overflow

Solution Source
Solution 1 Funny Kup