'Is there a way to track multiple moving objects(spermatozoa) in a microscopic video without annotated dataset to train the object detetion model?

I am doing a research on "Motility Analysis of Human Spermatozoa". I have gathered 100 spermatozoa video samples of 100 persons. But this dataset is not annotated(labeled). Is there a way to build a unsupervised deep learning model with dataset?

Sample Image Frame of a video



Sources

This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.

Source: Stack Overflow

Solution Source