This is my dataset: from pyspark.sql import SparkSession, functions as F spark = SparkSession.builder.getOrCreate() df = spark.createDataFrame([('2021-02-07',)
custom-object
dbfit
odbc
clamav
capl
rhythmbox
tastypie
semantic-ui
system.drawing.graphics
exonum
htmlspecialchars
android-coroutine
jpm
textmate2
fabricjs2
writable
android-bitmap
opsgenie
dereference
github-organizations
strcmp
3gp
simpleaudioengine
devspace
kombu
non-modal
picker
powershell-4.0
touchesbegan
discrete-optimization