This is my dataset: from pyspark.sql import SparkSession, functions as F spark = SparkSession.builder.getOrCreate() df = spark.createDataFrame([('2021-02-07',)
iphone-8
xmpphp
rtools
message-body
nexus-7
awesomium
media-library
tslint
nsvisualeffectview
windows-1251
format-patch
project-properties
laravel-envoy
application-restart
nexus6
rndis
behaviorsubject
uglifyjs2
uti
systrace
web-development-server
preserve
google-slides
pipfile
appscan
egress
distutils
discrete
tfs-proxy
google-forms