This is my dataset: from pyspark.sql import SparkSession, functions as F spark = SparkSession.builder.getOrCreate() df = spark.createDataFrame([('2021-02-07',)
google-compute-disk
cypress-each
spacy-3
intelephense
delphi-xe
jackson-modules
xpdf
sqlcommandbuilder
dependent-name
anonymous-methods
jira-plugin
dataservice
a2dp
viewport3d
fancybox
android-build-flavors
magic-mirror
sfspeechrecognizer
memory-layout
datafeed
mipmaps
buildsrc
approximate
openlink-virtuoso
ioredis
graph-layout
msmq
maml
cordic
audiowaveform