This is my dataset: from pyspark.sql import SparkSession, functions as F spark = SparkSession.builder.getOrCreate() df = spark.createDataFrame([('2021-02-07',)
program-files
report-viewer2016
windows-server-container
rayshader
genesis
zxing-js
uberspace
smartcard-reader
author
superpixels
clflush
stdmove
dfu
httpwebrequest
kml
emacs
iformfile
pyscripter
commerce
company-mode
fastjsonapi
sanity-testing
tf.keras
ntfs-mft
xhtml2pdf
diamond-problem
mysql-error-1111
angular-akita
samsung-knox
git-fsck