This is my dataset: from pyspark.sql import SparkSession, functions as F spark = SparkSession.builder.getOrCreate() df = spark.createDataFrame([('2021-02-07',)
divider
openrowset
twint
.net-1.1
phpmd
aws-msk-connect
pixmap
cro
selenide
resource-monitor
mantissa
clsx
domcontentloaded
selector-bem-pattern
casablanca
knyle-style-sheet
bioservices
assembly-references
fxcop
angularjs-scope
xwpf
accountmanager
keycloak-rest-api
iwork
underscore.js
match
hidden-markov-models
bluez
firebase-app-check
automatic-semicolon-insertion