This is my dataset:

    from pyspark.sql import SparkSession, functions as F
    spark = SparkSession.builder.getOrCreate()
    df = spark.createDataFrame([('2021-02-07',)