This is my dataset:

from pyspark.sql import SparkSession, functions as F
spark = SparkSession.builder.getOrCreate()
df = spark.createDataFrame([('2021-02-07',)