This is my dataset: from pyspark.sql import SparkSession, functions as F spark = SparkSession.builder.getOrCreate() df = spark.createDataFrame([('2021-02-07',)
bgi
mongodb-geospatial
mechanicalsoup
nested-properties
h3
converter
rollback
cloud-sql-proxy
gdcm
tortoisegit
mongodb-replica
expo-build
sas-visual-analytics
http-headers
el
buttoncolor
cinder
librato
ckeditor-wordcount
rocket-chip
pbs
async-pipe
varchar2
modbus-rtu-over-tcp
fluent-interface
material-ui-x
mobile-application
gae-search
use-effect
android-binder