This is my dataset: from pyspark.sql import SparkSession, functions as F spark = SparkSession.builder.getOrCreate() df = spark.createDataFrame([('2021-02-07',)
vfork
crosh
jsoncpp
directory
geolite2
multistage-pipeline
laravel-medialibrary
steam-condenser
jsonconvert
pandas-ta
netadvantage
subsonic
ibm-content-navigator
asp.net-core-3.0
rtmp
boost-interprocess
managed
glassfish-4.1
ext2
in-clause
tizen-sdb
mysql-5.0
gwt2
alexa-presentation-language
immer.js
tcpdf
entity-model
high-level-architecture
web-of-things
filehelpers