This is my dataset: from pyspark.sql import SparkSession, functions as F spark = SparkSession.builder.getOrCreate() df = spark.createDataFrame([('2021-02-07',)
web-animations-api
zugferd
sca
zip4j
high-order-component
qnx-neutrino
2-way-object-databinding
login-page
johnny-five
execl
burndowncharts
video-gallery
venmo
webspeech-api
worklight-geolocation
netty4
uncompyle6
google-one-tap
microsoft.extensions.configuration
licenses.licx
numerical-integration
glassfish-6
gulp-autoprefixer
buffering
flutter-freezed
smote
primeng-dialog
python-ast
cheerp
stress-testing