This is my dataset:

from pyspark.sql import SparkSession, functions as F
spark = SparkSession.builder.getOrCreate()
df = spark.createDataFrame([('2021-02-07',)