I have a Spark question. For each entity k, the input is a sequence of probabilities p_i, each with an associated value v_i. For example, the data can look like: