I have a Spark question, so for the input for each entity k I have a sequence of probability p_i with a value associated v_i, for example the data can look like
insert-select
routes
notificationmanager
ruby-2.3
easy-webpack
dotspatial
autodesk
cloud">cloud
onreadystatechange
fosrestbundle
privileges
shadowjar
safe-navigation-operator
mod
windows-server-2012
spatial-data-frame
urlacl
itemrenderer
appcelerator
google-bi-engine
reinterpret-cast
private-constructor
namedtuple
skyfield
uic
hiveserver2
jake
pxssh
aasm
launch-daemon