I have a Spark question. For each entity k, the input is a sequence of probabilities p_i, each with an associated value v_i. For example, the data can look like