I have a Spark question. For each entity k, the input is a sequence of probabilities p_i, each with an associated value v_i. For example, the data can look like: