I have a Spark question, so for the input for each entity k I have a sequence of probability p_i with a value associated v_i, for example the data can look like
i18n-ally
gridpanel
alphabetic
banner
nstextstorage
hibernate-generic-dao
facebook-android-sdk
google-compute-engine
mysql-backup
android-gallery
key-bindings
mailhog
service-locator
state-pattern
go-testing
cursorindexoutofboundsexception
ubsan
spring-cloud-contract
annoy
session-reuse
componentsseparatedbystring
sql-server-2008
text-analytics-api
bufferedreader
go-generate
dynamic-data
scully
alamofire5
openalpr
tiingo