When using the scikit-learn library in Python, I can use `CountVectorizer` to create n-grams of a desired length (e.g. 2 words) like so: from sklearn.metrics.