When using the scikit-learn library in Python, I can use CountVectorizer to create n-grams of a desired length (e.g. 2 words) like so: from sklearn.metrics.