When using the scikit-learn library in Python, I can use the CountVectorizer to create ngrams of a desired length (e.g. 2 words) like so: from sklearn.metrics.
adventure
elastix-itk
libdl
multiple-makefiles
non-modal
membership-provider
nexus-5
neovis
cordova-android
struts1
strongly-typed-dataset
resolveurl
animatedimagedrawable
asqueryable
python-2.7
partial-matches
runnable-jar
log4j
controlvalueaccessor
react-native-router-flux
react-native-picker
spinnaker-halyard
swagger-editor
beanstalkd
rootviewcontroller
boyer-moore
switchcompat
managementeventwatcher
crop
angularjs-ng-include