To identify the step that is using most of the computation time, I ran cProfile and got the following result: ncalls tottime percall cumtime percall fil
pycurl
vtd-xml
pipes-filters
testing-libraryreact-native
gdi
rawurl
lua-scripting-library
pygraphviz
line-drawing
fbjs
nedit
eula
porter-stemmer
archilogic
wizard
jenkins-scriptler
formula.js
shinyproxy
authenticator
targets
openoffice-calc
non-breaking-characters
swtchart
chapel
petite-vue
hadoop-lzo
thread-exceptions
directsound
kafka-transactions-api
double-byte