I am using nutch-2.3.1 with Hbase-0.98.8-hadoop2 and the crawl runs fine for HTML pages, but when trying to run the crawl for PDF URLs only some of them seems t
google-ima
xmlelement
android-memory
doxia
certificatepolicy
forecasting
power-bi-report-server
icx
hexo
sightly
web-storage
vue-script-setup
scrolltop
paypal-vault
binary-operators
nvrtc
awt
slickgrid
cortana-intelligence
categorical-data
arquero
koin-scope
tvos10
text-justify
controller-tests
iis-8.5
tmap
factory
jupyter-contrib-nbextensions
activex