I am using nutch-2.3.1 with Hbase-0.98.8-hadoop2 and the crawl runs fine for HTML pages, but when trying to run the crawl for PDF URLs only some of them seems t
amazon-device-messaging
bootstrap-notify
convolutional-neural-network
cffi
cp1251
clonenode
multi-camera-api
ms-publisher
numberformatter
inkscape
haskell-program-coverage
qcheckbox
wildfly-22
android-9.0-pie
saucelabs
payumoney
frequency-table
texttable
proof-assistant
distributed-lock
bitwig-api
.net-core-authorization
idle-timer
cloudkit
autotools
resources
docker-desktop
passcode
qmediaplayer
firemonkey-style