I am using nutch-2.3.1 with Hbase-0.98.8-hadoop2 and the crawl runs fine for HTML pages, but when trying to run the crawl for PDF URLs only some of them seems t
case-tools
netbeans-8
imagelibrary
sql-data-warehouse
moving-average
dcmtk
msdeploy
arima
azure-servicebus-subscriptions
oorexx
linux-containers
tls1.0
r-faq
codesniffer
locks
plutus
coverage.py
mousekeyhook
angular10
two-connection-limit
apache-commons-cli
aspnet-compiler
react-native-camera
semgrep
reportbuilder3.0
google-pagespeed
microk8s
number-manipulation
django-mongodb-engine
adsapi-php.ini