I have to do copy of an S3 to HDFS of an cluster EMR. I'm trying to smaller the execution time of my job. Looking in the logs the map input of the job is 1_000_
bioformat
parallelstream
ex-unit
react-lifecycle
tomlkit
pyreverse
bootstrap-popover
plink
immer.js
scrapyd-deploy
affix
libnotify
runpath
django-autocomplete-light
errordocument
mongodate
criteriaquery
jsignature
bios
nacelle
dotnet-new
didfailwitherror
fcmp
control-structure
react-router-redux
ansi-escape
gwmodel
currentlocation
data-objects
message-bus