I have to do copy of an S3 to HDFS of an cluster EMR. I'm trying to smaller the execution time of my job. Looking in the logs the map input of the job is 1_000_
excel4node
contentful
symfony1
pyvenv
std-pair
yandex-api
kotlin-serialization
horovod
microsoft-translator
symmetry
bitcoinjs-lib
preg-match
package.json
ngx-codemirror
android-12
django-subquery
novnc
header-only
array-intersect
racket-student-languages
iphone
subfloats
invoice-ninja
visualstatemanager
anonymous
arangodb-php
jdb
decoding
outlook-filter
openjms