I have to do copy of an S3 to HDFS of an cluster EMR. I'm trying to smaller the execution time of my job. Looking in the logs the map input of the job is 1_000_
libtool
slam
windows-terminal
controller-tests
access-log
instance-methods
amazon-ami
git-sparse-checkout
onscroll
blue-green-deployment
cinder
pydot
r-promises
factors
azure-free-services
umbraco6
vue-composition-api
linear-types
od
periodic
checker-framework
case-when
license-key
withcontext
report-studio
idl-programming-language
angular-material-8
network-flow
action-send
vegeta