'How do i rewrite this pyspark code more differently?
clinicaltrial_2021 = sc.textFile(clinical_location).map(lambda line: line.split('|')).filter(lambda line: len(line)>1) header = clinicaltrial_2021.first() clinicaltrial_2021 = clinicaltrial_2021.filter(lambda x: x!=header)
Sources
This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.
Source: Stack Overflow
| Solution | Source |
|---|
