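- Datasets in Spark are immutable, so every `withColumn` call returns a new Dataset with one more projection in its plan. That’s why you shouldn’t call `withColumn` many times; to add several columns, use a single `select` with all of them.
- Spark Calc from Vicky Boykis: https://twitter.com/vboykis/status/1655899657841541120?s=46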
- You can broadcast a DataFrame, not just native types (although you pay for a collect under the hood): https://stackoverflow.com/a/38667928.
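
A minimal PySpark sketch of the two notes above, under some assumptions: the helper names (`derived_column_exprs`, `add_columns_once`, `join_with_broadcast`) and the column names in the usage note are hypothetical, and `df`, `large_df` and `small_df` are assumed to be `pyspark.sql.DataFrame` objects. The first two helpers add all derived columns in one `selectExpr` projection instead of one `withColumn` call per column; the third marks the small side of a join for broadcast.

```python
from typing import Dict, List


def derived_column_exprs(columns: Dict[str, str]) -> List[str]:
    """Turn {new_column_name: sql_expression} into 'expression AS name' strings."""
    return [f"{expr} AS {name}" for name, expr in columns.items()]


def add_columns_once(df, columns: Dict[str, str]):
    """Add every derived column in a single projection.

    Equivalent in result to chaining df.withColumn(...) once per column,
    but the plan gets one projection instead of one per call.
    """
    return df.selectExpr("*", *derived_column_exprs(columns))


def join_with_broadcast(large_df, small_df, key: str):
    """Join on `key`, hinting Spark to broadcast the small DataFrame."""
    # Imported here so the helpers above can be used without Spark installed.
    from pyspark.sql.functions import broadcast

    return large_df.join(broadcast(small_df), on=key)
```

For example, `add_columns_once(orders, {"total": "price * qty", "is_big": "qty > 100"})` would replace two chained `withColumn` calls on a hypothetical `orders` DataFrame. Note that `broadcast(small_df)` is a hint to the planner, so the small side should comfortably fit in memory.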