Question: Explain the role of the 'broadcast' variable in PySpark.Answer: A 'broadcast' variable is used to cache a read-only variable in each node of a cluster to enhance the performance of joins.Example:
|
Is it helpful?
Yes
No
Most helpful rated by users: