You don't have permission for that!

Adventures in Machine Learning

Apache Spark Integration and Platform Execution for ML - ML 073

Apache Spark is a lightning-fast unified analytics engine for large-scale data processing and machine learning. In this episode, Ben and Michael unpack Spark by ping-ponging questions and answers, supplemented by various examples applicable to machine learning workflows.

May 26, 2022
45:41
Episode 073
Episode Artwork

Apache Spark Integration and Platform Execution for ML - ML 073

Adventures in Machine Learning

45:41

0:00 45:41
Speed:

Share This Episode

Show Notes

Apache Spark is a lightning-fast unified analytics engine for large-scale data processing and machine learning. In this episode, Ben and Michael unpack Spark by ping-ponging questions and answers, supplemented by various examples applicable to machine learning workflows.
In this Episode…
  1. How does Spark work?
  2. What makes Apache Spark effective?
  3. Dot repartition in Spark
  4. Parallel processing systems
  5. What is an aggregation in Spark sequel?
  6. Analytics with Spark 
  7. What is MPP?
  8. Testing for production
  9. Spark algorithms
Sponsors
Sponsored By: