Apache Spark Integration and Platform Execution for ML - ML 073

Apache Spark is a lightning-fast unified analytics engine for large-scale data processing and machine learning. In this episode, Ben and Michael unpack Spark by ping-ponging questions and answers, supplemented by various examples applicable to machine learning workflows.

Show Notes

Apache Spark is a lightning-fast unified analytics engine for large-scale data processing and machine learning. In this episode, Ben and Michael unpack Spark by ping-ponging questions and answers, supplemented by various examples applicable to machine learning workflows.
In this Episode…
  1. How does Spark work?
  2. What makes Apache Spark effective?
  3. Dot repartition in Spark
  4. Parallel processing systems
  5. What is an aggregation in Spark sequel?
  6. Analytics with Spark 
  7. What is MPP?
  8. Testing for production
  9. Spark algorithms
Sponsors
Sponsored By:
Album Art
Apache Spark Integration and Platform Execution for ML - ML 073
0:00
45:41
Playback Speed: