ISSN: 2265-6294

Distributed Computing with Dask and Apache Spark: A Comparative Study

Main Article Content

Ankita Jain,Devendra Singh Sendar,Sarita Mahajan

Abstract

In the unexpectedly expanding landscape of dispensed computing, the choice of frameworks profoundly affects the efficiency and scalability of records processing workflows. This comparative take a look at delves into the architectures, overall performance metrics, and consumer reports of main allotted computing frameworks: Dask and Apache Spark. Both frameworks have won prominence for his or her ability to handle huge-scale records processing, yet they diverge of their essential tactics. Dask embraces a flexible mission graph paradigm, even as Apache Spark is predicated on a resilient allotted dataset (RDD) abstraction.

Article Details