Apache Spark is powerful, large-scale data processing has made it a core analytics technology for organizations. With performance that’s up to 100 times faster than Hadoop, Apache Spark makes large demands on underlying storage infrastructure.
The Pure Storage FlashBlade™ array is an all-flash data platform that not only handles Spark’s data requirements with ease – it accelerates Spark queries by up to six times. It is easily deployed, scaled, and managed. FlashBlade delivers competitive advantages over existing storage architectures.
The FlashBlade array is here to use with all the file management tools typical to Spark. This includes Mesos, Kubernete, Parquet, Hadoop Yarn, and more. With a full REST API, you’ll be able to integrate FlashBlade with any tool you choose.