Spark
Apache Spark is a powerful open-source engine for big data analytics. It offers over 80 high-level operators that make it easy to build parallel applications, and it is used at a wide range of organizations to process large datasets.
Features:
It can run applications in a Hadoop cluster up to 100 times faster in memory, and up to 10 times faster on disk, than Hadoop MapReduce
It offers lightning-fast processing
It supports sophisticated analytics, including SQL queries, streaming data, machine learning, and graph processing
It integrates with Hadoop and with existing Hadoop data
It provides built-in APIs in Java, Scala, and Python