Big data people see Spark and its toolset ??? the Shark SQL query engine, the Spark Streaming tool for processing data on the fly, the MLib library for machine learning, and the GraphX API for graph processing ??? as the successor of technologies based on MapReduce, the initial programming model for the Hadoop ecosystem of open-source tools for analyzing lots of different kinds of data.