Chetan Rawal, MathWorks
Import three years of New York City taxi fare data from a Hadoop® Distributed File System (HDFS) into a MATLAB® tall table. Then use Statistics and Machine Learning Toolbox™ to predict fares from location in the city and time of day. With Parallel Computing Toolbox™, the algorithm scales automatically to multiple cores on your local computer or to a compute cluster. With MATLAB Compiler™, you can deploy the algorithm to an Apache Spark™ cluster with minimal code changes.
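The workflow described above can be sketched roughly as follows. This is a minimal illustration, not the code used in the video: the Hadoop install path, the HDFS URL, the column names, and the choice of a linear model via `fitlm` are all assumptions made for the example.

```matlab
% Sketch only: the paths, URL, and column names below are hypothetical.
setenv('HADOOP_HOME', '/path/to/hadoop');    % assumed Hadoop install location

% Point a datastore at CSV files on HDFS and wrap it in a tall table.
ds = datastore('hdfs://namenode:8020/data/nyctaxi/*.csv', ...  % assumed URL
    'TreatAsMissing', 'NA');
ds.SelectedVariableNames = {'pickup_datetime', 'pickup_longitude', ...
    'pickup_latitude', 'fare_amount'};       % assumed column names
tt = tall(ds);

% Derive a time-of-day predictor. Operations on tall arrays are deferred
% until results are requested, so nothing is read from HDFS yet.
tt.hour_of_day = hour(tt.pickup_datetime);   % assumes a datetime column
tt = rmmissing(tt);                          % drop rows with missing values

% Fit a linear model of fare against location and hour of day.
% Fitting triggers evaluation of the deferred tall operations.
mdl = fitlm(tt, ...
    'fare_amount ~ pickup_longitude + pickup_latitude + hour_of_day');
disp(mdl.Coefficients)
```

Because the data lives in a tall table, the same script should run unchanged whether the tall computations are evaluated in the local MATLAB session, on a parallel pool, or on a cluster; only the execution environment changes.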