MLlib: Machine learning in Apache Spark

Apache Spark is a popular open-source platform for large-scale data processing that is well-suited for iterative machine learning tasks. In this paper we present MLLIB, Spark's open-source distributed machine learning library. MLLIB provides efficient functionality for a wide range of learning...

Full description

Bibliographic Details
Main Authors: Meng, Xiangrui (Author), Bradley, Joseph (Author), Yavuz, Burak (Author), Sparks, Evan (Author), Venkataraman, Shivaram (Author), Liu, Davies (Author), Freeman, Jeremy (Author), Tsai, DB (Author), Amde, Manish (Author), Owen, Sean (Author), Xin, Doris (Author), Franklin, Michael J. (Author), Zadeh, Reza (Author), Talwakar, Ameet (Author), Zaharia, Matei A (Contributor)
Other Authors: Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science (Contributor)
Format: Article
Language:English
Published: JMLR, Inc., 2018-07-06T14:08:08Z.
Subjects:
Online Access:Get fulltext