You are viewing content from a past/completed QCon

Presentation: What One Should Know About Spark MLlib

Track: Hands-on Codelabs & Speakers Office Hours

Location: Mission

Duration: 4:00pm - 4:10pm

Day of week: Tuesday

Share this on:


The goal of Spark MLlib is make practical machine learning scalable and easy. In addition to providing a set of common learning algorithms such as classification, regression, clustering, and collaborative filtering, it also provides a set of tools to help with building maintainable Machine Learning pipelines. This talk will dive into the concepts, details of these tools as well as the benefits they provide.

Speaker: Hien Luu

Engineering Manager @Linkedin focused on Big Data

Hien Luu is an engineering manager at LinkedIn and he is a big data enthusiast. He is particularly passionate about the intersection between Big Data and Artificial Intelligence. Teaching is one his passions and he is currently teaching Apache Spark course at UCSC Silicon Valley Extension school. He has given presentations at various conferences like QCon SF, QCon London, Hadoop Summit, JavaOne, ArchSummit and Lucene/Solr Revolution.

Find Hien Luu at

2019 Tracks

  • Groking Timeseries & Sequential Data

    Techniques, practices, and approaches around time series and sequential data. Expect topics including image recognition, NLP/NLU, preprocess, & crunching of related algorithms.

  • Deep Learning in Practice

    Deep learning use cases around edge computing, deep learning for search, explainability, fairness, and perception.