Presentation: A Whirlwind Overview of Apache Beam

Track: Real-world Data Engineering

Location: Cyril Magnin III

Duration: 10:35am - 10:45am

Day of week: Wednesday

Share this on:

Abstract

Apache Beam offers a novel programming model for data processing with two major distinctive features: full unification of batch and streaming, and portability across different runners and different languages.
We give a quick overview of the fundamentals of the Beam programming model, and an even quicker overview of the project's place in the data processing ecosystem and its future directions.

Note: This is a short talk. Short talks are 10-minute talks designed to offer breadth across the areas of machine learning, artificial intelligence, and data engineering. The short talks are focused on the tools and practices of data science with an eye towards the software engineer.

Speaker: Eugene Kirpichov

Cloud Dataflow Staff SE @Google

Eugene Kirpichov is a Staff Software Engineer on the Cloud Dataflow team at Google, where he works on the Apache Beam programming model and APIs. Previously, he worked on Cloud Dataflow’s autoscaling and straggler elimination techniques. Eugene is interested in programming language theory, data visualization, and machine learning.

Find Eugene Kirpichov at

Similar Talks

Parallel processing and machine learning @Google
Parallel processing and machine learning @Google
Software Engineer @Google
TensorFlow Developer Advocate @Google
TensorFlow Developer Advocate @Google

Tracks

  • Deep Learning Applications & Practices

    Deep learning lessons using tooling such as Tensorflow & PyTorch, across domains like large-scale cloud-native apps and fintech, and tacking concerns around interpretability of ML models.

  • Predictive Data Pipelines & Architectures

    Best practices for building real-world data pipelines doing interesting things like predictions, recommender systems, fraud prevention, ranking systems, and more.

  • ML in Action

    Applied track demonstrating how to train, score, and handle common machine learning use cases, including heavy concentration in the space of security and fraud

  • Real-world Data Engineering

    Showcasing DataEng tech and highlighting the strengths of each in real-world applications.