Building Realtime Pipelines in Cloud Data Fusion
About this Course
This is a self-paced lab that takes place in the Google Cloud console. In addition to batch pipelines, Data Fusion also allows you to create real-time pipelines, that can process events as they are generated. Currently, realtime pipelines execute using Apache Spark Streaming on Cloud Dataproc clusters. In this lab, you will learn how to build a streaming pipeline using Data Fusion.Created by: Google Cloud

Related Online Courses
Parallel, concurrent, and distributed programming underlies software in multiple domains, ranging from biomedical research to financial services. This specialization is intended for anyone with a... more
This specialization starts with an in-depth look at the physical geography of the Arctic and its key climate features, including the ocean\'s floating sea ice cover, the Greenland Ice Sheet, and... more
Code and run your first Java program in minutes without installing anything! This course is designed for learners with limited coding experience, providing a solid foundation of not just Java, but... more
In this course, we will explore fundamental issues of fairness and bias in machine learning. As predictive models begin making important decisions, from college admission to loan decisions, it... more
If I Googled you, what would I find? As we move around the online world we leave tracks and traces of our activity all the time: social media accounts, tagged images, professional presences, scraps... more