My first batch pipeline with Apache Beam
Presented by Israel Herraiz at Beam College 2022.
Who is mentioned more in Don Quixote, Dulcinea or Sancho? In this workshop we will develop our first pipeline in Python with Apache Beam, using the text of Don Quixote, the world-renowned Spanish novel, to find out who is Don Quixote's actual soulmate. Bring your favourite Python development environment with Python 3.7, 3.8, or 3.9, clone the Beam College repo, and enjoy!
Slides: https://github.com/Beam-College/season-2022/blob/main/day1/5-Workshop-FirstPipeline.pdf
Code: https://github.com/Beam-College/season-2022/tree/main/day1/workshop-code
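
To give a feel for what such a pipeline might look like, here is a minimal sketch of a batch job that counts mentions of the two names. It is not the workshop's actual code (see the repo linked above for that). The helper `extract_names`, the input file name `quijote.txt`, and the output prefix `mentions` are assumptions made up for illustration.

```python
import re

# Names to track, lowercased. Illustrative choice, not taken from the workshop code.
NAMES = ("dulcinea", "sancho")


def extract_names(line):
    """Yield one entry per occurrence of a tracked name in a line of text."""
    for word in re.findall(r"[a-záéíóúüñ]+", line.lower()):
        if word in NAMES:
            yield word


def run(input_path, output_prefix):
    """Read a text file, count mentions per name, and write 'name,count' lines."""
    # Imported here so that extract_names can be tried out without Beam installed.
    import apache_beam as beam

    with beam.Pipeline() as pipeline:
        (
            pipeline
            | "ReadLines" >> beam.io.ReadFromText(input_path)
            | "ExtractNames" >> beam.FlatMap(extract_names)
            | "CountPerName" >> beam.combiners.Count.PerElement()
            | "FormatCsv" >> beam.MapTuple(lambda name, count: f"{name},{count}")
            | "WriteCounts" >> beam.io.WriteToText(output_prefix)
        )


if __name__ == "__main__":
    # Hypothetical file names; point these at your own copy of the text.
    run("quijote.txt", "mentions")
```

With no runner specified, Beam falls back to its local direct runner, which is enough for a single text file. Keeping the name-matching logic in a plain function makes it easy to test on its own before wiring it into the pipeline.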