Github url: https://github.com/vigneshSs-07/Cloud-AI-Analytics/tree/main/Apache%20Beam%20-Python
In this videos we are going to discuss about what is Pcollections and characteristics of Pcollection in Apache Beam. Implemented PCollections practically using Apache beam Python SDK.
A PCollection represents a distributed data set that your Beam pipeline operates on. The data set can be bounded, meaning it comes from a fixed source like a file, or unbounded, meaning it comes from a continuously updating source via a subscription or other mechanism. Your pipeline typically creates an initial PCollection by reading data from an external data source, but you can also create a PCollection from in-memory data within your driver program. From there, PCollections are the inputs and outputs for each step in your pipeline.
Apache Beam - Playlist
https://youtube.com/playlist?list=PLA3TuOOaQOnmxerILIQlBIIy7igMo0Ax1
Tutorial 0 - Introduction to Apache beam
https://www.youtube.com/watch?v=82p6j5I_VD4&list=PLA3TuOOaQOnmxerILIQlBIIy7igMo0Ax1&index=2
Tutorial 2 - PCollections In Apache Beam - Python
https://www.youtube.com/watch?v=8wDs7vNJoiY&list=PLA3TuOOaQOnmxerILIQlBIIy7igMo0Ax1
Tutorial 3a – Map, FlatMap and Filter Transforms in Apache Beam
https://www.youtube.com/watch?v=dSZOY4vkr1E&list=PLA3TuOOaQOnmxerILIQlBIIy7igMo0Ax1&index=4
Tutorial 3 b – ParDo, Keys, Kvswap, Values, ToString Transform in Apache Beam
https://www.youtube.com/watch?v=NaH-rVwUDTI&list=PLA3TuOOaQOnmxerILIQlBIIy7igMo0Ax1&index=5
Tutorial 3c – GroupBy, GroupByKey, CoGroupByKey and GroupIntoBatches Transform in Apache Beam
https://www.youtube.com/watch?v=hcUTBjsgl3Q&list=PLA3TuOOaQOnmxerILIQlBIIy7igMo0Ax1&index=6
Subscribe to Channel: https://lnkd.in/ehFZbVH5
Follow us in LinkedIn: https://lnkd.in/gDT3ESdm
Follow us in Instagram: https://lnkd.in/gZ278ShA
Follow us in Facebook: https://lnkd.in/gQGF_3Eb
Follow us in Twitter: https://lnkd.in/gh7dZACW
Join in Telegram channel: https://lnkd.in/guFt2sAg
Join in WhatsApp group: https://lnkd.in/gAqkuDPA
Connect with me here:
Instagram: https://www.instagram.com/thearrow0494
LinkedIn: https://www.linkedin.com/in/vignesh-sekar-sujatha-02aa9b125/
🙏🙏🙏🙏🙏🙏🙏🙏
YOU NEED TO DO BELOW THINGS to support my channel
1. LIKE
2. SHARE
&
3. SUBSCRIBE
TO MY YOUTUBE CHANNEL
#gcpcloud #gcpdataengineer #Apachebeam