In this video I give an overview of pandas 2.0 and the main changes related to the apache arrow backend.
Marc Garcia's Article: https://datapythonista.me/blog/pandas-20-and-the-arrow-revolution-part-i
Timeline:
00:00 Intro
01:04 Legacy Numpy
02:49 Arrow Backend
03:44 Missing Values
04:33 Speed
05:47 Interoperability
07:42 Arrow Data Types
Check out my other videos:
Data Pipelines: Polars vs PySpark vs Pandas: https://youtube.com/watch?v=mi9f9zOaqM8&feature=shares
Polars for Data Science: https://youtube.com/watch?v=VHqn7ufiilE&feature=shares
Speed up Pandas Dataframes: https://youtube.com/watch?v=u4rsA5ZiTls&feature=shares
Avoid These Pandas Mistakes: https://youtube.com/watch?v=_gaAoJBMJ_Q&feature=shares
Links to my stuff:
* Youtube: https://youtube.com/@robmulla?sub_confirmation=1
* Discord: https://discord.gg/HZszek7DQc
* Twitch: https://www.twitch.tv/medallionstallion_
* Twitter: https://twitter.com/Rob_Mulla
* Kaggle: https://www.kaggle.com/robikscube
::::::::::::::::::::
Music: Head Candy - William Rosati
Support by RFM - NCM: https://bit.ly/3jpOhJn
::::::::::::::::::::