Practical Guide to Designing Implants for Pandas

Since version 0.23, the pandas library allows using custom user types for internal representation in series and data frames by introducing the ExtensionArray and ExtensionDtype interfaces (in places where a NumPy array would be used). Version 0.24 brings that forward by implementing all its “exotic” types in terms of the mentioned interfaces.

The talk will explore the possibilities and shortcomings of extension arrays and will gradually build towards a simple proof-of-concept custom column that supports physical units (including the dimension-aware arithmetics and conversions).

In addition, two other approaches of adding custom behaviour to pandas – inheriting from pandas types and creating accessors for series / data frames / indices – will be presented.

Jan Pipek
Data Scientist, DTOne

I am a data scientist and DTone, a global network for mobile top-ups, rewards and airtime credit. I only recently converted from Monte Carlo simulations in medical physics. I am a data scientist and DTone, a global network for mobile top-ups, rewards and airtime credit. I only recently converted from Monte Carlo simulations in medical physics.
I’ve been using Python for more than ten years, with a strong inclination for data analysis and visualization (having written several useless and hopefully at least one useful library – physt), but also trying to enjoy the language in the broader sense.