Talks

Presentations and Tutorials I have given at various public events.

2023

DKFZ Data Science Seminar – Best practices for parallelizing data pipelines | YouTube

2020

Techtiefen #31 - Effiziente Datenverarbeitung | Podcast

PyData Global 2020 – pandas.(to/from)_sql is simple but not fast | SlideShare | SpeakerDeck

2019

PyData Südwest Karlsruhe – Lightning Talk on the creation of a conda-forge package | Pull Request

PyData Frankfurt (Main) – (Efficient) Data Exchange with “Foreign” Ecosystems | SlideShare | SpeakerDeck

Berlin Buzzwords – Taming the language border in data analytics and science with Apache Arrow | SlideShare | SpeakerDeck | YouTube

2018

PyConDE / PyData Karlsruhe – Fulfilling Apache Arrow’s Promises: Pandas on JVM memory without a copy | SlideShare | SpeakerDeck | YouTube

PyConDE / PyData Karlsruhe – Scalable Scientific Computing with Dask (Workshop) | YouTube

data2day Heidelberg - Free Movement of Data with Apache Arrow | SlideShare | SpeakerDeck

GridSchool, Karlsruhe – Scalable Scientific Analysis in Python using Pandas and Dask

PyData Berlin - Extending Pandas using Apache Arrow and Numba | SlideShare | SpeakerDeck | YouTube

PyData Amsterdam - Building customer-visible data science dashboards with Altair / Vega / Vue | SlideShare | SpeakerDeck | YouTube

Man AHL (open source) Hackathon, London – Mentor for Apache Arrow contributions

2017

PyConDE / PyData Karlsruhe – Connecting PyData to other Big Data Landscapes using Arrow and Parquet | SlideShare | SpeakerDeck | YouTube

PyData London – Efficient and portable DataFrame storage with Apache Parquet | SlideShare | SpeakerDeck | YouTube

2016

ApacheCon Big Data Europe (Seville) — Parquet Format in Practice & Detail | SlideShare | SpeackerDeck

Software Engineering Daily - Apache Arrow

PyData Paris — How Apache Arrow and Parquet boost cross-language interoperability | SlideShare | SpeakerDeck | YouTube