Writing a self-contained ETL pipeline with Python

For this tutorial we are going to create a self contained small ETL pipeline for processing reddit posts in real time. Read more


Better Python Object Serialization

The Python standard library is full of underappreciated gems. One of them allows for simple and elegant function dispatching based on argument types. This makes it perfect for serialization of arbitrary objects – for example to JSON in web APIs and stru... (more…)

Read more »