Data Pipelines: Cassandra, Kafka and Python (and Go!)

Last year I started working on a ‘Big Data’ exercise. It’s an ongoing project that mixes large amounts of web traffic, data ingestion and analytics. It’s also really fun. We get to play with an array of new technologies – sometimes on a bet, granted – but…

Similar

Reproducible Python Bytecode

Some Linux distributions (like Alpine and Arch Linux) are shipping something called “python bytecode” in their packages. It’s stored in .pyc files and is generated during the package build. They’re stored in __pycache__ folders and can be seen here: (more…)

Read more »