Fredrik Lundh crafted our current string search algorithms, and
they've served us very well. They're nearly always as fast as
dumbest-possible brute force search, and sometimes much faster. This
was bought with some very cheap one-pass preprocessing of t... (more…)
Read more »
Python’s itertools package provides you with a tonne of iterators. In this episode, we take a whirlwind tour of all the things itertools has to offer, and al... (more…)
Read more »
Tutorial explaining how to create a topic model using Gensim and Dremio on data stored in Amazon S3. (more…)
Read more »
Whenever I started learning about parsing and writing interpreters, I would get to how to handle comments. The gist of it was simple: just throw them out! You find them in the raw text output as you are generating lexing tokens, and discard them, meaning ... (more…)
Read more »
A document that NSA uses for teaching Python. This was obtained via a FOIA request, per https://twitter.com/chris_swenson/status/1225836060938125313?s=09... (more…)
Read more »