Stanza: A Python Natural Language Processing Toolkit for Many Human Languages

We introduce Stanza, an open-source Python natural language processing
toolkit supporting 66 human languages. Compared to existing widely used
toolkits, Stanza features a language-agnostic fully neural pipeline for text
analysis, including tokenization, m… Read more

Similar

Fast ODBC with Python – turbodbc

Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. In addition to complying with the Python Database API Specification 2.0, turbodbc offers built-in NumPy support. Don’t wait minutes for your re... (more…)

Read more »

Property-Based Testing in Python

NOTE: This blog post complements a PyDistrict presentation on the same topic posted on this date. Thanks to Rami Chowdhury for inviting me to speak, and the PyDistrict organizers for hosting me. Code samples from this talk are available at this GitHub rep... (more…)

Read more »