CoCalc Blog

NLTK text corpus

haraldschilly • • python

The full 2.4gb NLTK text corpus is now available. You can for example run this in our SageMath or the Anaconda Python environment:

from nltk.corpus import brown
w = brown.words()
len(list(w))

which gives

1161192