For some context of setting up this thread, I was giving some tutorial on TF-IDF to a crowd having their first taste of NLP on https://goo.gl/SHt8CS
And I got the question. I tried to explain it for numerical stability but I thought it would be good to seek answers from experts on corpora list too. -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 883 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20180601/20ca3801/attachment.txt>