What tokens are used more vs. less in #TidyTuesday place names?
This article was originally published at https://juliasilge.com/blog/

Let’s use byte pair encoding tokenization along with Poisson regression to understand which tokens are used more often (or less often) in US place names.
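As a rough illustration of the byte pair encoding idea mentioned above, here is a minimal sketch in plain Python. It is not the code from the original post: the function name `learn_bpe_merges`, the tie-breaking rule, and the toy place names are all assumptions made for this example. The sketch starts from single characters and repeatedly merges the most frequent adjacent pair of symbols, which is how subword tokens such as a shared suffix can emerge from a list of names.

```python
from collections import Counter


def learn_bpe_merges(words, num_merges):
    """Learn up to `num_merges` BPE merges from a list of words (hypothetical helper)."""
    # Each word starts as a tuple of single-character symbols.
    vocab = Counter(tuple(word) for word in words)
    merges = []
    for _ in range(num_merges):
        # Count every adjacent symbol pair, weighted by word frequency.
        pair_counts = Counter()
        for symbols, freq in vocab.items():
            for pair in zip(symbols, symbols[1:]):
                pair_counts[pair] += freq
        if not pair_counts:
            break
        # Pick the most frequent pair; ties are broken by the pair itself
        # so the result is deterministic (an assumption of this sketch).
        best = max(pair_counts, key=lambda p: (pair_counts[p], p))
        merges.append(best)
        # Rewrite every word, fusing each occurrence of the chosen pair.
        new_vocab = Counter()
        for symbols, freq in vocab.items():
            merged = []
            i = 0
            while i < len(symbols):
                if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == best:
                    merged.append(symbols[i] + symbols[i + 1])
                    i += 2
                else:
                    merged.append(symbols[i])
                    i += 1
            new_vocab[tuple(merged)] += freq
        vocab = new_vocab
    return merges, vocab


# Toy, made-up input for illustration only (not the #TidyTuesday data).
toy_names = ["springfield", "greenfield", "fairfield", "riverside"]
merges, vocab = learn_bpe_merges(toy_names, 3)
for symbols in vocab:
    print(" ".join(symbols))
```

Once names are split into tokens like this, the count of names containing each token is the kind of outcome a Poisson regression can model.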