text mining

castarter - content analysis starter toolkit for R

castarter is designed to make it easy, even for relatively inexperienced users, to create a textual dataset from a website or a section of a website, keep it up to date, and explore it through word frequency graphs or a web interface that makes it possible to tag items. Documentation is available on castarter’s website.

Roundtable: Research Data Quality Assessment for the Area-Studies on the post-Soviet region: New Approaches needed?

Surfing the post-Soviet web with style. Text mining post-Soviet de facto states

Scholars working on the post-Soviet space frequently refer to web content at different stages of their research process. However, they (we) usually approach the internet as an inordinate mass of content that can be superficially explored thanks to …