There is an abundance of easily mineable text data (WhatsApp, Twitter, and even our own emails!), and we have no excuse not to analyze it. In this workshop, we will learn some tips and tricks for dealing with messy text data, before moving on to some less commonly explored text analysis techniques, such as text summarization, working with distance metrics, and an old personal favorite: topic models.