This is a proforma welcome message for Batch 7 folks to the Text-Analytics for Business Applications (TABA) course.
Re what we did in the course:
We've gone through 5 TABA sessions which covered, respectively:
Session 1: Elementary Text-An + sentiment-An
Session 2: Data factorization + Topic Modeling 1
Session 3: Topic Modeling 2
Session 4: Basic NLP + PoS tagging + Named entity recognition (NER) and finally,
Session 5: Supervised text classification (using mainly the Maxent algo in R).
Re Readings and other material:
I shall put up links to relevant readings etc. In a lot of cases, just googling for particular topics can throw up a wealth of information.
The recommended textbook is theory-heavy and fairly comprehensive. I've linked a review of the same.
If you don;t have a free Github account, now is the time to get one and familiarize yourself with what Git is, how it works etc. For instance, here's a nice 10 minute read that can act as a starting guide.
I want you to think about sourcing some of your code, data etc directly from Git into Rstudio. I'll ask Pandey ji to address any of your queries on this score.
Re Home-works:
I'm committed to have your home-works (HWs) out early and on time. There'll be 2 individual and 2 group HWs.
The individual HWs will be simple, practice-based and clear-cut. Deliverable will be in R markdown form. I don;t expect any higher than 1.5-2 hrs of effort into each indiv HW.
The group HWs will be a little more comprehensive. Again, these too aren;t meant to be overly intricate or complex. Kindly ensure you're not over-thinking, over-analyzing or over-complicating any group HW.
Update:
I'm combining the 2 individual HWs into one. Likewise, combining the 2 group HWs into 1. So, there'll only be two HWs in TABA - one individual and one group.
Any Qs, comments, feedback etc. pls let me know.
Sudhir
No comments:
Post a Comment