Employer-brand tracking using NLP for sentiment analysis, topic modeling of web-scraped reviews

Developed the technological core and Proof-of-Concept for a digital service intended to provide insights and analytics regarding employers/companies to

  1. their HR/branding departments,
  2. their current employees, and
  3. prospective applicants.

Static data and time series of key derived attributes were generated by applying Natural-Language Processing methods on employee reviews that were web-scraped from Indeed using BeautifulSoup and post-processed using Python and spaCy, NLTK, and VADER for tokenization, topic modeling, sentiment analysis, calculation of various indices, correlation between rated employer attributes, relation to current or past employment status per review (to estimate a “regret index” or a “so-long score”), etc. The service was never launched, as the web-scraped data was not my own IP.

OVERBRING software
indeed-sentiment 2017
/images/software/stack/Python.png Python
/images/software/stack/NumPy.png NumPy
NLTK
/images/software/stack/spaCy.png spaCy
VADER
/images/software/stack/pandas.png pandas
Proprietary Deprecated