Where can I get a training set for an NLP project moderating sexual harassment comments?
I would check with Rebecca Chiao, co-founder of HarassMap (harassmap.org/en). If she doesn’t have a dataset of sexual harassment comments, I suspect she would have some good ideas about where to get one.
References:
- Using Data From Court Cases and Employee Surveys to Design Sexual Harassment Policies (2015)
- HarassMap: using crowdsourced data to map sexual harassment in Egypt (2014)
- Test of a Causal Model for Sexual Harassment Using Data From a Meta-analysis (2014)
- He Said, She Said, Let’s Hear What the Data Say: Sexual Harassment in the Media, Courts, EEOC, and Social Science (2012)