Applying machine learning classification methods to identify suicidal posts in the "SuicideWatch" subreddit.
Suicide is one of the leading causes of death globally, with an estimated 800,000 deaths annually, or one death every 40 seconds. A key aspect of suicide prevention is addressing suicidal thoughts and ideas before they turn into actions. With the help of machine learning, detecting suicidal ideation can help public health agencies allocate resources more effectively to reach people at risk.
Data used in this project can be found here. Below is the information required to reproduce the project:
- Helper functions saved as utils.py
- Dataset is saved in the _data folder as Suicide_Detection.csv
- Data cleaning notebook saved as data_cleaning.ipynb (the data produced from this notebook is found in the _data folder as clean_reddit.csv)
- Data processing notebook found in processing.ipynb
- Modelling notebook found in modelling.ipynb
- Final report saved as report.pdf
After creating your virtual environment, run the following on the command line to replicate the project environment:
git clone https://github.com/eugenebaraka/Predict-Suicidal-Ideation-on-Reddit.git
cd Predict-Suicidal-Ideation-on-Reddit
pip install -r requirements.txt
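The commands above assume a virtual environment is already active. One way to create it beforehand is sketched below, assuming Python 3 with the standard venv module and a POSIX shell. The directory name .venv is an arbitrary choice and is not taken from the repository.

```shell
# Create an isolated environment in a directory named .venv (hypothetical name)
python3 -m venv .venv
# Activate it for the current shell session (on Windows: .venv\Scripts\activate)
. .venv/bin/activate
# Confirm that pip now resolves inside the environment
python -m pip --version
```

With the environment active, pip install -r requirements.txt installs the dependencies into .venv rather than into the system Python.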
- Building a suicidal tweet classifier using NLP
- Detecting suicidality on Twitter
- Detecting Suicide Ideation in the Era of Social Media
- Tracking suicide risk factors through Twitter in the US
- "Right time, right place" health communication on Twitter
- Tweeting celebrity suicides
- A comparative analysis on suicidal ideation detection using NLP, ML, and DL
- Suicidal profiles detection in Twitter
- A machine learning approach predicts future risk to suicidal ideation from social media data
- Systematic review: machine learning and suicidal ideation
- An unsupervised learning approach for automatically to categorize potential suicide
- A lexicon-based approach to detecting suicide-related messages on Twitter
- Detection of Suicide Ideation in Social Media Forums Using Deep Learning
- Supervised learning for suicidal ideation detection in online user content
- Machine learning for suicidal ideation identification: A systematic literature review
- A brief introduction to weakly supervised learning