Machine learning for Project Cognoma

This repository hosts machine learning code and discussion (see Issues) for Project Cognoma.

NOTE: This repository is no longer up-to-date with the web application

The production notebook that is served to website users can be found in the ml-workers repository. This repository will be used for continued data exploration and new modeling approaches.

Notebooks

The following notebooks implement the primary machine learning workflow for Cognoma:

1.download.ipynb: downloads the cancer datasets.
2.mutation-classifier.ipynb: builds a classifier for mutation in a given gene.
3.pathway-classifier.ipynb: builds a classifier for mutation in any gene for a given pathway.

If you've modified a notebook and are submitting a pull request, then export the notebooks to scripts:

jupyter nbconvert --to=script --FilesWriter.build_directory=scripts *.ipynb

Environment

This repository uses conda to manage its environment and install packages. If you don't have conda installed on your system, you can download it here. You can install the Python 2 or 3 version of Miniconda (or Anaconda), which determines the Python version of your root environment. Since we create a dedicated environment for this project, named cognoma-machine-learning whose explicit dependencies are specified in environment.yml, the version of your root environment will not be relevant.

With conda, you can create the cognoma-machine-learning environment by running the following from the root directory of this repository:

# Create or overwrite the cognoma-machine-learning conda environment
conda env create --file environment.yml

If environment.yml has changed since you created the environment, run the following update command:

conda env update --file environment.yml

Activate the environment by running source activate cognoma-machine-learning on Linux or OS X and activate cognoma-machine-learning on Windows. Once this environment is active in a terminal, run jupyter notebook to start a notebook server.

May	JUN	Jul
	12
2017	2018	2019

Failed to load latest commit information.
algorithms
data/api	Simplify converting objects to JSON serializable formats (#60 )	Oct 18, 2016
explore
scripts	Revise parameter grid (#114 )	Sep 26, 2017
.gitignore
1.download.ipynb
2.mutation-classifier.ipynb	Revise parameter grid (#114 )	Sep 26, 2017
3.pathway-classifier.ipynb
LICENSE.md
README.md	Update README to reflect break with ml-workers	Apr 5, 2018
environment.yml	Change plots to use plotnine (#112 )	Aug 28, 2017
execute.sh	Automate cancer-data download from figshare (#42 )	Aug 31, 2016
utils.py

cognoma/machine-learning

Join GitHub today

Clone with HTTPS

Launching GitHub Desktop...

Launching GitHub Desktop...

Launching Xcode...

Launching Visual Studio...

README.md

Machine learning for Project Cognoma

NOTE: This repository is no longer up-to-date with the web application

Notebooks

Environment