The Wayback Machine - https://web.archive.org/web/20210120094428/https://github.com/sdtblck
Skip to content
Avatar

Pinned

  1. An implementation of model parallel GPT2& GPT3-like models, with the ability to scale up to full GPT3 sizes (and possibly more!), using the mesh-tensorflow library.

    Python 628 49

  2. Python tools for processing the stackexchange data dumps into a text dataset for Language Models

    Python 3 2

  3. Extracting pdfs using pdfminer.six and pyPDF2

    Python 1 3

712 contributions in the last year

Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec Jan Mon Wed Fri

Contribution activity

January 2021

Created 3 repositories

Created a pull request in lucidrains/DALLE-pytorch that received 5 comments

change gumbel softmax dimension

somewhat certain we should be softmaxing over hdim here - pytorch's default is -1 which is the height dimension

+1 −1 5 comments

Created an issue in EleutherAI/gpt-neox that received 6 comments

Implement Pipeline Parallelism

Should be fairly easy as our net is already expressed in terms of layers https://www.deepspeed.ai/tutorials/pipeline/

6 comments
You can’t perform that action at this time.