The Wayback Machine - https://web.archive.org/web/20230711225125/https://github.com/sgugger

sgugger

Follow

Sylvain Gugger sgugger

Follow

Senior ML Open Source Engineer at HuggingFace

2.8k followers · 4 following

Achievements

BetaSend feedback

Achievements

BetaSend feedback

Organizations

Block or Report

Block or report sgugger

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories

Deep-Learning Public

A few notebooks about deep learning in pytorch

Jupyter Notebook 561 115
Adam-experiments Public

Experiments with Adam/AdamW/amsgrad

Python 187 36
hf_examples Public

NLP Examples using the 🤗 libraries

Jupyter Notebook 40 3
torchdynamo-tests Public

Python 19 3
SwiftData Public

Swift 15 1
fastai Public

Forked from fastai/fastai

The fast.ai deep learning library, lessons, and tutorials

Jupyter Notebook 8 2

3,958 contributions in the last year

Learn how we count contributions

Activity overview

Contributed to huggingface/transformers, huggingface/accelerate, huggingface/safetensors and 15 other repositories

Contribution activity

July 2023

Created 10 commits in 2 repositories

Created a pull request in huggingface/transformers that received 2 comments

Skip keys not in the state dict when finding mismatched weights

What does this PR do? When looping through the keys in find_mismatched_weights, we loop through all the loaded_keys which are all the keys in the c…

+3 −0 • 2 comments

Opened 1 other pull request in 1 repository

huggingface/transformers 1 merged

Allow existing configs to be registered Jul 11

Reviewed 26 pull requests in 4 repositories

huggingface/transformers 14 pull requests

Unpin protobuf in docker file (for daily CI) Jul 11
[PoC] add PEFT support directly in transformers pipeline Jul 11
Replacement of 20 asserts with exceptions Jul 11
🐛 Handle empty gen_kwargs for seq2seq trainer prediction_step function Jul 11
Fix lr scheduler not being reset on reruns Jul 11
Skip some slow tests for doctesting in PRs (Circle)CI Jul 11
Fix eval_accumulation_steps leading to incorrect metrics Jul 11
📝 Add parameter names to code examples in README Jul 11
[docs] Performance docs tidy up, part 1 Jul 11
add gradient checkpointing for distilbert Jul 11
Fix typo in LocalAgent Jul 11
Docs: add kwargs type to fix formatting Jul 11
[DOC] Clarify relationshi load_best_model_at_end and save_total_limit Jul 11
Llama: add RoPE scaling Jul 11

huggingface/accelerate 9 pull requests

Skip tests when bnb isn't available Jul 11
Fixes for issue #1683: failed to run accelerate config in colab Jul 11
Move mixed precision wrapping ahead of DDP/FSDP wrapping Jul 11
Add Ascend NPU accelerator support Jul 11
Add offload for 8-bit model Jul 11
[docs] project_dir parameter update Jul 11
Fix launcher validation Jul 11
Improve quality errors Jul 11
Deepcopy on Accelerator to return self Jul 11

huggingface/notebooks 2 pull requests

Fix 404 Link Jul 11
Update torch_xla to a Colab compatible version Jul 11

huggingface/safetensors 1 pull request

Support musicgen conversion. Jul 11