Block or Report
Block or report sgugger
Report abuse
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abusePopular repositories
3,958 contributions in the last year
Less
More
Activity overview
Contributed to
huggingface/transformers,
huggingface/accelerate,
huggingface/safetensors
and 15 other
repositories
Contribution activity
July 2023
Created 10 commits in 2 repositories
Created a pull request in huggingface/transformers that received 2 comments
Skip keys not in the state dict when finding mismatched weights
What does this PR do?
When looping through the keys in find_mismatched_weights, we loop through all the loaded_keys which are all the keys in the c…
+3
−0
•
2
comments
Opened 1 other pull request in 1 repository
huggingface/transformers
1
merged
Reviewed 26 pull requests in 4 repositories
huggingface/transformers
14 pull requests
- Unpin protobuf in docker file (for daily CI)
-
[
PoC] add PEFT support directly in transformers pipeline - Replacement of 20 asserts with exceptions
-
🐛 Handle empty gen_kwargs for seq2seq trainer prediction_step function - Fix lr scheduler not being reset on reruns
- Skip some slow tests for doctesting in PRs (Circle)CI
- Fix eval_accumulation_steps leading to incorrect metrics
-
📝 Add parameter names to code examples in README - [docs] Performance docs tidy up, part 1
- add gradient checkpointing for distilbert
- Fix typo in LocalAgent
-
Docs: add
kwargstype to fix formatting - [DOC] Clarify relationshi load_best_model_at_end and save_total_limit
- Llama: add RoPE scaling
huggingface/accelerate
9 pull requests
- Skip tests when bnb isn't available
- Fixes for issue #1683: failed to run accelerate config in colab
- Move mixed precision wrapping ahead of DDP/FSDP wrapping
- Add Ascend NPU accelerator support
- Add offload for 8-bit model
- [docs] project_dir parameter update
- Fix launcher validation
- Improve quality errors
- Deepcopy on Accelerator to return self







