huggingface / transformers Public

Fork 27.3k
Star 136k

Issues: huggingface/transformers

990 Open 15,352 Closed

Issues list

Default arguments in DebertaConfig disable relative attention, contrary to the docs and deberta-base bug

#35335 opened Dec 19, 2024 by bauwenst

4 tasks

AllAboardBertweetModel New model

#35333 opened Dec 19, 2024 by CharC0de

2 tasks done

DeBERTa's DisentangledSelfAttention hardcodes float dtype, which causes bfloat16 overflow error bug

#35332 opened Dec 19, 2024 by bauwenst

2 of 4 tasks

MPI environment variables are not set. bug

#35331 opened Dec 18, 2024 by fabiogeraci

2 of 4 tasks

tokenizer decode decode with timestamp fails for extended vocabulary bug

#35330 opened Dec 18, 2024 by bnestor

2 of 4 tasks

InternVL is ExecuTorch Compatible ExecuTorch Feature request

#35327 opened Dec 18, 2024 by guangy10

unable to convert llama 3.3 weights to hf.py bug

#35326 opened Dec 18, 2024 by AshishMulupuri

1 of 4 tasks

Deepseek v2 New model

#35317 opened Dec 18, 2024 by VladOS95-cyber

2 tasks done

train_new_from_iterator() does not work when pre_tokenizer is null bug

#35315 opened Dec 18, 2024 by cecheta

1 of 4 tasks

Unclear what happens when using torchrun, multi-gpu and trainer arguments. bug

#35311 opened Dec 17, 2024 by davies-w

2 of 4 tasks

Multi-GPU training crashes with IterableDataset and different length input (e.g. Next token prediction) bug

#35308 opened Dec 17, 2024 by avishaiElmakies

2 of 4 tasks

[Question] Why doesn't trainer.state.epoch fall round after training? trainer

#35298 opened Dec 16, 2024 by qgallouedec

Custom 4D tensor caused shape mismatch error bug

#35290 opened Dec 16, 2024 by fingertap

1 of 4 tasks

version 4.47.0 provides different generation results when using quantized awq model bug

#35286 opened Dec 16, 2024 by xin3he

2 of 4 tasks

Request to add D-FINE contributions-welcome Good Second Issue New model Vision

#35283 opened Dec 15, 2024 by brockt96

1 of 2 tasks

Qwen2vl support for GGUF Feature request

#35282 opened Dec 15, 2024 by cheald

Vision models don't work for non-square object bug Vision

#35280 opened Dec 15, 2024 by liujilei156231

2 of 4 tasks

KeyError: 'intern_vit_6b' bug Multimodal

#35279 opened Dec 15, 2024 by Sweewangyu

1 of 4 tasks

microsoft/Phi-3.5-mini-instruct not working with FA2 due to position_ids bug

#35274 opened Dec 14, 2024 by BramVanroy

4 tasks

Strange behavior with attn_implementation="eager" bug Multimodal

#35270 opened Dec 14, 2024 by pspdada

2 of 4 tasks

inconsistent execution time bug

#35265 opened Dec 13, 2024 by pure-rgb

2 of 4 tasks

RuntimeError: "rshift_cuda" not implemented for 'Half' bug

#35256 opened Dec 13, 2024 by davidray222

2 of 4 tasks

Mismatch Between txt img_token and Image Count in Multimodal Processor Causes Debugging Feature request Multimodal

#35254 opened Dec 13, 2024 by jp1924

xpu: parallelize() not supported for PyTorch XPU backend

#35252 opened Dec 13, 2024 by dvrogozh

4 tasks

#35248 opened Dec 12, 2024 by nosu3380
