Skip to main content

NLP Collective

A collective focused on NLP (natural language processing), the transformation or extraction of useful information from natural language data.
38.5k Questions
+5
10.4k Members
+55
Contact

Pinned content

View all 2 collections

NLP admins have deemed these posts noteworthy.

Pinned
9 votes
2k views
Collection

Natural Language Processing FAQ

Frequently asked questions relating to NLP. Many of these may be questions that are often asked over and over, duplicates would likely be closed in favor of these. Add the best answer (using the ...
Berthold's user avatar
  • 101

Can you answer these questions?

View all unanswered questions

These questions still don't have an answer

0 votes
0 answers
7 views

Vertex AI TEI Deployment Fails for Private Hugging Face Model - "Could not download model artifacts"

I'm trying to deploy a Hugging Face model to Vertex AI using the Text Embeddings Inference (TEI) workflow, but I'm getting consistent errors during deployment. This same deployment approach worked for ...
0 votes
0 answers
8 views

Language Model Evaluation with Custom Task - Hugging Face Lighteval

I am creating a benchmark to evaluate a language model. First, I generated the dataset that I'm gonna prompt the Language model with. Subsequently, I tried to evaluate any tiny language model just to ...
1 vote
0 answers
13 views

Fine-tuned LLaMA 2–7B with QLoRA, but reloading fails: missing 4bit metadata. Likely saved after LoRA+resize. Need proper 4bit save method

I’ve been working on fine-tuning LLaMA 2–7B using QLoRA with bitsandbytes 4-bit quantization and ran into a weird issue. I did adaptive pretraining on Arabic data with a custom tokenizer (vocab size ~...
-1 votes
0 answers
22 views

ChunkedEncodingError: ('Connection broken: IncompleteRead(6182 bytes read, 4058 more expected)', IncompleteRead(6182 bytes read, 4058 more expected))

I wanna fetch the data from "https://www.yellowpages.co.th/" and store as a pd dataframe. import time def retry_fetching(soup, times=3, string="h1[class='typ-profile-title ...
0 votes
0 answers
21 views

Marianmt fine-tunning for translation. seq2seq trainer stucked

I'm fine-tuning a MarianMT model for a translation task using Hugging Face's Seq2SeqTrainer. Everything works fine until I add early stopping — then the training crashes silently, without any error ...