Newest 'embedding' Questions

0 votes

0 answers

68 views

Low cosine similarity score when using Langchain FAISS wrapper and CLIP Model

I am currently using the Langchain wrapper for the FAISS vector store together with a CLIP model. Since the Langchain FAISS wrapper supports only text embedding models, I have implemented my own embedding class (using ...

Aashishkumar

21

asked Apr 7 at 19:05

Tooling

0 votes

2 replies

68 views

Speaker identification/ IVR detection in Indian telephony-calls along with real-time speaker count

I’m working on a Python pipeline for Indian telephony call recordings (mostly 8kHz mono, noisy, multilingual, and with mixed accents). I need to: estimate how many speakers are in the call; run speaker ...

Nandini Singhal

1

asked Mar 17 at 11:56

Best practices

0 votes

0 replies

57 views

Best way to generate embeddings for structured product attributes in B2B ecommerce search?

I am building a B2B product search system using vector embeddings and would like advice specifically on how to generate embeddings for structured product attributes. Context: Domain: B2B ecommerce ...

Sumit Raj

21

asked Feb 4 at 6:04

0 votes

0 answers

462 views

Problems using local OLLAMA during embedding to ChromaDB

I am running a Python script on a Linux VM without CUDA support and a local OLLAMA. When checking OLLAMA on the console with ollama run jina/jina-embeddings-v2-base-de "Hello world" the ...

Lars Hinrichsen

63

asked Jan 4 at 11:00

0 votes

0 answers

985 views

How to fix EOF Server Ollama Error when embedding

I built a RAG system in Laravel with the Neuron AI framework. I tried to store docs in the vector store but got an error: [2025-12-04 06:49:45] local.ERROR: Server error: `POST http://localhost:11434/api/...

Leslie Joe

335

asked Dec 4, 2025 at 6:59

2 votes

4 answers

155 views

Loading many PyTorch .pt files from Google Drive in Google Colab is extremely slow

I am storing my ESM protein embeddings as individual .pt files in Google Drive (about 46,000 files, total size ~182 MB). When I try to load them in Google Colab, the process is extremely slow (only ~2 ...

Max__08

21

asked Nov 27, 2025 at 8:59

3 votes

2 answers

218 views

Multimodal embedding requires video first, then image - why?

I am working with the OmniEmbed model (https://huggingface.co/Tevatron/OmniEmbed-v0.1), which is built on Qwen2.5 7B. My goal is to get a multimodal embedding for images and videos. I have the following ...

n_arch

76

asked Oct 2, 2025 at 15:07

-1 votes

1 answer

86 views

How to reconstruct sentences from mean-pooled embeddings (embedding inversion) [closed]

I’m working on a research problem where I want to reconstruct or paraphrase sentences starting from synthetic embeddings. The embeddings are global (mean-pooled), not token-level, so they lose ...

melissa mattos

1

asked Sep 30, 2025 at 0:02

1 vote

2 answers

664 views

How can I match the token count used by BGE-M3 embedding model before embedding?

For my particular project, it would be very helpful to know how many tokens the BGE-M3 embedding model would break a string down into before I embed the text. I could embed the string and count the ...

ManBearPigeon

13

asked Sep 2, 2025 at 18:38

0 votes

0 answers

62 views

itext 5.5 add mp4 video

Is it possible to add mp4 videos to iText 5.5 and let them play upon showing? What I tried: final PdfPCell cell = new PdfPCell(); final Chunk chunk = new Chunk(); chunk.setAnnotation(PdfAnnotation ...

Mani Afschar Yazdi

43

asked Jun 11, 2025 at 5:43

1 vote

1 answer

325 views

Rapidfuzz giving no matches but Fuzzywuzzy does

I have been developing a matching system which matches the rows of the client and our central database depending on similarity. I have used a hybrid approach where I needed to somehow map the Company, ...

Prabhjit Singh

21

asked Jun 4, 2025 at 6:21

0 votes

0 answers

185 views

Embed Android.Views.View in .net maui

I'm migrating a project from native Android to .NET MAUI by replacing native forms with embedded MAUI views (following the embedding method described in this MSDN page). However, now I'm facing a ...

Jesoo

104

asked May 6, 2025 at 10:04

0 votes

0 answers

601 views

Qdrant 400 Bad Request Error When Inserting Multi-Vector Embeddings with Larger Batch Sizes via API

Context: I am working on a semantic search application and using Qdrant to store three types of embeddings per document: dense embeddings (from OpenAI), sparse embeddings (from Qdrant/BM25), rerank ...

Himanshu Gupta

1

asked Apr 28, 2025 at 9:15

1 vote

1 answer

231 views

Spring-AI embedding, cannot search result by method 'similaritySearching' [closed]

I'm using Spring AI to build an app. The stack I chose is JDK 21, spring-boot 3.3.11, spring-ai 1.0.0M6. For both the embedding and chat models, I chose to use Ollama, thus the spring-ai-ollama-spring-...

xiong

19

asked Apr 15, 2025 at 12:39

0 votes

1 answer

187 views

How to validate search terms when using embedding to look for objects in images

I have a search on my site that does both traditional full-text search and searches using embeddings. So, for example, when you search 'red balloon' I want both the text and image results. The problem ...

Frank Conry

2,758

asked Apr 5, 2025 at 2:15
