Skip to main content

Questions tagged [natural-language-processing]

Natural language processing (NLP)

-1 votes
0 answers
23 views

Looking for readily available pragmatic approaches to topic modeling?

Ditto I am working on language sentimental analysis over quite a few compartments. And I am wondering that if they are reusable supervised neural networks. That work in transient with other nlp ...
wyvernghost's user avatar
0 votes
0 answers
53 views

string matching algorithm question for matching approximately similar names between two lists

The focus of this question is on natural language processing, specifically matching names between 2 lists. I am looking at employees that work in the same organization, however I obtained data from ...
krishnab's user avatar
  • 171
0 votes
3 answers
171 views

Does there exist a decent Python-to-English converter (abstractly, i.e. may not be coded on Earth) that is round-trip?

I was wondering how much Python can be converted in as close as possible a near 1-1 correspondence with English statements? And how one could go about writing one given that they are able to examine a ...
Daniel Donnelly's user avatar
1 vote
1 answer
87 views

bijection map of LLM

I just want to know: could we built a bridge or bijection mapping for AI agent between Prompt and Response? In other words, AI ...
YuzheCheung's user avatar
0 votes
1 answer
77 views

Forming Sentences

I have a bunch of transcripts from online videos but I suspect the transcripts aren't formatted well. There are no punctuations, sentences are broken abruptly, some words aren't complete (for example ...
Mr.A's user avatar
  • 1
1 vote
1 answer
93 views

Generate product description from product specifications

I am looking for a python NLP library that can generate a proper product description based on product features provided to it. Till now, i have tried transformers library and this is the code: ...
FaisalShakeel's user avatar
1 vote
1 answer
102 views

Is the fact true that GPT-3 wasn't trained on any explicitly gathered code dataset?

The paper mentions a line about GPT-3's training dataset - ...
Deepak Ahire's user avatar
0 votes
1 answer
73 views

Strange notation in the article about sparse self-attention

While reading an article devoted to the sparse self-attention, I came across a notation that was not very clear: $$ Attend(X, S) = \Big( a(x_i, S_i) \Big)_{i∈{\{1,...,n}\}} $$ What means $\Big( \space ...
b1ackf0x's user avatar
1 vote
0 answers
76 views

Are there known algorithms for generating abbreviations from a (sequence) of words (similar to hyphenation)

I'm currently developing a custom UI layout library. At the moment a user of that library should supply strings for UI labels in a format that allow for subsequent shortening of those labels, so that ...
datenwolf's user avatar
  • 111
0 votes
0 answers
69 views

n-gram model: why conditioning on the start symbol <s>?

from the book, Speech and Language Processing (3rd ed. draft) I understand conditioning on <s> will give context about the first word. From the formula above, ...
Izana's user avatar
  • 131
0 votes
2 answers
227 views

ML model to pick to classify data with inputs as an array of strings?

I have a medical dataset, which is something like this ...
Ashrjz's user avatar
  • 1
1 vote
1 answer
96 views

How can a Machine Learning model predict this classification problem?

I am pretty new to ML and have a basic but fundamental question. Let’s imagine we want to create a simple Sentiment Analysis model using Machine Learning not Deep Learning algorithms, so we need to ...
Z Bokaee's user avatar
0 votes
1 answer
150 views

If Large Language Models like ChatGPT grow, do also their problems grow?

In a video Noam Chomsky said, that if these LLM get bigger, than also the things they are not good at get bigger too. He doesn't explain more details about this. So is this true? In what way do their ...
JonnyPython's user avatar
1 vote
0 answers
88 views

How should one interpret the minhash error rate?

On Wikipedia the error rate for the minhash algorithm vs baseline jaccard similarity is stated as 1/sqrt(1/hashes). But, how should one interpret this number in a ...
micimize's user avatar
  • 111
1 vote
1 answer
119 views

Resources to learn NLP

I am an undergraduate student in mathematics. I have a fair bit of experience with deep learning in computer vision research and am willing to dabble into Natural Language Processing (NLP). I hope ...
The Limit Does Not Exist's user avatar

15 30 50 per page
1
2 3 4 5
14