6
votes
Accepted
Database structure for word co-occurrence frequencies in a large corpus
Do I need related SQL tables (e.g. with the book metadata in one table and word data in another)?
A separate table for words can be used, but it is probably not necessary. A "word" is identified by all ...
5
votes
Divergent Bodies of Truth, Relationships of Concepts
What you're describing is the task of cluster analysis. The goal is to find distinct clusters in data where the elements in the cluster are correlated. The attributes of the data form the "context" ...
3
votes
Accepted
Divergent Bodies of Truth, Relationships of Concepts
This is a fairly broad question. It's not clear to me if you're looking for algorithms that find the potential tags, or algorithms that can deal with the fact that the universe is not black or ...
2
votes
Its possible to distinguish between functional and non functional software requirements by computer software?
Non-functional requirements cannot reliably be identified based on the vocabulary alone, without a semantic analysis.
Whatever terms or group of terms you may find, I will be able to find a ...
1
vote
Accepted
Is there a way to encode binary into natural language?
So Merriam-Webster has some 470,000 English words. More than enough for this task.
Take some subset of those words and arrange them in a list indexed from 0 to N; try to make N = 2^K - 1 to make this easy.
For ...
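A minimal sketch of the indexing idea in the excerpt above, assuming a word list of exactly 2^K entries so that every K-bit chunk maps to one word. The four-word list, the function names `bytes_to_words` and `words_to_bytes`, and the zero-padding of the final chunk are illustrative assumptions, not part of the original answer.

```python
# Hypothetical toy word list with 2^K entries (K = 2); a real list would be far larger.
WORDS = ["apple", "bread", "cloud", "delta"]


def bytes_to_words(data: bytes, words: list[str], k: int) -> list[str]:
    """Encode bytes as words by reading the bit stream k bits at a time."""
    if len(words) < 2 ** k:
        raise ValueError("word list must contain at least 2**k words")
    bits = "".join(f"{byte:08b}" for byte in data)
    # Assumption: pad the last chunk with zero bits so the length divides by k.
    bits += "0" * (-len(bits) % k)
    return [words[int(bits[i:i + k], 2)] for i in range(0, len(bits), k)]


def words_to_bytes(encoded: list[str], words: list[str], k: int, n_bytes: int) -> bytes:
    """Decode words back to bytes; n_bytes tells us how much padding to drop."""
    index = {word: i for i, word in enumerate(words[: 2 ** k])}
    bits = "".join(f"{index[word]:0{k}b}" for word in encoded)
    bits = bits[: n_bytes * 8]  # discard the padding bits added by the encoder
    return bytes(int(bits[i:i + 8], 2) for i in range(0, len(bits), 8))


if __name__ == "__main__":
    message = b"hi"
    encoded = bytes_to_words(message, WORDS, k=2)
    print(" ".join(encoded))
    print(words_to_bytes(encoded, WORDS, k=2, n_bytes=len(message)))
```

With a larger list (for example 2^16 words), each word would carry 16 bits, so the encoded text gets shorter as K grows.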
1
vote
Designing the schema for a database of Spanish language words?
Analysing your statements:
There is a common part of properties to all the words
In this common part, the part of speech acts like a type : the word may have additional properties depending on ...
1
vote
Accepted
What approaches can I take to figure out the "relevancy" of certain terms in a string?
Have a look at the Microsoft LUIS offering
https://azure.microsoft.com/en-us/services/cognitive-services/language-understanding-intelligent-service/
I believe Amazon has a similar API offering as ...
Only top scored, non community-wiki answers of a minimum length are eligible
Related Tags
natural-language-processing × 42
algorithms × 11
machine-learning × 9
artificial-intelligence × 7
text-processing × 6
parsing × 5
search × 4
database × 3
search-engine × 3
design × 2
c# × 2
architecture × 2
data-structures × 2
terminology × 2
requirements × 2
strings × 2
semantics × 2
dictionary × 2
java × 1
object-oriented × 1
c++ × 1
python × 1
web-development × 1
testing × 1
programming-languages × 1