6,157 questions
0
votes
0
answers
48
views
Selective Inference on Ordinal Clustering
I've been using an ordered stereotype (OSM) approach to ordinal clustering with the R library 'clustord'
clustord is very well-documented with step-by step vignette. Therefore, to execute row ...
0
votes
0
answers
37
views
How to avoid overmerging with mclust, and failure to reproduce clustering?
I have been working with mclust, and have encountered issues that I can't find an obvious reason for. My main concern is that the threshold for multiple components to be found seems really high, and I ...
0
votes
0
answers
25
views
How to programmatically handle container partition redistribution in GridDB cluster after node failure?
Question
GridDB Container Partition Recovery After Node Failure
I'm working with a 3-node GridDB cluster and need to implement automatic recovery logic when one node fails. My application creates ...
0
votes
0
answers
44
views
Changing post and line colour in deg patterns cluster figures
I have had cluster plots produced for some RNA Seq time course data using the LRT analysis. I believe the plots are produced using the command:
clusters <- degPatterns(cluster_rlog, metadata = meta,...
5
votes
3
answers
237
views
Efficiently group rows within tolerance for multiple numeric columns
I'm trying to group rows that have values within specific error/tolerance.
Input looks like this:
input <- data.frame(Row_number = 1:22,
Name = c(rep("A",6), rep("...
0
votes
0
answers
27
views
Conditional logistic regression with robust standard errors for data matched with replacement
I am working with matched case-control data that used risk-set sampling with replacement (a control can be matched to more than one case). I am trying to figure out the correct syntax for conditional ...
1
vote
0
answers
57
views
Unsupervised Time Series Segmentation Without Predefined Number of Segments
I'm working with time series data where I need to identify distinct segments without prior knowledge of how many segments exist. The data looks like:
I've tried the claspy and ruptures packages, but ...
2
votes
1
answer
148
views
How to dynamically partition a 2D array into boxes based on inverse area density?
Context:
I have a 2D array (size N x M), let's call it U, where each cell contains a non-negative value K ≥ 0 representing a "density" at that point. I want to algorithmically divide the ...
0
votes
1
answer
39
views
Spatial clustering with two separate datasets
I'm hoping to get some advice on approaching a clustering problem. I have two separate spatial datasets, being real data and modelled data. The real data contains a binary output (0,1), which is ...
1
vote
1
answer
109
views
How to set minimum-maximum load constraint in Google Route Optimization API
I'm using Google RO API to create clusters. There is a capacity constraint on the clusters and the clusters should not overlap with each other. To do this, I've set the load demand of each shipment to ...
0
votes
1
answer
134
views
DragonFly benchmark: slow on Cluster
I need help regarding dragonfly db, particularly benchmarking.
So here is the story, I tried benchmarking dragonfly as a cache to replace redis. I got the expected result when testing single node; it ...
3
votes
5
answers
151
views
Combine connected list elements to form distinct list elements
I need to combine interconnected list elements to form distinct elements in base R with no additional packages required (while removing NA and zero-length elements).
Edit: I look for flexibility of ...
1
vote
1
answer
125
views
Capacitated Clustering using Google Route Optimization API
Fixed sized clusters
I need help with a capacitated clustering task. I have 400 locations (the number can vary each time), and I need to create fixed-size clusters (e.g., 40 locations per cluster). ...
0
votes
0
answers
23
views
Cluster lat/lon values based on values
I'm trying to cluster values from a map in Python (these values could be income, kindness towards dogs or amount of penguins in supermarkets, for me the values are floats) from different data sources. ...
0
votes
0
answers
49
views
Finding subclusters of a specific cluster
I performed HDBSCAN Clustering
hdbscan_clusterer = hdbscan.HDBSCAN(min_cluster_size=200)
df['Cluster'] = hdbscan_clusterer.fit_predict(data_matrix_for_clustering)
Now, I’m interested in getting the ...