Advice
2 votes
3 replies
76 views

Context: I am developing a high-stakes application that uses Byzantine State Machine Replication server-side, distributing a copy of the application state to multiple independent servers in such a way ...
Matteo Monti's user avatar
  • 9,150
0 votes
0 answers
80 views

I have a Ray cluster that I start with these commands: Scheduler: nohup ray start --head \ --node-ip-address=<HOST_IP> \ --port=6379 \ --resources='{"is_worker": 1}' --...
cuneyttyler's user avatar
  • 1,424
3 votes
1 answer
89 views

I am trying to set up a distributed environment for NLP processing. I use Ray and Python on GCP. I have a master and several workers. What happens is that when I run 1 worker or 8 workers, it takes ...
cuneyttyler's user avatar
  • 1,424
1 vote
0 answers
37 views

I have a minute-level K-line table with a large amount of data, partitioned by date and stock code using value partitioning. Now I need to calculate a daily factor - computing the return skewness for ...
Jane's user avatar
  • 59
Advice
0 votes
2 replies
90 views

What's the best way to allow programs to discover each other on the network? Let's say we are writing a system that tracks the usage of computers over the network. We have an agent program that sends ...
Isembart's user avatar
Advice
2 votes
2 replies
91 views

I am working on a global PDE problem that is solved using a standard domain-decomposition strategy (e.g., Scotch, METIS). This part of the computation is well balanced across all MPI processes. ...
hrx71's user avatar
  • 1
0 votes
1 answer
48 views

I have a base table A and a result table B in DolphinDB. Table B was initially empty and is used to store calculated results based on table A. When trying to insert the calculated results into table B,...
RORO's user avatar
  • 1
0 votes
0 answers
237 views

Environment: Ray version: 2.x vLLM version: 0.9.2 Python version: 3.9 OS / Container base: Linux (CentOS-based UBI8 in Kubernetes) Cloud / Infrastructure: AWS based Kubernetes cluster (pods scheduled ...
NullUser's user avatar
3 votes
1 answer
152 views

I’m working with Apache Ignite 2.17.0. I load database tables into Ignite caches and run SQL queries using the SqlFieldsQuery API. Recently, I modified the cache configuration for some tables to use ...
kushal Baldev's user avatar
0 votes
0 answers
67 views

I have the following code to test. I created a table on worker 1. Then I tried to read the table on worker 2 and got TABLE_OR_VIEW_NOT_FOUND. Worker 2 is on the same computer as Master. I ran the ...
Rick C. Ferreira's user avatar
3 votes
2 answers
357 views

I'm working with Ray async actors and I want to understand exactly what happens—at a deep technical level—when a synchronous method is called on such an actor. I know that calling a synchronous method ...
hegash's user avatar
  • 893
0 votes
0 answers
211 views

I'm trying to set up a multi-machine communication environment using MS-MPI on two Windows 11 laptops, but I'm encountering some issues. Here are the details of my setup: Environment Details: ...
user29094781's user avatar
1 vote
1 answer
193 views

I have a Spark DataFrame created from a Delta table, with one column of type STRUCT(JSON). For each row in this DataFrame, I need to make a REST API call using the JSON payload in the column. ...
uds0128's user avatar
  • 53
0 votes
1 answer
1k views

The problem I am facing is that my "used" memory is only around 16 GB; however, the cached memory takes up so much space that I am forced to use a compute with more memory (64 GB). So I ...
Manav Karthikeyan's user avatar
1 vote
0 answers
111 views

I am training a model using TensorFlow 2.18.0 with the tf.distribute.MirroredStrategy across two GPUs. The training works fine on a single GPU, but when I try to run it on two GPUs, it ends with a ...
TGD's user avatar
  • 56
