Questions tagged [cluster]
discussion related to cluster mechanisms.
219 questions
1
vote
0
answers
27
views
Ocfs2: link between cluster and device?
I am having 2 servers (Debian 12) that use a storage-disk (SD).
Both see this SD as a device via fdisk. I have no details about the storage-device itself or the connection type - for me it is just a ...
0
votes
0
answers
51
views
Adding a New Server to Existing Proxmox Cluster - Network Configuration and VM Communication
I’m looking for some guidance on expanding my Proxmox setup. Here’s my current setup and what I’m trying to achieve:
Current Setup
I have a dedicated OVH server running Proxmox.
On this server, I ...
0
votes
0
answers
34
views
Disable read-ahead caching for GFS2 Logical Volume
I have 10 node deployment which implement red hat clustering software - pacemaker/corosync to mount gfs2 and ensure high-availability. Nodes are actually mail servers and use gfs2 to store user's data ...
0
votes
0
answers
31
views
How to set new features to N during kernel compilation from an old .config file?
I am compiling a custom linux kernel for a compute cluster. The cluster is currently running on kernel version 4.4.47 since last 5 years. I need to upgrade the kernel to a more recent version. I've ...
1
vote
1
answer
399
views
Need a method for managing systemd services across multiple hosts
I have six Linux servers running RHEL 8.6 - and need to ensure that one specific service is running at least one and at most one of those six servers.
Does systemd support something like this?
If not, ...
1
vote
0
answers
23
views
Script/Daemon to kill specific resource-consuming tools?
I'm working on a SGE linux cluster and beginners often run memory/resource consuming tools on the login node instead of using qsub or qlogin ( https://gridscheduler.sourceforge.net/htmlman/htmlman1/...
1
vote
1
answer
132
views
qsub-like behavior for a slurm cluster
I recently switched to slurm and looking for a job submission tool, that behaves similar to qsub:
It takes input through a pipe
It prints the output to stdout
Example:
for n in `seq 1 10`; do
...
0
votes
2
answers
555
views
Unable to install Slurm on PC
I am trying to install slurm on Ubuntu PC. Therefore, I followed the instructions given over here
I did the following -
sudo apt update -y
sudo apt install slurmd slurmctld -y
mkdir sudo /etc/slurm-...
1
vote
0
answers
62
views
Shell script looking for a missing module
I want to run a shell script on a compute cluster but I get an error because at some point it is looking for a module that does not exist since a major update on the cluster a few months ago. This ...
0
votes
1
answer
60
views
Running arbitrary binary program with cluster computers
I have 3 VPS. Let's say master, slave1, slave2.
Their specifications are identic.
Processor: 1CPU
Memory: 1GB
Disk: 10GB
Network: running on LAN each other
I expect any arbitrary binary program (...
1
vote
1
answer
174
views
Can I fully utilize HDR Infiniband network throughput between servers and NFS volume?
I'm working on a project building a CPU cluster, and those servers and NFS storage (not a parallel file system) are going to be connected through HDR InfiniBand cables. In this architecture, can I get ...
1
vote
1
answer
56
views
Unable to run linpack on head node of cluster
I recently set up my own home cluster - 4 units of raspberry pi. But I am having problems trying to benchmark all 4 units using Linpack
One node is the head node called rpislave1, it connects to the ...
2
votes
1
answer
363
views
How to set up a bunch of linux servers with shared file system without using job scheduler?
I am managing multiple GPU servers in our lab, which are mainly used for deep learning tasks. We would like these machines to share the same file system, so it is easier to switch between them.
...
0
votes
0
answers
80
views
Proper way to design filesystems structure for a cluster of diskless nodes
I'm trying to learn the basics of Linux clustering so I started designing a really humble cluster:
6 worker nodes (Libre Computer La Frite | Cortex-A53 @ 1.2 GHz | 1GB RAM)
1 master node (Raspberry ...
0
votes
1
answer
271
views
Remove internet access without losing LAN
I have a small cluster (all nodes run Debian 10) and need to remove the internet connections of all slave nodes. The internet cable connection connects to a computer that acts as a firewall, then, ...