Product Documentation
Cadence Job Monitor Reference
Product Version IC23.1, June 2023

A


Understanding Clusters and Queues

A cluster is a set of hosts working together to balance the job load. Each cluster is controlled by a daemon process called cdsqmgr. Jobs from different applications are submitted to the cdsqmgr, which sends the jobs to the hosts in the cluster.

Following are the best practices for farm software:

The information in this chapter is applicable only to Default-LBS. When you run jobs using LSF-LBS, cdsqmgr is not used. For more information about the daemon process involved in LSF, see LSF documentation available at www.platform.com.

To start cdsqmgr:

When using ssh to start cdsqmgr, access must be enabled using the fully qualified domain name for each host, otherwise ssh will not be able to resolve the name to the correct IP address.

How Applications Connect to cdsqmgr

You can set the LBS_CLUSTER_MASTER environment variable to control the cdsqmgr to which your application connects. This variable should be set to the name of the host on which the cdsqmgr resides. This host is also known as the cluster master. The default cluster master is the local host.

The following logic determines how applications connect to the cdsqmgr.

Applications do not have to code this logic; linking to the LBS client-side libraries or using the LBS JAVA class files is sufficient.
  1. The application can connect to cdsqmgr residing on the cluster master using login_name. login_name is the login name of the person who attempts to launch the application.
  2. If there is no instance of cdsqmgr running as login_name, the application attempts to connect to cdsqmgr running as root on the cluster master.
  3. If there is no instance of cdsqmgr running as root, an instance of cdsqmgr is automatically started up on the cluster master, and the application connects to it. Because cdsqmgr was started using the login name of the person who attempts to launch the application, it continues to run as login_name.
    Because cdsqmgr is started automatically, a configuration file cannot be specified. In this case, only the DEFAULT queue is considered to have been configured in the cluster.

To balance loads across all the users in a cluster, the person who is logged in as root must start up cdsqmgr on a known cluster master. The users should set the LBS_CLUSTER_MASTER environment variable to this cluster master. They will then connect to the same cdsqmgr, which will balance the load across all users’ jobs.

If each user were to connect to a separate cdsqmgr, the load would be balanced only across each user’s jobs.


Return to top
 ⠀
X