- Platform Roll
- Platform OCS LSF HPC
- Platform Lava
- Intel® Software Tools Roll
- CluMon Roll
- PVFS2 Roll
- Modules Roll
- Extra Tools Roll
- ntop Roll
- Dell Roll
- Learn About Platform Products
- Get Technical Support
- Copyright and Trademarks
[ Top ]
Platform Roll
The Platform roll includes some useful tools that help make cluster management easier once you have set up your initial cluster.
Add-hosts tool
The add-hosts tool lets you quickly and easily add nodes to your cluster without using the
Insert-Etherscommand. By pre-populating the Platform OCS database from an XML configuration file, you can pre-load all the host information and simply startup your machines to start the compute node installation process.The add-hosts tool is bundled with the Platform OCS software.
How it works
The add-hosts tool allows you to define an XML configuration file that describes the information for each host you want to add to your cluster. For example, when you list your hosts in the configuration file, your host attributes (like IP address, MAC address, and host name) are inserted directly into the Platform OCS database. By populating the database using the add-hosts tool, you no longer need to start your machines in a pre-determined order.
You need to do two things before you use the add-hosts tool:
- Enter the MAC addresses of your nodes into a text file
- Enter your host information into the XML configuration file
The XML configuration file allows you to define information on a host or a subnet (group of hosts) level.
The host section of the configuration file lets you list your hosts and host attributes individually.
The subnet section lets you collect your nodes into a virtual grouping with the same appliance type so that you do not have to list individual hosts. When you specify a beginning IP address and the number of nodes in the subnet section, the add-hosts tool can extrapolate a group of nodes, name them, and deduce their IP addresses. If your IP addresses are not incremental or if you have gaps in your IP addresses, you can also specify any IP addresses to be excluded.
The add-hosts tool processes the <host> and <subnet> sections in the order that they occur in the XML configuration file. The MAC addresses in the MAC address file must be listed in the same order that the <host> and <subnet> sections occur. Also, these MAC addresses must correspond to the physical order of your hosts. The XML configuration file is processed in tandem with the MAC address file. Each host is paired up with the next occurring MAC address in the MAC address file.
Note: The add-hosts tool only loads host information into the database. You may need to perform additional configuration to set up a network (for example, you may need to set up switches to route from one subnet to another).
- Test mode
Run this tool in test mode to verify the your results before actually populating the database and changing any configuration files. To run this tool in test mode, follow the steps below but use the
--testmodeoption of theadd-hostscommand:#add-hosts --testmodeInstalling a new Platform OCS cluster
- Assumptions
- Using the add-hosts tool
The order in this text file must be the same as the physical order of your nodes. Comments (added to the beginning or end of a line, or on their own lines, marked by a "#") can be useful in tracking groups of MAC addresses.
The order of hosts in this file must match the order of MAC addresses in the
mac.txtfile.The skeleton XML configuration file:
<?xml version="1.0"standalone="yes"?> <add-hosts> <mac_addr_file value = file path /> <num_hosts_per_rack value = number /> <order_by_rack value ="yes"|"no"/> <netmask value = address /> <host_num_length value = number /> <host_start_num value ="0"|"1"/> <host> <name value = string /> <ip value = string /> <appliance value = string /> </host> <subnet> <host_prefix value = string /> <host_suffix value = string /> <baseip value = string /> <num_hosts_in_subnet value = number /> <appliance value = string /> <ip_exclude_list value = list of IP addresses /> <restart_numbering value ="yes"|"no"/> <gen_descending_ip value = "yes" | "no" /> </subnet> </add-hosts>
You can have zero or more<host>sections and zero or more<subnet>sections. Note that<host>sections must be defined before<subnet>sections.
Parameter Description Global Variables
mac_addr_file:
The absolute path name to the location of the MAC address text file. By default, located at /opt/rocks/etc/mac.txt.
num_hosts_per_rack:
The number of hosts per rack.
order_by_rack:
"Yes" (default) specifies that you want to order by rack and rank (node-0-0, node-0-1...node-2-4, node-2-5) and "No" indicates that you want to order by rank only (node-1, node-2, node-3).
netmask:
The netmask for this network.
host_num_length (optional):
Specifies the number of digits in your host names. Applies only if order_by_rack value="no". This option is useful for restricting your host names to a fixed length.
For example, if you have 20 hosts with each host using the same prefix:
- If you do not set host_num_length (or set host_num_length=0) and you have 20 hosts in your cluster with each host using the same prefix, your host names will have different lengths, for example, host-0, host-1, ..., host-10, host-11, ...
- If you set host_num_length=2, you can restrict the numeric portion of your host names to a fixed length of 2, for example, host-00, host-01, ..., host-10, host-11, ...
host_start_num (optional):
Specifies whether nodes are numbered beginning with 0 (default) or 1.
Host Section Parameters: All host sections must precede all subnet sections.
host:
Section used to define individual hosts.
name:
The name of the host.
ip:
The IP address of the host.
appliance:
Valid appliance type names with a default Platform OCS installation are:
This list can change depending on the rolls you install with Platform OCS.
Please see PVFS2 meta server if you have a PVFS2 meta server appliance type.
To see your valid appliance types:
A list of appliances is displayed.
Subnet section parameters
subnet:
Section used to define a group of hosts in one <subnet>.
host_prefix:
Host name not including the number that will be assigned. For example, if host_prefix value ="alpha", then the first host is named alpha0-0. Note that a dash is automatically added between the prefix and the assigned number.
host_suffix (optional):
Host name suffix. For example, if host_suffix value ="A", hosts are named charlie0-0A.
base_ip:
The first IP address in this subnet.
num_hosts_in_subnet:
Number of hosts in the subnet.
appliance:
Valid names are the same as for the Host Section Parameter. Each subnet section can only contain one appliance type. Set up different subnets for different appliances types.
ip_exclude_list (optional):
IP addresses to be excluded from the subnet. List IP addresses separated by a space. Use this option if you have gaps in your IP addresses.
restart_numbering:
"Yes" restarts host numbering for the current subnet. "No" (default) continues numbering from the last subnet.
gen_descending_ip:
Specifying "yes" will generate the IP numbers in descending order, and specifying "no" will generate the IP numbers in ascending order. The default value is "no"
- Save your configuration file
/opt/rocks/etc/add-hostsrc.- From the command line, run
add-hostsoradd-hosts --testmodeto test your outcome before making any configuration file changes.- Check
./add-hosts.logto see if you were successful. Each host added successfully has its own line and indicates "SUCCESS" at the end. If you failed to add a host, the line indicates "FAILED".- Perform a PXE startup on your compute nodes. For each host that was added to the Platform OCS database by the add-hosts tool, the compute node installation begins automatically.
- PVFS2 meta server
If you have a PVFS2 meta server appliance type, you must name your hosts as follows:pvfs2-meta-server-0-0. The PVFS2 roll only recognizes PVFS2 meta server hosts with this host name.
Adding a new host to your existing cluster
- Assumptions
- Using the add-hosts tool
- Add the new MAC address to the list in
/opt/rocks/etc/mac.txt. Make sure the order in this text file is the same as the physical order of your nodes.- In
/opt/rocks/etc/add-hostsrc, add a host section or edit a subnet section to include the new host. For example, when adding a host to a subnet, increase the value ofnum_hosts_in_subnetby 1.- Save the configuration file.
- From the command line, run
add-hostsoradd-hosts --testmodeto test your outcome before making any configuration file changes.- An error will occur indicating trouble adding the first host. This error is expected since you have already added this host to your database.
- When prompted, select
allto skip all subsequent errors.- Check
./add-hosts.logto see if you were successful. Each host added successfully has its own line and indicates "SUCCESS" at the end. If you failed to add a host, the line indicates "FAILED".Replacing a host
Once you have run the add-hosts tool, you can easily replace one host with another in the same physical location with the same IP address:
- Replace the old MAC address with the new one in
/opt/rocks/etc/mac.txt.- Remove the old MAC address in the Platform OCS database. For example:
#dbreport ethers | grep 00:c0:9f:45:02:1600:c0:9f:45:02:16 compute-0-3.local #insert-ethers --remove "compute-0-3"- From the command line, run
add-hostsoradd-hosts --testmodeto test your outcome before making any configuration file changes.An error will occur indicating trouble adding the first host. This error is expected since you have already added this host to your database.
- When prompted, select
allto skip all subsequent errors.- Check
./add-hosts.logto see if you were successful. Each host added successfully has its own line and indicates "SUCCESS" at the end. If you failed to add a host, the line indicates "FAILED".Upgrading your Platform OCS installation
If you are upgrading your Platform OCS installation, you can transfer your existing host configuration to the new installation.
- Before upgrading the front end, copy
/opt/rocks/etc/add-hostsrcand/opt/rocks/etc/mac.txtonto a disk or shared directory on another file server.- After you have upgraded your front end, copy
/opt/rocks/etc/add-hostsrcand/opt/rocks/etc/mac.txtto/opt/rocks/etc.- Run the add-hosts tool.
- Check
./add-hosts.logto see if you were successful. Each host added successfully has its own line and indicates "SUCCESS" at the end. If you failed to add a host, the line indicates "FAILED".Add/remove roll tool
This command-line tool allows you to dynamically add or remove a roll on the front end. You can also use this tool to specify if you want a roll to install only on the front end and not on all compute nodes. Once you have added or removed a roll on the front end, you must reinstall the compute nodes.
Requirements
Limitations
- This tool only adds or removes rolls supported by Platform Computing.
- This tool does not prevent you from removing rolls that should not be removed. Check the list below to see which rolls can be safely removed.
- The tool now supports upgrading certain supported rolls.
- Installing 32-bit rolls on a 64-bit front end is not supported at this time.
Platform Lava and Platform LSF HPC cannot run on the same node. The Platform LSF HPC roll will disable Platform Lava
Different versions of the Intel compilers roll cannot be used together. They will overwrite each other.
Rolls you can add and remove
The following is a list of rolls that you can add or remove after you have installed on the front end.
WARNING: If you try to add or remove a roll that is not supported, you could seriously damage or impair your system. Check the list of rolls that can be safely added and removed before using this tool.
Rolls that can be added and removed:
- Intel® Software Tools
- Cisco® Topspin®
- CluMon
- Extra Tools
- intel_mpirt
- Lava
- LSF HPC
- mattool
- ntop
- PVFS2
Rolls that can be added but not removed:
Do not attempt to remove the following rolls. They will not uninstall correctly.
To see a list of installed rolls
In the command line, run the following command:
#rollops -lTo add a new roll
You can add a new roll by either using the DVD or having a copy of the ISO file for the roll you want.
- Make sure you have not already installed the roll. To see a list of installed rolls, run the following command:
#rollops -l- Add the roll.
- If you have the DVD with the roll you want to add, insert the DVD and run the following command:
#rollops -a- If you have a meta roll, type
rollops -aroll_name. For example:#rollops -a lava- If you have an ISO file, type
rollops -a -iiso_file_name. For example:#rollops -a -i lava-4.0.0-i386-disk1.isoIf you do not specify a roll name and the DVD or ISO contains a meta roll, you will be prompted to select from a list of available rolls.
If you are adding the CluMon roll, you are prompted for your
rootpassword.- The tool gives you a message indicating your success or failure to add the new roll. For example:
The lava roll was added successfully.- If you have added any of the following rolls, you must restart your front end:
- Reinstall your compute nodes.
To remove a roll
To remove a roll from the front end, type
rollops -e<roll_name>. For example:#rollops -e lavaIf you are removing the CluMon roll, you are prompted for your root password.
The tool gives you a message indicating your success or failure to remove the roll. For example:
The lava roll has been sucessfully removed.
If you have removed this roll on your front end node, you must reinstall your compute nodes for the removal to take effect.
To upgrade a roll
To upgrade a roll in the front end, type
rollops -u<roll_name>. For example:#rollops -u dellIf you have the media with the roll you want to upgrade, insert the media and run the following command without specifying the roll name:
#rollops -uNote: If you have a meta roll, you will be prompted with a list of the rolls found in the meta roll, if available. Specify the name of the roll you want to upgrade, as listed in the meta roll.
If you have an ISO image, type
rollops -u<optional_roll_name>-i<iso_file_name>. For example:#rollops -u -i dell-4.1.1-0.x86_64.disk1.isoNote: If you specified a meta roll ISO image and you do not specify a roll name, you will be prompted with a list of the rolls found in the meta roll, if available.
Only the Dell roll supports upgrading at this time.
The tool gives you a message indicating your success or failure to add the new roll. For example:
The dell roll was added successfully.
If you upgraded the Dell roll, you must restart your front end node.
If you upgraded the Dell roll on your front end node, you must reinstall your compute nodes for them to have the latest version installed.
To prevent a roll from installing on compute nodes
You can choose to install a roll on the front end only and not on the compute nodes.
Typerollops -r<roll_name>-p no. For example: #rollops -r lava -p noWhen you install on your compute nodes, the roll for which you specified
-p nois not installed.Patch management tool
The patch management tool lets you update your cluster with new packages from your operating system's update network. With this tool, you no longer have to reinstall your whole cluster to get the latest patches and enhancements for your OS.
You may use the patch management tool regardless of whether you have a central install server or not.
- With the add-hosts tool
If you used the add-hosts tool to pre-populate the Platform OCS database with your host information, you can transfer your existing host configuration to the new installation. See Upgrading your Platform OCS installation before you use the patch management tool to ensure you save your host configuration.
How it works
The patch management tool checks for updates and enhancements to your operating system available from your operating system's update network, directs the download of the updates, and tracks the version that you are working with. Any appliance is patched as long as the packages on the appliance exist on both the front end and appliance type itself.
Requirements
- For upgrades from Red Hat Network with Platform OCS Enterprise Edition, you must have a subscription to Red Hat Network and registration with
up2date.Limitations
- Patching for cross-kickstarted front ends and compute nodes is not supported.
- You can patch any appliance type, but be aware that RPM packages installed on the front end may have dependencies on appliance packages of a specific version. If you break this dependency, your front end may break your appliance.
- If you completely upgrade your front end, any updates made with the Patch Management tool will be overwritten. The version number that tracks your upgrade path will return to the original: 4.1.1-2.0.
- You can not select from the packages to download because some items will have dependencies. Do not deselect any packages when you register with Red Hat.
- Rollbacks are not supported.
- You cannot selectively choose packages to download from a front end installed from a central server.
- The Patch Management tool will not allow you to update kernel-dependent packages.
Downloading an update
Run the following command:
#rocks-update -d<packagename>For Platform OCS Enterprise Edition, if you have not registered with Red Hat Network, you will be prompted to register. Enter your Red Hat Network account information and follow the prompts. Do not de-select any of the packages listed for download. When the registration is complete,
rocks-updatewill download the update required.
rocks-updatedownloads the update to your front end. If there are new packages available, the patch management tool returns the following message indicating the new repository version and that you can proceed to update your appliances.rocks-update: repository for updates is now version 4.1.1-2.0. You can now install/update your compute node or front end appliances.The repository version database is also updated. It does not install the packages on the front end.
Downloading or patching updates from the central server front end
Before downloading or patching updates from the central server front end, change the following settings:
- On the central install server, enable the Apache user access to the
app_globalstable for your new front end using the following commands:#mysql -u root -p clustermysql>GRANT SELECT on cluster.app_globals to apache@<new_front_end_external_ip>where
<new_front_end_external_ip>is the external IP address of the new front end.- On the new front end, edit the
/etc/front end.repofile withviand replace thebaseurlline with the following line:baseurl=http://<central_install_server_ip>/updates/where
<central_install_server_ip>is the IP address of the central server.Follow these steps to download updates from a central server front end:
- Run the following command:
#rocks-update -gThe tool displays the download details and the following message:
rocks-update: repository for updates is now version 4.0.0.1. You can now install/update your compute node or front end appliances.- If you want to reinstall your compute nodes, you may do so now they will have the new updated applied to them.
Follow these steps to patch updates from a central server front end:
- Run the following command:
#rocks-update -fThe tool displays the installed packages and the following message:
rocks-update: 4 Update(s) installed successfully on front end!- If you want to reinstall your compute nodes, you may do so now they will have the new updated applied to them.
Patching a compute node
From
root, runrocks-update -c.The tool checks for new updates and returns a message indicating whether there was a patch available or not. The default is to perform updates on 64 nodes at a time. You can change this default by specifying the number of nodes to update concurrently, up to a maximum of 250 nodes.
For example,
rocks-update -c 128. 128 compute nodes will be patched concurrently.Installing or reinstalling a compute node
Run
insert-ethers --replace=<host_designation>OR
Use the
add-hoststool to add a host. See Adding a new host to your existing cluster and Replacing a host.Listing versions of installed updates
From your front end, run
rocks-update -g.Extend-compute CLI tools
Two new CLI tools have been added to ease the process of customizing your compute nodes. These tools are as follows:
rocks-compute: this tool allows you to customize the RPM packages and lets you add a script to customize the post-installation for compute node installations.custom-partition: this tool allows you to customize therootandswappartition sizes on your compute nodes.Requirements
Using the rocks-compute tool
This tool customizes compute nodes by modifying the
extend-compute.xmlfile. This XML file is located in/export/home/install/site-profiles/4.1.1/nodes. The tool allows you to add, list, or remove RPM packages and post-install scripts.
The extend-compute tool does not handle package dependencies. If you wish to install a new package to your compute nodes you need to keep in mind of any dependencies the package may need. You will need to add all of them using the extend-compute tool.
You may occasionally need to update
extend-compute.xmlmanually to add, remove, or update the contents. User-added changes are preserved as long as they are not affected by operations carried out by therocks-computetool.
- Adding a custom package
You can add your own RPM packages to a compute node installation by running the following command:
#rocks-compute -a -p<path_to_rpm_package>where <path_to_rpm_package> can be an absolute or relative path to the RPM package.
This will add the package to the Platform OCS distribution for your native architecture. If you want to add packages to a distribution for a non-native architecture, run the following command:
#rocks-compute -a -p<path_to_rpm_package>--arch<architecture_type>where <architecture_type> can be
i386,x86_64, oria64.To automatically rebuild your Platform OCS distribution (
rocks-disttree) after adding your package, run the commands above with the-boption.- Removing a custom package
To delete a custom package that you added to your compute nodes, you need to know its ID and the architecture of the Platform OCS distribution from where you want to remove it. Obtain the RPM ID by running the following command:
#rocks-compute -l pRemove the package from a native architecture by running the following command:
#rocks-compute -d -p<rpm_id>Remove the package from a non-native architecture by running the following command:
#rocks-compute -d -p<rpm_id>--arch<architecture_type>To automatically rebuild your Platform OCS distribution (
rocks-disttree) after removing your package, run the commands above with the-boption.- Listing custom packages
To display a list of all the custom packages you added to your compute nodes, run the following command:
#rocks-compute -l pThis command gives the name and ID of each custom package.
- Adding a custom post-install script
In a typical Platform OCS compute node installation, a post-install script executes after all RPM packages are installed. A post-install script allows you to perform custom actions on the system, such as service configuration or executing installation scripts. The specified script must be a valid
bashshell script.To add a script to the Platform OCS distribution for your native architecture, run the following command:
#rocks-compute -a -s<path_to_script>where <path_to_script> can be an absolute or relative path to the post-install script.
To add a script to a Platform OCS distribution for a non-native architecture, run the following command:
#rocks-compute -a -s<path_to_script>--arch<architecture>To add a script to a Platform OCS distribution for an unspecified architecture, run the following command:
#rocks-compute -a -s<path_to_script>--no-arch-scriptTo automatically rebuild your Platform OCS distribution (
rocks-disttree) after adding your script, run the commands above with the-boption.- Removing a custom post-install script
To delete a script, you need to know its ID and the architecture of the Platform OCS distribution from where you want to remove it. Obtain the script ID by running the following command:
#rocks-compute -l sIf you are removing the script for your native architecture, run the following command:
#rocks-compute -d -s<script_id>If you are removing the script for a non-native architecture, run the following command:
#rocks-compute -d -s<script_id>--arch<architecture_type>If you are removing the script that was added to a Platform OCS distribution for an unspecified architecture, run the following command:
#rocks-compute -d -s<script_id>--no-arch-scriptAs with the add operation, you can rebuild your Platform OCS distribution (rocks-dist tree) by running the deletion commands above with the "-b" option.
To automatically rebuild your Platform OCS distribution (
rocks-disttree) after removing a script, run the commands above with the-boption.- Listing the contents of a custom post-install script
To list the contents of a custom post-install script that you added, run the following command:
#rocks-compute -l sYou will be prompted to select one of the scripts. Make a selection by specifying the number next to the post-script ID.
Using the custom-partition tool
This tool customizes the partition sizes for compute nodes by modifying the
extend-auto-partition.xmlfile. This file is located in/export/home/install/site-profiles/4.0.0/nodes.You may occasionally need to update
extend-auto-partition.xmlmanually to add, remove, or update the contents. User-added changes are preserved as long as they are not affected by operations carried out by thecustom-partitiontool.
- Changing root/swap partition sizes
The default
rootpartition size of a Platform OCS cluster is 10 GB (10 000 MB). You can change this size by running the following command:#custom-partition -r<root_partition_size>where <root_partition_size> is specified in MB and must be at least 6 000 MB.
The default
swappartition size of a Platform OCS cluster is 1 GB (1 000 MB). You can change this size by running the following command:#custom-partition -s<swap_partition_size>where <swap_partition_size> is specified in MB and must be at least 1 000 MB.
Note that you can use the
-rand-soptions together in the same command.- Listing current root/swap partition sizes
To list the current
rootandswappartition sizes, run the following command:#custom-partition -l- Restoring default root/swap partition sizes
To restore the root and swap partition sizes to the Platform OCS default values, run the following command:
#custom-partition -d[ Top ]
Platform OCS LSF HPC
Platform OCS LSF HPC® is an optional roll for managing and accelerating High Performance Computing (HPC) mission-critical workload.
With Platform LSF HPC you can intelligently schedule parallel and serial workload providing the capability of solving large, challenging problems while utilizing the available computing resources at maximum capacity.
For more information about Platform LSF HPC, see the Platform Web site: http://www.platform.com/products/HPC.
Product support
For Platform OCS hardware, operating system support, and CD distributions, see Readme for Platform OCS.
Platform LSF HPC will disable Platform Lava. Do not install Platform LSF HPC unless you intend to use it.
If you have already installed Platform LSF HPC, but want to use Platform Lava, you must use the
rollopstool to remove Platform LSF HPC, then re-enable Platform Lava by runningchkconfig -add lavaon all affected nodes. On the front end node, you must also runchkconfig --add lavagui.Configuring and managing Platform OCS LSF HPC
Once you have installed Platform OCS and the LSF HPC roll, get a Platform LSF HPC license and start license manager daemons. Then set up and start your Platform LSF HPC cluster.
Setting up a Platform LSF HPC license
- Decide which machine will be a license server machine.
The following steps use
front end-0as the license server.- Get FLEXlm
hostidUse the
lmhostidcommand on the FLEXlm server host to get thehardware identifier of your FLEXlm license server host. For example:
# lmhostid lmhostid - Copyright (c) 1989-2003 by Macrovision Corporation. All rights reserved. The FLEXlm host ID of this machine is "0006296d1f2c"In this example, send the code "0006296d1f2c" to Platform.
- Contact license@platform.com to get a permanent Platform LSF HPC license.
Send the following information to Platform at license@platform.com:
- When you receive your license file, save it as
/opt/lsfhpc/conf/license.dat.The following is an example of a permanent license:
SERVER front end-0 0006296f1f2c 1700 DAEMON lsf_ld /opt/lsfhpc/6.1/linux2.4-glibc2.3-x86/etc/lsf_ld FEATURE lsf_base lsf_ld 6.000 1-sep-0000 10 CCF7C3C92A5471A12345 "Platform" FEATURE lsf_manager lsf_ld 6.000 1-sep-0000 10 4CF7C37944B023A12345 "Platform" FEATURE lsf_sched_fairshare lsf_ld 6.000 1-sep-0000 10 8CA763A93AC825C12345 "Platform" FEATURE lsf_sched_parallel lsf_ld 6.000 1-sep-0000 10 3C77F30945F7FBC12345 "Platform" FEATURE lsf_sched_preemption lsf_ld 6.000 1-sep-0000 10 3C0733892C1683812345 "Platform" FEATURE lsf_sched_resource_reservation lsf_ld 6.000 1-sep-0000 10 ECD7C369072CA3812345 "Platform" FEATURE platform_hpc lsf_ld 6.000 1-sep-0000 10 CA6CBE08B635EAC765EC "Platform"- Copy the license.dat file to
/opt/lsfhpc/conf/license.dat.
LSF_LICENSE_FILEinlsf.confis set automatically during installation toLSF_LICENSE_FILE=/opt/lsfhpc/conf/license.dat.- Start the license daemons (
lmgrd):
- Log on to the license server host.
- Use the
lmgrdcommand to start the license server daemon. For example:%lmgrd -c /opt/lsfhpc/conf/license.dat -l /tmp/license.log
DO NOT runlmgrdas root.
LSF installation puts the
lmgrdcommand in LSF_SERVERDIR. For example:/opt/lsfhpc/6.1/linux2.4-glibc2.3-x86/etc/lmgrd.You should include LSF_SERVERDIR in your PATH environment variable. You should also include the full
lmgrdcommand line in your system startup files on the license server host, so thatlmgrdstarts automatically during system restart.- Check the license daemons (
lmstat):License server status: 1700@front end-0License file(s) on front end-0: /opt/lsfhpc/conf/license.dat: front end-0: license server UP (MASTER) v7.0 Vendor daemon status (on front end-0): lsf_ld: UP v7.0 Feature usage info: Users of lsf_base: (Total of 4 licenses available) Users of lsf_manager: (Total of 4 licenses available) Users of Platform_HPC: (Total of 4 licenses available)See Licensing Platform LSF for detailed information about configuring Platform LSF licenses.
Configure compute nodes in your Platform LSF HPC cluster:
Compute nodes are automatically added to the cluster either when the
lsfservice is restarted or ifinsert-ethersexists. New compute nodes are added to thelsf.cluster.lsfhpcfile. Computer nodes are assigned to thelammpiandmpichp4resources by default. You can change the default by changing the value ofDefaultLSFHostResourcein the Platform OCS database, as follows:#mysql -u apache clustermysql>insert into app_globals (service,component,value) values (`Info','DefaultLSFHostResource','<resource_list>') ;where <resource_list> is a list of all the Resources for a node. LSF HPC includes support for the following MPI implementations:
- Infiniband:
mvapich- Myrinet:
mpich_gm- LAM:
lammpi- Mpich p4:
mpichp4- Scali MPI:
sca_mpimon- Mpich MX MPI:
mpich_mx- MPICH Shared Memory:
mpichsharememSee the
lsf.sharedfile for a full list of all supported MPI implementations.The following is a sample host entry:
Begin Host HOSTNAME model type server r1m mem swp RESOURCES ... compute-0-1 ! ! 1 3.5 () () (lammpi mpichp4) ... End HostSetting up and starting your Platform LSF HPC cluster.
- Log on to the front end node as
root.- Install the Platform LSF HPC license as described above.
- Restart the LSF HPC services:
#service lsf restartThe compute nodes start LSF HPC at boot time.
After starting your cluster, run a few basic LSF commands (
lsid,lshosts,bhosts). For example:%lsidPlatform LSF HPC 6.1 for Linux, Sep 1 2005 Copyright 1992-2005 Platform Computing Corporation My cluster name is lsfhpc My master name is front end-0.publicConfiguring your Platform LSF HPC cluster for master failover
The
LSF HPC Masterappliance type is a node type for LSF HPC master candidates. These nodes are used to offload the LSF Master to a less busy node. You should use theLSF HPC Masterappliance type in larger clusters to install one or more nodes as LSF master candidates.Configure your cluster for master failover using the following procedure:
- Run
insert-ethersand select theLSF HPC Masterappliance type.- Install one or more of the
LSF HPC Masternodes.- Exit
insert-ethers.This is needed to update the
lsf.cluster.lsfhpcfile.- Verify that the NFS filesystem can be mounted on the
lsfhpc-0-0node by running the following commands:- Run the
config-lsf-masterscript in/home/install/upgrades/lsfhpc- Answer the dialog questions when prompted to by the script.
After completing the questions, the LSF HPC cluster should be using
lsfhpc-0-0node as the primary LSF master, and the front end node should be the last node to which it will fail over.You should run the
config-lsf-masterscript each time you add or remove anLSF HPC Masterappliance node.Removing a compute node
To remove a compute node, you should shut down the entire compute node, or at least shut down the Platform LSF HPC daemons, then remove the compute node from LSF and the Platform OCS cluster.
#sshcompute_node_name#shutdownOR
#/etc/init.d/lsf stop- On the front end node, run remove the related host line from the
lsf.cluster.lsfhpcfile.#lsadmin reconfig#badmin reconfigRemoving a host from the Platform OCS cluster:
#insert-ethers --remove="<compute_node_name>"Troubleshooting
Solution:
- Symptom: Unable to launch parallel jobs using LAM MPI when the number of processes increases (approximately 32).
Explanation: lamboot is failing to ssh to the nodes in a timely manner. The issue can be further traced to name resolution in ssh. The cluster may be trying to use a non-existent domain or DNS server.
To test if the issue exists:
- Connect to a compute node and try to ssh to another compute node.
Take note of the time required.
- Log out of the other compute node and try to ssh to the IP address of another compute node.
- If the result is dramatically faster, the issue exists.
- To verify the problem is with name resolution, edit the
/etc/resolv.conffile on a compute node. Change the search line to only include the private domain.- ssh to another compute node. It should respond faster.
Solution: Fix this problem using any one of the following:
- Symptom: A suspended lammpi job cannot be terminated by the owner with
bkill.Solution: Resume the job to be killed; the job will go to
EXIT.- Symptom: If an
MPIjob is launched on a compute node to run across other compute nodes, and other compute nodes are not accessible from the launching node, the job execution goes to failure, butbjobsshows the job isDONE.Solution: Make sure all compute nodes are accessible to each other.
- Symptom: An
mpich_gmjob can be dispatched to a host without Myrinet card installed.Solution: Configure a host group
gm_hostsfor those hosts with Myrinet card properly installed and specify the option-m gm_hostsforbsubwhen submitting the job.- Symptom: When a job failed to launch or a job is terminated during execution, the related temp files are left over under
HOME.Solution: Remove the files manually.
- Symptom: Ctrl-C cannot terminate a started interactive
lammpijob.Solution: Use
bkillinstead.- Symptom:
lammpiapplication fails if run across the nodes mixed with and without Myrinet card installed.Solution: Set the environment variable
LAM_MPI_SSI_rpi=tcpbefore submitting the job.[ Top ]
Platform Lava
Platform Lava is a distributed batch system for submitting jobs and managing the workload on a Platform OCSTM cluster. Platform Lava is free and is based on Platform LSF.
Platform Lava lets you easily manage the day-to-day worklfront endad of a whole cluster, providing simplified job execution, management, and accounting.
Platform LSF HPC will disable Platform Lava. Do not install Platform LSF HPC unless you intend to use it.
If you have already installed Platform LSF HPC, but want to use Platform Lava, you must use the
rollopstool to remove Platform LSF HPC, then re-enable Platform Lava by runningchkconfig -add lavaon all affected nodes. On the front end node, you must also runchkconfig --add lavagui.About the Platform Lava GUI
The Platform Lava GUI gives you the ability to monitor and control your Platform Lava jobs, queues, and hosts. This browser-based interface is installed on your front end only; no components are installed on your compute hosts.
The Platform Lava GUI is not intended to be production quality. We hope to get feedback on its usefulness as a tool for future releases.
Supported browsers
The Platform Lava GUI Web address
Use your browser to navigate to your Platform Lava GUI Web address which is http://<host_name>:<port_number>/Platform.
The word "Platform" in the URL is case sensitive. Ensure that it appears exactly as it appears here. The default port number is 8080.
Logging in to the Platform Lava GUI
To log in to the Platform Lava GUI:
- Navigate to http://<host_name>:<port_number>/Platform with your browser. The default port is 8080.
- Log in as the Platform Lava Administrator or as a listed user with your OS user name and password.
- To login as a Platform Lava Administrator, run
passwd lavaadminto set the password. The username is lavaadmin, with no password as default. For more information, see Administering Platform Lava.
You can not log in as root.
Using the MPI job submission scripts
Platform Lava includes two scripts to aid in submitting LAM over Ethernet and MPICH over Ethernet MPI jobs. The scripts are wrappers to
mpirunwhich handle the setup of themachinefilefor MPICH, or thebhostsfor LAM. The scripts are included with the Lava roll and are located in the$LSF_BINDIRdirectory. The scripts are for use with Lava only, and are named according to the type of MPI:Run the following command to run the
lam-mpirunscript:%bsub -n<num_processors>lam-mpirun -np<num_processors><MPI_JOB> <ARGS>Note: A properly configured LAM environment is required before using this wrapper. The
lam-mpirunwill also calllambootandlamhalt.Run the following command to run the
mpich-mpirunscript:%bsub -n<num_processors>mpich-mpirun -np<num_processors><MPI_JOB> <ARGS>Known issues
Lava GUI
- Relative priority
When selecting scheduling options with the submit job screen, Relative Priority has the following description: 1 is the highest. This should read: 1 is the lowest.
- Host information after restarting Platform Lava
If you stop Platform Lava and then restart it, the
EXEC_HOSTparameter (the host on which a job ran) for recently run jobs is displayed aslost_and_found.- Host page delay
When monitoring jobs through the Hosts page, there is a substantial delay in displaying updated information. Use the Jobs page to monitor job status.
- Logging error
The file
/opt/lava/webgui/log/gabd.loglogs an error every time you log in to the Platform Lava GUI. The error is "Select Interrupted system call." Ignore this error.- Platform Lava GUI service
When you check the status of the Platform Lava GUI service in the command line, you get the following descriptive sentence: Script for starting up and shutting down gabd. The word "gabd" should be "Platform Lava GUI."
- Number of disks
If your host has more than one disk, the Platform Lava GUI displays only one in the host details.
- Mozilla window
When using Mozilla as your browser, the window does not close properly when you change your working directory by going to the Jobs page, choosing Tools | Job Environment. To fix, close the Job Environment window and open it again.
When you request an email to be sent to you when a job completes, the actual email sender is LSF and never the Platform Lava GUI.
SMTP_SERVERandMAIL_SENDERinga.confare ignored.- Maximum remote pending jobs
When you look at Queue | Details for a specific queue, the Maximum Remote Pending Jobs has the value Infinite. The value should be 5000.
- Upload/Download
The Upload/Download feature has been disabled for security reasons.
Troubleshooting
- Symptom: The Lava GUI (
lavagui) link is broken in the main page after reinstall the Lava Roll using therollopstool.Solution: Restart the
lavaguidaemon as follows:The Lava GUI should function normally after you restart the
lavaguidaemon.- Symptom: After removing a compute node from your Platform OCS cluster, the
bhostsorlsloadcommand shows that the Platform Lava daemons are still running on the host. This occurs after you run the following command:#insert-ethers --removehost_nameSolution: Restart the daemons on the master Platform Lava host by running the following command:
#/etc/init.d/lava stop#/etc/init.d/lava start- Symptom: After physically disconnecting a compute node from your Platform OCS cluster, the
bhostsorlsloadcommand shows that the host is UNKNOWN.Solution: Restart the daemons on the master Platform Lava host by running the following commands:
#/etc/init.d/lava stop#/etc/init.d/lava start[ Top ]
Intel® Software Tools Roll
The Intel® Software Tools roll contains the Intel C++ compiler and the Intel MPI Library 2.0 packages, including library and integrated performance primitives, LAM MPI, and MPICH.
Getting a license for Intel compiler and tools
To get a license (either evaluation or commercial) for the Intel compiler and tools, follow these steps:
- Go to the following URL:
http://www.intel.com/software/products/distributors/rock_cluster.htm
- Select the tool for which you want to obtain a license by clicking on the appropriate link under the Intel Software Development Products section.
You need a separate license for each Intel tool.
- Get the license:
Troubleshooting
- Symptom: When trying to run or compile applications using the X86, or EM64T Intel MPI package, you get link time and runtime errors.
Explanation: This occurs because the MPI libraries are not set in the system library path.
Solution: Run the following commands on a front end and compute nodes.
You will see warnings about shared libraries being too small. They can be safely ignored.
- Symptom: Applications compiled with the Intel MKL library cannot run. The following error is encountered when running applications compiled with the Intel MKL:
xhpl: error while loading shared libraries: libmkl_lapack64.so: cannot open shared object file: No such file or directorySolution: Add the MKL library path to
/etc/ld.so.confas follows and runldconfig:
- For x86, add
/opt/intel/mkl701cluster/lib/32to/etc/ld.so.conf- For Intel EM64T, add
/opt/intel/mkl701cluster/lib/em64tto/etc/ld.so.confTo avoid the following error:
xhpl: relocation error: /usr/lib64/libguide.so: undefined symbol: _intel_fast_memsetmake sure the line
/opt/intel/mkl701cluster/lib/em64tcomes before the/lib64and/usr/lib64lines in/etc/ld.so.confto make it link correctly: For example:/opt/gm/lib /usr/X11R6/lib64 /usr/kerberos/lib /usr/X11R6/lib /usr/kerberos/lib64 /usr/lib64/mysql /usr/local/topspin/lib64 /usr/local/topspin/mpi/mpich/lib64 /opt/gridengine/lib/lx24-amd64/opt/intel/mkl701cluster/lib/em64t/lib64 /usr/lib64 /usr/kerberos/lib64 /opt/sge/lib/glinux /opt/nmi/lib /usr/lib64/qt-3.1/lib /usr/lib64/mysql /usr/X11R6/lib64 /opt/intel_fce_80/lib /opt/intel_cce_80/lib[ Top ]
CluMon Roll
CluMon (Beta) is an open source cluster monitoring system developed at the National Center for Supercomputing Applications (NCSA) to keep track of its Linux® clusters. CluMon is a tunable system that can be made to work for almost any set of Linux machines.
For more information on CluMon, see http://clumon.ncsa.uiuc.edu.
The optional CluMon roll fully integrates the CluMon monitoring application with Platform Lava or Platform LSF HPC.
The CluMon roll is for both Standard and Enterprise Editions of Platform OCS version 4.1.1-2.0.
CluMon roll supports x86 Red Hat Enterprise Linux® 4.0 and Intel EM64T Red Hat Enterprise Linux® 4.0.
Known issues
The CluMon roll is beta quality and has the following known issues:
Installation
- CluMon needs Platform Lava or Platform LSF HPC
CluMon is hard coded to work only with Platform Lava or Platform LSF HPC. In the case that both Platform Lava and Platform LSF HPC are not installed, CluMon does not work with the rest of Platform OCS.
- PCP RPM does not install on Segment Server nodes
CluMon PCP RPM does not install on Segment Server nodes. You can not use CluMon to monitor these types of nodes.
CluMon Interface
- No scale units on Host List page
On the Host List page of CluMon graphical user interface (GUI), for each host, the related "1 Min Load", "Mem Usage" charts do not have scale units defined. The "Disk Usage" chart is missing all numerical information.
- Incorrect information on Host List page
The Host List page does not show all running jobs for each host. In addition, the job's Name, Acct, and Owner information fields are empty.
- Incorrect information on Queues Information page
The queues table on the Queues Information page is always empty for all queues. On the Information page for each queue, many values are incorrect (for example very large or negative numbers). In addition, in Jobs Currently Queued for <queue>, all jobs have the state Unknown and all the other columns are empty.
- Closing host
With a Platform Lava cluster, if you run
badmin hcloseto close a host, the CluMon GUI will not show the data for that host.- Incorrect information on Job Information page
After a Platform Lava job is submitted, the Job Information Page shows that the job is running on 0 nodes. However, a job always runs on at least one node.
- Values on Job Information page for Platform Lava jobs
Platform Lava jobs display unusual values on the Job Information page in the CluMon GUI. For example, the following values are displayed for a Platform Lava job:
- Refresh rate
The CluMon GUI refresh rate is hard-coded to 120 seconds. If you have short-running jobs, you can not easily monitor them using the GUI.
- Error messages on Job Information page
During and after a Platform Lava job runs, the Job Information page displays unclear and incomplete messages indicating that no information for the job is available (in the job information and job variable sections). The queue name is missing.
- Incomplete error message
When the PCP (Performance Co-Pilot) is down at the compute node, the CluMon GUI shows the following incomplete error message: "There were problems encountered while collecting data. The errors were encountered while:".
- No updates after daemons are stopped
After the CluMon and PCP daemons are stopped, the CluMon GUI can no longer be updated to reflect the fact that CluMon is shut down.
[ Top ]
PVFS2 Roll
The PVFS2 (Parallel Virtual Filesystem 2) roll is a bundle of all the components you need to run a high-performance distributed file system.
The following groups have collaborated on or supported the development of PVFS:
- Parallel Architecture Research Laboratory at Clemson University
- Mathematics and Computer Science Division at Argonne National Laboratory
- Ohio Supercomputer Center
- NASA Goddard Space Flight Center Code 931
- National Computational Science Alliance (NCSA)
There are three components in PVFS2 1.3.2:
The roll creates a sample distributed file system that should then be fine tuned for your own configuration and hardware. PVFS2 allows the disk space in each node to be accessible to all nodes as a single file system, creating a high-speed file system ideal for datasets and job information. There are some limitations and an administrator should understand them before configuring it. The latest documentation is available from the PVFS2 Web site at: http://www.pvfs.org/pvfs2/documentation.html.
One host must be dedicated as the meta server and named
pvfs2-meta-server-0-0. Once this host is installed, all the others will be able to access a sample PVFS2 file system under/mnt/pvfs2.During installation, an
autofsconfiguration file is installed along with the binaries and source code on all clients. On the first startup after installation, the kernel module will be built. The source