Oracle® Clusterware Installation Guide 11g Release 1 (11.1) for Linux Part Number B28263-01 |
|
|
View PDF |
This appendix provides troubleshooting information for installing Oracle Clusterware.
See Also:
The Oracle Database 11g Oracle RAC documentation set included with the installation media in the Documentation directory:Oracle Clusterware Administration and Deployment Guide
Oracle Real Application Clusters Administration and Deployment Guide
This appendix contains the following topics:
To address troubleshooting issues, Oracle recommends that you install OS Watcher, and if you intend to install an Oracle RAC database, RACDDT. You must have access to OracleMetaLink to download OS Watcher and RACDDT.
OS Watcher (OSW) is a collection of UNIX/Linux shell scripts that collect and archive operating system and network metrics to aid Oracle Support in diagnosing various issues related to system and performance. OSW operates as a set of background processes on the server and gathers operating system data on a regular basis. The scripts use common utilities such as vmstat
, netstat
and iostat
.
RACDDT is a data collection tool designed and configured specifically for gathering diagnostic data related to Oracle RAC technology. RACDDT is a set of scripts and configuration files that is run on one or more nodes of an Oracle RAC cluster. The main script is written in Perl, while a number of proxy scripts are written using Korn shell. RACDDT will run on all supported Unix and Linux platforms, but is not supported on any Windows platforms.
OSW is also included in the RACDDT script file, but is not installed by RACDDT. OSW must be installed on each node where data needs to be collected.
To download binaries for OS Watcher and RACDDT, go to the following URL:
https://metalink.oracle.com
Download OSW by searching for OS Watcher, and downloading the binaries from the User Guide bulletin. Installation instructions for OSW are provided in the user guide. Download RACDDT by searching for RACDDT, and downloading the binaries from the RACDDT User Guide bulletin.
The following is a list of examples of types of errors that can occur during installation. It contains the following issues:
Failed to connect to server, Connection refused by server, or Can't open display
Nodes unavailable for selection from the OUI Node Selection screen
PROT-8: Failed to import data from specified file to the cluster registry
/etc/oratab
pointing to a non-existent Oracle home. The OUI error file should show the following error: "java.io.IOException: /home/oracle/OraHome//bin/kfod: not found" (OracleMetalink bulletin 276454.1)/etc/oratab
pointing to a non-existing Oracle home.$ xhost fully_qualified_remote_host_name
For example:
$ xhost somehost.example.com
Then, enter the following commands, where workstation_name
is the host name or IP address of your workstation.
Bourne, Bash, or Korn shell:
$ DISPLAY=workstation_name:0.0
$ export DISPLAY
To determine whether X Window applications display correctly on the local system, enter the following command:
$ xclock
The X clock should appear on your monitor.If the X clock appears, then close the X clock and start Oracle Universal Installer again.
/dev/shm
size for PGA and SGA.
If you are installing on a Linux system, note that Memory Size (SGA and PGA), which sets the initialization parameter MEMORY_TARGET
or MEMORY_MAX_TARGET
, cannot be greater than the shared memory filesystem (/dev/shm
) on your operating system.
/dev/shm
mountpoint size. For example:
# mount -t fmpfs shmfs -o size=4g /dev/shm
Also, to make this change persistent across system restarts, add an entry in /etc/fstab
similar to the following:
shmfs /dev/shm tmpfs size=4g 0
Run the shell command ifconfig -a
. Compare the output of this command with the contents of the /etc/hosts
file to ensure that the node IP is listed.
Run the shell command nslookup
to see if the host is reachable.
As the oracle
user, attempt to connect to the node with ssh
or rsh
. If you are prompted for a password, then user equivalence is not set up properly. Review the section "Configuring SSH on All Cluster Nodes".
rootupgrade.sh
. To confirm, look for the error "utopen:12:Not enough space in the backing store" in the log file $ORA_CRS_HOME/log/
hostname/client/ocrconfig_
pid
.log
./var/opt/oracle/srvConfig.loc
, and copy the contents of this raw device to the new device using the command dd
.time stamp 2005-04-04 14:49:49 is 106 s in the future
nscd
has not been activated./sbin/service nscd start
You have missing operating packages on your system if you receive error messages such as the following during Oracle Clusterware, Oracle RAC, or Oracle Database installation:
libstdc++.so.5: cannot open shared object file: No such file or directory libXp.so.6: cannot open shared object file: No such file or directory
Typically, errors such as these occur if you have not fully checked required operating system packages during preinstallation, and failed to confirm that all required packages were installed. Run Cluster Verification Utility (CVU), either from the shiphome mount point (runcluvfy.sh
), or from an installation directory (CRS_home
/bin
). CVU reports which required packages are missing.
If you have a Linux support network configured, such as the Red Hat network or Oracle Unbreakable Linux support, then you can also use the up2date
command to determine the name of the package. For example:
# up2date --whatprovides libstdc++.so.5 compat-libstdc++-33.3.2.3-47.3
If Oracle Universal Installer (OUI) does not display the Node Selection page, then perform clusterware diagnostics by running the olsnodes
-v
command from the binary directory in your Oracle Clusterware home (CRS_home
/bin
on Linux and UNIX-based systems, and CRS_home
\BIN
on Windows-based systems) and analyzing its output. Refer to your clusterware documentation if the detailed output indicates that your clusterware is not running.
In addition, use the following command syntax to check the integrity of the Cluster Manager:
cluvfy comp clumgr -n node_list -verbose
In the preceding syntax example, the variable node_list
is the list of nodes in your cluster, separated by commas.
If you use more than one NIC for the interconnect, then you must use NIC bonding, or the interconnect will fail.
If you install Oracle Clusterware and Oracle RAC, then they must use the same NIC or bonded NIC cards for the interconnect.
If you use bonded NIC cards, then they must be on the same subnet.