Troubleshooting a 7318 IPX Directed Boot


Table of contents

About this document
Prerequisite information
Checking the 7318's status lights
Checking the interface
Checking daemons
Verify the IPX network
Checking for unique IPX internal network numbers
7318 SMIT configuration
Checking file permissions
Checking the IPX communication channel
Verifying the 7318's NVRAM
Viewing the 7318's load sequence
Setting up error logging
PTF level

About this document

The purpose of this document is to assist with trouble shooting an IPX directed boot for an IBM 7318 model P10 or model S20. This document was designed for AIX System Administrators with a knowledge of AIX administration and the 7318. Using this document, IPX directed boot trouble shooting takes about 25 minutes. In a directed load configuration, the 7318 requests its boot image and configuration file from a specific host. The boot configuration is primarily on the 7318 but the host must have active daemons that will respond to the 7318 file requests. This document contains modules that troubleshoot the directed load configuration on the 7318 and the host.

This document applies to AIX levels 3.2.5 to 4.3.2.

Remote troubleshooting

The 7318 was originally designed to boot and operate local to an IBM eServer pSeries or RS/6000 (for example, the pSeries or RS/6000 and 7318 are on the same LAN). Frequently the 7318's functionality is needed in a remote network (for example, the pSeries or RS/6000 and 7318 are on separate networks, linked by a gateway or router). The 7318 is capable of booting while remote from the pSeries or RS/6000, but special care needs to be taken for the communication channel. Remote troubleshooting tips will be listed throughout this document when troubleshooting a remote 7318 varies from troubleshooting one local to the pSeries or RS/6000.


Prerequisite information

Assumptions

Before using this document, the following is assumed:

Terms used in this document:

Requirements

To use this document effectively, you must:


Checking the 7318's status lights

The lights on the front of the 7318 show its status. From left to right, the lights correspond to Power, Ready, AUI interface in use, and 10Base-T interface in use. Different light sequences will be shown during the 7318's power-on and hardware check, booting, and normal operation. For more information on the light sequences, refer to page 2-5 in the 7318 Serial Communications Network Server Guide and Reference (SC23-2542-00).

  1. Check the light sequence on the front of the 7318 and compare it to the possible sequences listed below.

    Legend:

    Light Sequence Condition and Resolution, if applicable:

    0 0 0 0 The 7318 is not powered On.The 7318 does not have a power on/off switch. You must use the AC plug to power on and off the 7318.

    1 0 0 0 The 7318 has a hardware problem, if this sequence persists for more than 1 minute. In that case, the 7318 needs to be serviced or replaced.

    1 B 0 0 The 7318 is trying to boot but cannot communicate with the network. Check all cabling connecting the 7318 to the LAN. Replace cabling if necessary. Continue Troubleshooting once the 7318 is able to transmit onto the LAN.

    1 B 1 0 The 7318 is trying to boot and can transmit onto the LAN.

    1 B 0 1 The 7318 is trying to boot and can transmit onto the LAN. The 7318 is trying to boot but cannot find a load host. Proceed to the next section to continue troubleshooting.

    1 1 1 0 The 7318 is booted and can transmit onto the LAN.

    1 1 0 1 The 7318 is booted and can transmit onto the LAN. The 7318 has booted and is operating as designed. If you are unable to communicate with the 7318, the problem is beyond booting and the scope of this document.

  2. Determine the next action. If the sequence 1 B 1 0 or 1 B 0 1 appears, continue with the next section. If any other sequence appears, correct the condition or resolution listed above before continuing to the next section.

Checking the interface

The host communicates with the network through an interface (tr0, en0, et0, fi0). To configure an interface for IPX, the interface must be active (the words <UP, RUNNING> appear in the flags when the ifconfig command is run).

NOTE:

Verify that the communications interface is active using the ifconfig command and the adapter name.

If the interface is active (the words <UP, RUNNING> appear in the flags list), then continue to the next section. If the words <UP, RUNNING> do not appear in the flags list, then configure the interface through SMIT, execute:

   smit inet 

Checking daemons

The Terminal Server depends on many daemons for operation. The sapd and npsd daemons (Novell Protocol Suite) manage the IPX channel for communication and the cnsview daemon manages all Terminal Server devices and processes on the host. All three daemons must be active for a successful IPX boot and normal Terminal Server operation.

  1. Verify to see if the IPX npsd daemon is active, execute:
       ps -ef | grep npsd 
    

    Sample output when active is as follows:

    	root 5150 	  1 0 06:45:54 - 	0:00 ./npsd 
    	root 16170 15336 2 15:47:33 pts/1    0:00 grep npsd 
    
  2. Check to see if the IPX sapd daemon is active, execute:
       ps -ef | grep sapd 
    

    Sample output when active is as follows:

    	root 5404      1 0 06:45:52 -      0:09 ./sapd 
    	root 16178 15336 5 15:47:39 pts/1  0:00 grep sapd 
    
  3. Check to see if the cnsview daemon is active, execute:
    ps -ef | grep cnsview 
    

    Sample output when active is as follows:

    	root 3370      1 0 06:46:03 -      0:03 /usr/bin/cnsview -c daemon start 
    	root 18978 15336 6 15:47:25 pts/1  0:00 grep cnsview 
    

    If all three processes were listed in the output from the preceding ps commands, then please continue to the section "Verify the IPX network". If the cnsview daemon was active but the sapd and/or npsd daemons are not active, stop the cnsview daemon and restart all three daemons using the sequence in step 4 below.

    To stop the cnsview daemon, execute:

       cnsview -c "daemon stop" 
    

    If all three processes were NOT listed in the output from the preceding ps commands, continue to start the daemons.

  4. To start the sapd and npsd daemons, execute:
       /usr/lpp/netware/bin/startnps 
    
  5. To start the cnsview daemon, execute:
       cnsview -c "daemon start" 
    

    NOTE: The cnsview daemon cannot be active until both the sapd and npsd daemons are active. If you are stopping daemons and yet they are still listed as an active system process, execute a kill -1 on the process id and restart all three daemons starting with step 4 above. If you are still encountering difficulties, please contact an IBM AIX Technical Specialist.

  6. Once these daemons are active, continue to the next section.

Verify the IPX network

The host's IPX routing table can be viewed by executing the following program:

   /usr/lpp/netware/bin/drouter 

If the IPX network is configured properly, it will be listed in the IPX routing table.

Remote configurations

All remote IPX networks will appear in the host's IPX routing table if there is an active IPX communication channel between the two IPX networks (i.e. a router or gateway is configured for IPX and passes IPX packets).

  1. To view the host's IPX routing table, execute:
       /usr/lpp/netware/bin/drouter 
    

    Sample output is as follows:

    NETWORK   HOPS TIME  NODE          NETWORK   HOPS  TIME  NODE 
    --------  ---- ----  ------------  --------  ----  ----  ------ 
    00000001  0000 0001  000000000001  00000002  0000  0001  02608C2F7119 
    00000003  0000 0001  02608C2F1591  00000004  0001  0002  00406E0002F5 
    00000005  0001 0002  00406E0002DB 
    

    NETWORK is the network number, internal or external. There should be one entry in this table for each network segment in the overall network.

    HOPS is the number of routers which must be passed through to get to this network.

    NODE is the Ethernet address of the station used to get to the network.

  2. Verify that the IPX network is shown in the listing.

    If the IPX network on which the 7318 resides is shown in the host's IPX network table, continue to the next section.

    If an IPX network is not shown and must be configured, refer to documentation on configuring IPX networks or call an IBM AIX Technical Specialist.


Checking for unique IPX internal network numbers

If any two IPX hosts on the same inter-network have the same internal network number, the SPX link between the 7318 and the host will be unstable and the 7318 may not boot. Therefore, no two hosts can have the same IPX internal network number.

NOTE: The default internal network number is 00000001 but it can be any eight digit hexadecimal number (it is usually the last eight digits of the host's mac address). The internal network number must also be different from the LAN network numbers in the environment.

  1. Open the /etc/netware/NPSConfig file with your favorite editor.

  2. Search on internal_network.

    Below is a sample internal network number from the /etc/netware/NPSConfig file.

       internal_network = "00000001" 
    
  3. Check the internal_network number in the /etc/netware/NPSConfig file on every other IPX host on the network (repeat steps 1 and 2).

    If all the internal_network numbers are unique, continue to the next section.

    If any internal network numbers are the same, make each host's internal network number unique and continue with step 4.

  4. After making any modifications to the /etc/netware/NPSConfig file, save the changes and exit the file.

  5. Execute the command sequence below on each modified host to recycle the IPX daemons.

    WARNING: Recycling these daemons will disconnect any device communicating with the host via IPX (includes P10 Style Ports)!

       cnsview -c "daemon stop" 
       /usr/lpp/netware/bin/stopnps 
       /usr/lpp/netware/bin/startnps 
       cnsview -c "daemon start" 
    
  6. Continue with the next section.

7318 SMIT configuration

Verify that the 7318's device information, specified download image, and IPX network address are correct in its SMIT configuration.

  1. Start SMIT and use the fast path ts7318_cs_mnu, execute:
       smitty ts7318_cs_mnu 
    
  2. Choose Show/Change Configured ComNetServers.

  3. Select the appropriate 7318. A sample stanza of a 7318 configuration is as follows:
       [Entry Fields]
       ComNetServer Number                    01
       ComNetServer Network Address       [00000002]
       ComNetServer Ethernet Address      [00406ee00155]
       ComNetServer Bootfile              [/usr/lib/cns/cns-p10] 
    
  4. Verify that the ComNetServer Network Address is correct.

    It should match an IPX network address listed in the host's IPX routing table. If the network address is incorrect, change it. If the network address is not listed in the host's IPX routing table, either change the 7318's network address in the ComNetServer configuration stanza or check with your network administrator to verify the correct IPX network configuration.

  5. Verify that the ComNetServer Ethernet Address is correct.

    This number is the 7318's hardware address and is labeled on the back of the 7318.

  6. Make sure the ComNetServer Bootfile is specified.

    The default boot image and path for the model P10 and S20 are /usr/lib/cns/cns-p10 and /usr/lib/cns/cns-s20 (or /usr/lib/cns/cns-s20e) respectively.

  7. If no modifications were made in SMIT, then skip to step 8 in this section, otherwise press Enter to implement the changes.

    NOTE: SMIT automatically refreshes the cnsview daemons. Reboot the 7318. This can be done in two ways.

  8. If the 7318's SMIT configuration is correct, delete the 7318 device configuration, re-add the 7318 definitions once more, and reboot the 7318.

    NOTE: It is possible that the device data in the ODM is corrupt and a complete reconfiguration solves the problem. If the 7318 still fails to boot, continue to the next section.


Checking file permissions

In a broadcast load configuration, the 7318 requests its boot image and configuration from the host. The boot image and configuration file must have permissions that allow the 7318 to download them.

  1. Change to the directory in which the load image and configuration file reside.

    The default is /usr/lib/cns.

  2. Verify that the permissions for the files are world-readable, execute:
       ls -l | more 
    

    Correct sample file permissions are as follows:

       -r--r--r--      1 root     system    442532 Jul 24 01:56 cns-p10 
       -r--r--r--      1 root     system   1240228 Jul 24 01:56 cns-s20e 
       -rw-r--r--      1 root     system     17539 Sep 09 1995 p10.cfg 
       -rw-r--r--      1 root     system     43120 Jul 24 01:56 s20.cfg 
    
  3. If the file is not world-readable, change the permissions, execute:
       chmod 444  
    
  4. Continue to the next section.

Checking the IPX communication channel

Now that we have verified that an IPX network exists, it is time to check IPX communication between the host and the 7318. To verify IPX communication between the 7318 and host machine, send a broadcast IPX ping to every 7318 on the network.

  1. To send a broadcast IPX ping, execute:
       cnsview -c "ipxping -b" 
    

    To IPX ping a specific 7318, execute:

       cnsview -c "ipxping external_ipx_network_number:7318's_Ethernet_address" 
    

    for example,

       cnsview -c "ipxping 00000002:00406ee00155" 
    

    Sample output is as follows:

       [root@ivorye] / # cnsview -c "ipxping -b"
       00000002:00406ee00175 is responding but not online
       00000002:00406ee00155 is responding but not online
       00000002:00406ef000f0 is responding and online 
    

    NOTE: The success of an IPX ping implies the 7318 is powered on. There is an IPX path to the 7318. The routers, if any, are routing the packets correctly.

    NOTE: The success of an IPX ping does NOT imply that the 7318 has booted, the 7318 is configured so that the SPX link is present.

  2. Check whether the 7318 (Terminal Server) is listed in the output.

    If the 7318 is not listed and is local to the host (on the same LAN, including hubs) check if the IPX network on which the 7318 resides is listed in the host's IPX routing tables.


Verifying the 7318's NVRAM

In this section, we will verify the boot parameters specified in the NVRAM. For a directed load configuration, all booting parameters (load host and boot files) are specified in the 7318's NVRAM (Non-Volatile RAM), which can be modified through a BIOS console session. A BIOS console session can be accessed by connecting a terminal (IBM3151) to one of the ports on the front of the 7318 with an RJ-45 cable and a null-modem adapter.

  1. Verify the terminal connectivity and configuration. The terminal should be connected to any one of the 16 ports on the front of the 7318. The terminal should emulate an ASCII terminal with the following settings:
       9600 baud       8 data bits     no parity       1 stop bit 
    
  2. Start a BIOS console session.

    Recycle the power on the 7318 by unplugging and replugging the power cord. When the 7318 is powered back on, hold the Shift key and press 3 (the # sign) repeatedly until four # signs scroll across the screen of the terminal.

    NOTE: You can access a BIOS Console between the time the 7318 is first powered on and the ready light (second light) starts blinking. If the ready light begins blinking, you have missed the window to enter the BIOS Console and must recycle the power and try again. Once four # signs scroll across the screen, the 7318 will begin a BIOS console session.

  3. Make changes to the NVRAM, change to the administrative user, execute:
       admin 
    
  4. Display the current contents of the NVRAM, execute:
       show 
    
  5. Verify the boot parameters using the set command.
    1. Verify the load image for the 7318.

      Default load images (in the /usr/lib/cns directory) are as follows:

      • cns-p10
      • cns-s20
      • cns-s20e

      Sample command to set the load image for a P10 is as follows:

         set loadimage /usr/lib/cns/cns-p10 
      
    2. Verify the configuration file for the 7318.

      Sample command to set the configuration file for 00406ef000f0 is as follows:

         set config /usr/lib/cns/00406ef000f0.cfg 
      

      NOTE: If the 7318's configuration file has not been created, copy the 7318's default configuration file and name it 00406eXXXXXX.cfg (where XXXXXX refers to the last six digits of the 7318's hardware address). The default configuration files (in the /usr/lib/cns directory) are: s20.cfg, and p10.cfg. You can use the default configuration to boot the 7318 but do not modify the default configuration file.

    3. Verify the load interface used by the 7318.

      To modify the load interface, use the set command to specify the correct load interface. The 7318 can be forced to use either the AUI or 10BaseT Ethernet interface instead of searching for the attached interface. If unsure, leave the default value.

      Sample command to set the load interface is as follows:

         set interface 1 
      

      where:

      0 - auto (default)

      1 - 10BaseT Interface

      2 - AUI Interface

      NOTE: This may be needed if the HA feature is being used and the system to load from is on the AUI port (the 10BaseT is checked first). Otherwise, leave the default value.

    4. Verify that the BIOS code level on the 7318 is at the latest BIOS level, execute:
         show 
      

      NOTE: The latest BIOS level as of 03/25/99 is 5.23. If you are unsure whether the 7318 is at the latest BIOS level, please contact an IBM AIX Technical Specialist.

    5. Verify the frametype used by the 7318.

      To modify the frametype, use the set command to specify the correct packet type. The 7318 can be forced to use a packet type by setting the frametype parameter. If unsure, leave the default value.

      Sample command to set the frametype is as follows:

         set frametype 1 
      

      where:

      0 - auto (default)

      1 - Ethernet Version 2

      2 - Ethernet SNAP

      3 - Ethernet 802.3

      4 - Ethernet 802.2

      NOTE: This feature is most useful for speeding up the boot process. If the frametype is specified, the 7318 will not search for the frametype needed.

    6. Verify the protocol.

      To modify the protocol, use the set command to specify the correct protocol. The 7318 can be forced to use either IPX (1) or TCP/IP (2) to load. If unsure, leave the default value.

      Sample command to set the protocol is as follows:

         set protocol 1 
      

      where:

      0 - auto (default)

      1 - IPX

      2 - TCP/IP)

      NOTE: Since IPX is tried first, specifying this parameter speeds up the boot process for a 7318 using TCP/IP (UDP). No time will be wasted looking for an IPX boot host when one does not exist.

    7. Verify the IPX load host's internal IPX address.

      To force the 7318 to download files from a specific SPX/IPX host, set the host1 parameter with the internal IPX network number of the host (this is done in the 7318's BIOS Console).

      Sample command to set an IPX load host (host's internal_network number is 8ae46c31) is as follows:

         set host1 8ae46c31:000000000001 
      
    8. Save the Boot Parameters

      If any boot parameters were changed, save the modifications, execute:

         save 
      

    Continue to the next section.


Viewing the 7318's load sequence

Now that we have verified the 7318's NVRAM configuration:

  1. Reboot the 7318.
  2. Watch as the 7318's boot sequence is displayed on the terminal connected to a port on the 7318.1.
  3. Issue the load command from the BIOS console session on the 7318.
  4. Use the boot sequence displayed on the terminal to troubleshoot the load configuration.
  5. Watch the output sent to the terminal as the 7318 cycles through its boot sequence and tries to find its load host.
  6. If the 7318 continues to repeat its boot cycle, continue to the next section.

Setting up error logging

Many times error reporting can show why the Terminal Server is not booting properly. When configured, these errors and their codes can be logged to a file. This section sets up error reporting for cns, the software that manages the Terminal Server.

  1. To setup error reporting, open the /usr/lib/cns/cnsd.conf file with a text editor.

  2. Search for the entrylog:
       config  svclts  2sess   periodic method vpd     boot    stats 
    
  3. If this line is commented out, remove the # sign at the beginning of the line.

  4. Save the modifications and exit the editor.

  5. Recycle the cnsview daemon to start error reporting, execute:
       cnsview -c "daemon stop"# cnsview -c "daemon start" 
    

    WARNING: All other Terminal Server communication (IPX) will be stopped on the host when these daemons are recycled.

  6. Reboot the 7318. This can be done in two ways.

  7. Both the device driver and the cnsview daemon will log errors to different files. These files are the /usr/lib/cns/cnsd.log file and the AIX error log.

    NOTE: The resource names used in the AIX error log are as follows:

       cnsdd - Events logged by the CNS device driver 
       cnsview - Events logged by the CNSVIEW daemon for the 7318 units 
       cnld - Events logged by the CNSVIEW daemon associated with downloading/ 
               booting 
    
  8. To review the driver-logged errors, execute:
       errpt -aNcnsdd | more 
    

    NOTE: Every 3-4 minutes, the cnsview daemon checks the status of the SPX links to those 7318 devices which were configured. When the SPX links are not present, an error log entry is made, similar to the following:

       ERROR LABEL:            CNS_DISCONNECT 
       ERROR ID:               5EBD0D06 
       Date/Time:              Fri Jun  9 22:34:41 
       Sequence Number:        9987 
       Machine Id:             000001871800 
       Node Id:                levesconte 
       Error Class:            S 
       Error Type:             PERM 
       Resource Name:  cnsdd 
       Error Description 
       Driver for ComServer 
       Probable Causes 
       REMOTE NODE 
       Failure Causes 
       COMMUNICATIONS/REMOTE NODE 
       SOFTWARE PROGRAM 
       Recommended Actions 
       RUN STANDALONE DIAGNOSTICS 
       Detail Data 
       ERROR CODE 
       0000 0000 
       Comm 
       Probable Cause 
       REMOTE NODE 
       Failure Causes 
       COMMUNICATIONS/REMOTE NODE 
       SOFTWARE PROGRAM 
       Recommended Actions 
       RUN STANDALONE DIAGNOSTICS 
       Detail Data 
       ERROR CODE 
       0000 0000 
       Communications Device Name: 
       00406e0002db 
    
  9. If the problem is undetermined, make copies of the error reports and send them to an IBM AIX Technical Specialist.

PTF level

If the 7318 still fails to boot, there may be a problem with the cns software on the system. If you suspect a software bug, run the following command to determine the level of the cns software on your pSeries or RS/6000.

  1. For AIX 3.2, AIX 4.2 and AIX 4.3, execute:
       lslpp -h *cns* | pg 
    
  2. For AIX 4.1, execute:
       lslpp -h *7318* |pg 
    

    Contact AIX SupportLine or use the Fixdist application for a listing of the latest PTFs for the 7318 software.


    [ Doc Ref: 90605192814780     Publish Date: Jan. 03, 2001     4FAX Ref: 7448 ]