Errata for Cougaar 9.4.1


The following infos/problems/workarounds were detected/posted after Cougaar 9.4.1 was released.

Issues:


New Bugs:

Persistence/Rehydration bugs:

The following bugs were observed while persistence was enabled and a supply point agent (47-FSB) was killed and then restarted. Although the behavior of all agents was not characterized, we expect most "supply thread" agents to exhibit similar behavior as they contain a similar complement of plugins. Upon rehydration the following known bugs occur: These bugs occured after quiescence, but 1847 and 1850 were also observed while restarting in the midst of processing.

Restart reconciliation bug:

This bug occurs if an agent runs with persistence, is killed, and then restarted and rehydrated with its prior state. The restarted agent only reconciles its blackboard with the agents that exist at the restart time, and fails to reconcile with agents that are created afterwards.

This is bug #1936, which will be fixed in the next release (fixed in 9.4.1.1).

Windows AppServer kill doesn't automatically kill child processes:

Bug #1662 is due to a Windows JVM change from JDK 1.3 to 1.4. If the user "CTRL-C"s an AppServer that is currently running nodes, these child nodes are not automatically killed by the dying AppServer JVM process.

Before killing an AppServer, Windows users should be careful to either make sure that CSMART has killed all the nodes on that AppServer. The Windows TaskManager can also be used to check for running "zombie" nodes and kill them.

CSMART Performance issues and runtime bugs:

CSMART Experiment saving turns out to be quite slow, particularly when many recipes are included. We are actively addressing this on several fronts. In addition, a number of issues have been identified, in particular with the new Component Collection recipes.

Various CSMART Usability issues:

A number of relatively minor CSMART usability issues were discovered. Unless otherwise indicated, these will be fixed in the next patch release. For details on these bugs, see the CSMART Release Notes for that release.

RSS thread deadlock issue

A thread deadlock problem in the RSS was recently detected and corrected. See Bug 2029. The fix is checked in as part of the quoSumo.jar, in jars/lib. You must get the newest version of this jar to prevent this deadlock from locking up the threads of the MetricService. The command
   jar tf quoSumo.jar | grep  VERSION
should print VERSION_3-0-10_02-08-13_15-49-22 (or later).

Following bugs in 9.4.1 were fixed in 9.4.1.1: