Bugs fixed in SGE 5.3p6 since release 5.3p5
-------------------------------------------
Issue Sun BugId Description
------- ---------- ---------------------------------------------------------------------------------
4753766 Various fixes in man pages
551 4969825 unsupported array task dependencies are not rejected
5006354 sge_ca script will not create user certs when SGE_CELL (default) is not set
871 5018669 qrsh/qlogin: "Connection refused" due to race condition in shepherd
632 5018695 loadsensor doing output to stderr can block
599 5018726 qalter lacks -dl option!
600 5018733 Empty parameters crash qstat and qhost
919 5018757 HPCT jobs may fail - add variable to job environment which points to SGE binaries
5018884 SSL vulnerabilities stated in Sun Alert 57524
835 5019497 modification of loadsensor causes crash of execd
820 5019584 SGEEE: scheduler may crash on Linux AMD64 in functional ticket calculation
709 5019595 Date format YYMMDDhhmm was interpreted incorrectly (qacct, qsub, qalter,...)
635 5019601 "vmem" in qstat -j keeps the max value
771 5019624 qselect/qstat -l selection wrongly considers load and utilization
648 5019635 schedd_job_info=true causes large delays with parallel job scheduling
646 5020131 renaming a user deletes the user
607 5020134 qhost output broken for global consumables
5020139 a stored job template in qmon sets -hold_jid to a wrong value
5020141 qsh and qlogin accepted the options -h and -hold_jid and ignored them later
5020143 qdel XXX.YY- will delete the first array task of job XXX
637 5020153 mail bomb upon abort with tightly integrated par jobs
314 5020278 a colon in a job name breaks qacct
5020282 some Linux distributions are not recognized by SGE
931 5020371 sge_shepherd creates world writable files
5021405 CSP reconnect problem of scheduler and execd
5022074 qmon displays wrong numerical values on AMD Linux in spinbox widgets
Bugs fixed in SGE 5.3p5 since release 5.3p4
-------------------------------------------
Issue Sun BugId Description
------- ---------- ---------------------------------------------------------------------------------
628 4923403 When installing using -csp the sge_ssl* files are not up to date
627 4958080 suspend and resume with HPC Cluster tools 5 not properly working
626 4957760 Fix needed for CERT CA-2003-26 Multiple Vulnerabilities in SSL/TLS
619 4952767 qrsh -notify doesn't work
617 4952236 Broken mail option with SGE 5.3p4 qrsh
597 infotext ignores parameters for HP10, HP11 systems
592 4930789 An overwritten string attribute was ignored in the scheduler
589 4930786 global load values are ignored
4949917 qmon seg faults with a user hold job from qtcsh qtask file
4930793 minor issues with the sgeee ticket update interval
Fixed problem reading in qmaster_param halflife_decay_list
Bugs fixed in SGE 5.3p4 since release 5.3p3
-------------------------------------------
Issue Sun BugId Description
------- ---------- ---------------------------------------------------------------------------------
415 4775325 qstat -j diagnosis message wrongly indicates not enough PE slots
469,489 4815795 qstat -alarm was broken
485 4813965 Reject tightly integrated array jobs (not supported in SGE 5.3)
492 4819479 qhost -q -l arch=xx crashes if exec host is down
499 4822742 deleted sharetree showed up again after qmaster restart
501 4822746 spooling integrity improvement was missing for sharetree, projects and users
502 4824104 SGEEE: allow qalter -p for running jobs
503 overriding queue limits with job request limits incomplete for NEC
504 4838650 Multiple job script writes for array jobs caused ETXTBSY error
506 4847819 sge->sgeee upgrade failed
508 4833346 qsh/qrsh/qlogin might dump core with a segmentation violation
509 4841414 Unable to delete task array job with negative increment
510 4835832 NOTIFY_SUSP signal only sent for first suspension of job
511 4838595 maxujobs does not count suspended or held jobs
513 qrsh (rshd) does not set limits after setting the osjobid (affects e.g. SUPER-UX/SX)
514 4838549 maxujobs scheduler config functionality is broken
515 4838636 sharetree can't be modified
521 4869772 resource limits not correctly set for Linux x86
522 4844838 sge_shepherd does not exit on SIGTERM (e.g. on "rcsge stop")
523 4842844 commd performance, delay for first job start on machines with many job slots
524 4842878 qdel -u does not delete all jobs of the user
526 4845505 cannot qalter/qhold/qrls several tasks of same job
531 4847802 scheduler crashes on Solaris(x86) due to compiler bug
532 4847814 SGEEE: scheduler dispatches wrong jobs when jobs were in error state
540 enabled output file names to have a ":" inside for qsub
541 qlogin/qlogind whitespace checking removed to allow ssh/sshd with optional arguments
544 4886017 qstat crashed, when used with the options -r -s z
545 4859658 SGE: scheduler crashed when job priorities were changed with qalter
547 4860391 qmake dumped core when starting recursive make calls
549 4851939 unexpected output for qstat -j and qmon "Why?"
555 4886025 queue name was truncated to 10 chars
557 4866711 SGE_O_* variables for a task within a tightly integrated job set incorrect
560 4869784 qmon "Qmon" resource file contains syntax errors
568 Allow "-" sign as first character of load_formula
569 Impossible to disable qmaster profiling
572 4883714 SGEEE: qmaster crashed on qdel of tightly integrated PE job with usage_scaling activated
575 4881949 Slave tasks of tightly integrated PE jobs not killed when h_rt limit was exceeded
576 4886026 max_u_jobs could count rejected jobs as submitted
577 4885719 disabled infotext error output if SGE_ROOT is not set
578 4885906 NSLOTS and NHOSTS incorrectly set in environment of tightly integrated tasks
579 4885930 failure of master task of a tightly integrated PE job did not delete the job
580 *_RESERVED_USAGE gives wrong values for PE jobs
174 Checkpointed job was migrated on bad exit status
4749151 sge_ca script -usercert needs RANDFILE env var
4822799 cannot install on Solaris 10
4876169 An incomplete resource request (-l =1) caused qmaster to crash
4893432 upgrade to openssl 0.9.7a
Bugs fixed in SGE 5.3p3 since release 5.3p2
-------------------------------------------
Issue Sun BugId Description
------- ---------- ---------------------------------------------------------------------------------
126 4780316 race condition if signals are to be delivered in job's startup phase
264 array job parameters
314 4713013 qacct may display incorrect accounting information
380 4818741 startup failure of qrsh job is reported as regular job exit
391 4756556 .cshrc error causes [pro|epi]log,pe-[start|stop] failure
392 4756557 non-resolvable hosts in host_aliases file cause wrong hostname resolving
393 4760981 Empty sge_request file causes submission error
398 4816541 no newline character at end of sge_aliases file may crash qsub
402 4778762 Array jobs which contain only one task (id=1) will be handled as single job
403 4769608 qalter shows wrong priority number when using negative priorities with -p option
416 4776016 execd does res. consuming process tracking even if no job is to be controlled
417 4776821 qtcsh can't be used as normal tcsh
418 4776754 complex values for user defined complexes are rejected with global host
423 4778757 stepsize 0 in array job specification results in qmaster exception
424 4778758 memcpy leak in execd
430 4795475 qstat -f output broken for pe jobs on same queue
432 4787598 schedd_job_info messages shown by qstat -j even if it is set to false
435 4790540 sge_schedd process consumes more memory than needed if schedd_job_info=true
436 4790547 Job notification signals won't be delivered if user redefines suspend_method ...
437 4790592 conflicting policies can cause job being started and immediately suspended
438 4791238 SGE may create duplicate accounting entries for parallel jobs
439 4791908 job logging file exists but is empty in certain configurations
440 4792036 job arguments larger than 10k crash qmaster
441 4787623 failover to shadow master leaves sge_schedd on the original qmaster host
446 4794242 wrong usage reported by qstat -j
449 4802831 cannot set -C to null string as described in man qsub
450 4805423 STRING complex attribute handling with RELOP "!=" is broken
455 4802171 qacct -l selection broken
472 4807677 qrsh crash when command line arguments are longer than 4K
478 4813188 qstat -r shows wrong dependencies
486 scripts/util/arch: backed out hp11-64 and aix51 detection since no port available yet
487 4815774 Uninitialized pointer causes segmentation fault in qsh/qrsh on submit only hosts
490 4816529 qmon crash when pressing Why for a list of selected jobs
493 4818737 SGEEE: huge scheduling times if maxujobs is set
4811230 qconf -Muser and qconf -Auser report no success messages
496 Functional policy pending ticket calculations are not intuitive
Bugs fixed in SGE 5.3p2 since release 5.3p1
-------------------------------------------
Issue Sun BugId Description
---------------------------------------------------------------------------------------------------
214 4658716 protocol doing termin. on failure for tightly integr. par. job could be leaner
325 4716824 qlogin and qrsh accept unsupported options
328 4718880 qsub/qalter -l ... might select wrong resource.
4719218 "Job Submission" GUI: blank text in pop out window
331 4719755 wrong port output in qstat error message when qmaster not reachable
4721129 A misoperation in host configuration on qmon leads to qmaster daemon's death
4721134 qmon gets shutdown with the message "Segmentation Fault" in terminal
4721181 Wrong path for qmaster messages file in exe host installation
4722060 CLI: invalid option "-jid jid" for qconf in qconf -help
4723543 Too small panes and cells to display some item names
346 4727457 Proper $SGE_CELL setting fails with non default cell in settings.sh
345 4727515 maxujobs prevents dispatching even if job limit is not reached
4728293 qmon gets shutdown with a word "global" in Cluster Configuration
355 4731288 qmon cluster config dialog does not show gid_range in SGE product mode
356 4731347 can configure fshare/otickets in acls of type DEPT
4731348 access_list(5) man page does not describe file format for SGEEE ACL's
370 4732031 OK without hostname in host_configuration kills sge_qmaster
359 4732545 gethostbyaddr in ~/solaris64 doesn't work.
4733043 qmon dumps core when mouse over an interactive job in Job Control window
4733089 qmon dies after checking 'transfer' in 'queue control' window
4733092 requirement for installation process of exec host with non root user
360 4733859 Userset "defaultdepartment" accepts users in CLI
4735258 CLI: Wrong info for usage
363 4735972 scheduler crashes if all subnodes of a node have 0 shares in sharetree
371 4739596 rlim_fd_max > 1024 can cause 32 bit daemons to crash at startup phase
350 4740335 qmon dies with changes in Edit Tickets on Solaris64.
313 4740350 problems with destin_id_list syntax
333 4740407 SGEEE: Insufficient checking for PTF_{MIN,MAX}_PRIORITY in execd_params
330 4740578 load formula of the scheduler
354 4741230 qmod help output is incomplete
350 4742082 Calculation failure in Functional policy with very large numbers
219 4742133 rcsge startup script has hardwired execd_spool_dir for "stop" procedure
374 4742189 schedd_job_info = true causes immense daemons memory growth in large systems
342 4744523 no error message for interactive job start failure due to wrong DISPLAY settings
325 4745387 qsh, qrsh and qlogin silently ignore options -ac, -dc, -sc and -w
335 4745399 qmake without any information about parallel execution fails
327 4745404 qmake does an incorrect resource request if ARCH is an empty string
379 4747829 accounting record about qrsh termination incomplete
302 4753668 prevent deletion of still referenced objects
308 4753669 qconf gets commd timeout
4753766 Various fixes for 5.3p2 in man pages
4754435 OpenSSL 0.9.6c security vulnerability
389 4755931 possible file access problems on 64-bit file system with 32-bit binaries
383 Fixed restart_command for checkpoint jobs
linux clients use port 65535 by error if COMMD_PORT/sge_commd service not set
Exiting sgeee jobs can be shown in qmon dialog for pending jobs
Documentation update for loose Sun HPC cluster tools integration
Bugs fixed in SGE 5.3p1 since release 5.3
-----------------------------------------
Issue Sun BugId Description
---------------------------------------------------------------------------------------------------
182 allow arguments for *_command and *_daemon parameters in cluster config
215 4673738 disallow "none" load formula and do additional tests for the correctness
223 4700286 complex default value not considered for load/suspend thresholds
226 4665780 qmaster error message during startup: global configuration doesn't exist - creat
229 4668148 solaris psets won't be used by SGE
231 4697491 signal notification can prevent delivery of actual suspend/termination signal
232 4708239 Allow specification of arguments to [rsh|rlogin|qlogin]_[daemon|command]
236 4670664 parallel jobs (e.g. qmake) fail
237 4670669 error message "can't set additional group id for job" for interactive jobs
239 4675410 queue suspend threshold alarm nsuspend>1 does not susp. multiple jobs
240 4675416 use of suspend thresholds can break user sort policy and cause logging
243 4676340 memory leak in sge_schedd
247 4696768 SGE(EE) allows submission of binary job scripts
250 4682966 qsh(1) ignores -S in sge_request(5) files
251 4683852 qalter on running jobs can confuse consumable mgmt
259 4686157 qhost -j is broken
271 4692957 non-privileged users can submit jobs with priority higher than 0
272 4708235 SGE should allow starting qrsh jobs when /etc/nologin exists
275 4706929 qmon does not display job predecessors in job control
284 4699665 qstat resource based job selection broken
287 4701640 problems launching jobs from qtcsh with "&"
299 4677087 execd could crash when executing tightly integrated parallel jobs
311 4712023 global load values can prevent dispatching of jobs
314 4713013 qacct may display incorrect accounting information