- 08 Jun, 2017 1 commit
-
-
Paul Rich authored
-
- 18 May, 2017 1 commit
-
-
Paul Rich authored
-
- 01 May, 2017 1 commit
-
-
Benjamin Allen authored
Instead of enumerating all groups on the system and comparing, check the members of the specific queue groups. This makes CQM compatible with enumerate = False in sssd.conf and does less work overall.
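A minimal sketch of the per-group lookup described in this commit message, assuming Python's standard grp module. The function name user_in_queue_groups and the injectable lookup parameter are illustrative and are not taken from CQM.

```python
import grp


def user_in_queue_groups(user, queue_groups, lookup=grp.getgrnam):
    """Return True if user is listed as a member of any of the queue's groups.

    Each group is looked up by name, so no full enumeration of the
    system's groups (grp.getgrall) is needed.
    """
    for name in queue_groups:
        try:
            entry = lookup(name)
        except KeyError:
            # The group is not known to the system; skip it.
            continue
        if user in entry.gr_mem:
            return True
    return False
```

Looking groups up by name keeps working when the directory service refuses to enumerate, since only the groups named on the queue are ever requested.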
-
- 11 Apr, 2017 1 commit
-
-
Paul Rich authored
We need to log and end the task timers in __task_finalize for both __rc_unknown and __rc_success. The only outcome that doesn't officially end a task is __rc_retry, as it will go into communication retries at that point.
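The rule in this commit message can be sketched as follows. This is an illustration only: the constants RC_SUCCESS, RC_RETRY and RC_UNKNOWN, the function finalize_task, and the end_timer and log callbacks are hypothetical stand-ins, not the names used in the code base.

```python
RC_SUCCESS = "success"
RC_RETRY = "retry"
RC_UNKNOWN = "unknown"


def finalize_task(rc, end_timer, log):
    """End the task timer and log, unless the task is going into retries.

    Returns True if the task was ended, False if it will be retried.
    """
    if rc == RC_RETRY:
        # Retries keep the task alive, so the timer keeps running.
        return False
    # Both success and unknown outcomes end the task.
    end_timer()
    log("task finished: %s" % rc)
    return True
```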
-
- 10 Apr, 2017 1 commit
-
-
Paul Rich authored
Only set the timers/emit messages once, despite retry attempts.
-
- 07 Apr, 2017 1 commit
-
-
Paul Rich authored
start.
-
- 01 Nov, 2016 1 commit
-
-
Paul Rich authored
-
- 31 Oct, 2016 1 commit
-
-
Paul Rich authored
-
- 06 Oct, 2016 1 commit
-
-
Paul Rich authored
-
- 12 Sep, 2016 2 commits
- 02 Jun, 2016 1 commit
-
-
Paul Rich authored
This adds the limiter for maximum jobs overall running in queue. Useful for profiling machines with noisy network environments. This also adds output to cqadm for this information, and an entry in the cqadm manpage.
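A rough sketch of how such a limiter could gate job starts. The function name may_start_job, its parameters, and the convention that None means "no limit" are assumptions made for illustration, not the actual queue attributes.

```python
def may_start_job(running_in_queue, max_running=None):
    """Return True if another job may start in the queue.

    max_running of None is treated as "no limit" (an assumption of this sketch).
    """
    if max_running is None:
        return True
    return running_in_queue < max_running
```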
-
- 16 Oct, 2015 1 commit
-
-
Paul Rich authored
Adds feature requested for #4320. This will signal the head pid rather than the process group. Not recommended for use with csh scripts.
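The difference between the two signalling modes can be sketched with the standard os module. The function signal_job and its head_only parameter are hypothetical names for illustration; only os.kill, os.killpg and os.getpgid are real library calls.

```python
import os
import signal


def signal_job(head_pid, sig=signal.SIGTERM, head_only=False):
    """Deliver sig to a job's head process, or to its whole process group."""
    if head_only:
        # Only the head process receives the signal; its children do not.
        os.kill(head_pid, sig)
    else:
        # Every process in the head's process group receives the signal.
        os.killpg(os.getpgid(head_pid), sig)
```

With head_only set, the head process is responsible for passing the signal on to anything it has spawned.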
-
- 04 May, 2015 1 commit
-
-
Paul Rich authored
When upgrading you would lose all dep-held jobs. This makes sure that the appropriate flag is added and set on reinitialization post-upgrade.
-
- 26 Mar, 2015 1 commit
-
-
Paul Rich authored
I was getting a dep-release message when using qsub -h. This fixes the message. Dependency handling and reporting appear to be working. Restart tests are pending.
-
- 23 Mar, 2015 1 commit
-
-
Paul Rich authored
Statefile tests and multi-chain tests pending.
-
- 20 Mar, 2015 1 commit
-
-
Paul Rich authored
Some systems are configured such that the user is only associated with their default group in /etc/passwd, as opposed to /etc/group. This change lets the groups check work as expected and pass in that circumstance.
-
- 18 Mar, 2015 1 commit
-
-
Paul Rich authored
If both groups and users are set on a queue, the submitter must be in users and also belong to a group in groups.
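A small sketch of the combined check. The function may_submit and its parameters are illustrative names, and treating an unset or empty restriction as "no restriction" is an assumption of this sketch.

```python
def may_submit(user, user_groups, queue_users=None, queue_groups=None):
    """Return True if user passes every restriction that is set on the queue."""
    if queue_users and user not in queue_users:
        # A users restriction is set and the submitter is not listed.
        return False
    if queue_groups and not set(user_groups) & set(queue_groups):
        # A groups restriction is set and none of the submitter's groups match.
        return False
    return True
```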
-
- 13 Mar, 2015 1 commit
-
-
This makes dep_hold a proper hold state (as opposed to an iterative check). Jobs in dep_hold should now follow the same score accrual rules as other holds (they don't gain score). Verification is required, and integration testing must be done. Upgrade tests are also pending.
-
- 11 Mar, 2015 2 commits
-
-
Paul Rich authored
There is simple scaling for dep_frac that can be enabled in cobalt.conf. It gives dependent jobs a smaller fraction if the dependent job is smaller in nodecount than its parent. This is based on a patch submitted by Adam Scovel <ascovel@alcf.anl.gov>
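The commit message does not give the scaling formula. As one possible reading, the sketch below scales dep_frac linearly by the ratio of node counts; both the formula and the function name scaled_dep_frac are assumptions, not the patch's actual behavior.

```python
def scaled_dep_frac(dep_frac, child_nodes, parent_nodes):
    """Scale dep_frac down when the dependent job is smaller than its parent."""
    if parent_nodes <= 0 or child_nodes >= parent_nodes:
        # No parent size to compare against, or the dependent job is not smaller.
        return dep_frac
    # ASSUMPTION: linear scaling by the node-count ratio.
    return dep_frac * (float(child_nodes) / parent_nodes)
```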
-
This lets a Unix group be set on a queue as a restriction. If the submitter isn't a part of the group, the job is rejected.
-
- 10 Mar, 2015 1 commit
-
-
Paul Rich authored
-
- 22 Jan, 2015 1 commit
-
-
Paul Rich authored
Jobs now only inherit if they share a project
-
- 12 Jan, 2015 1 commit
-
-
Paul Rich authored
When users release holds, total_etime should be reset. This is a patch from Adam Scovel.
-
- 03 Oct, 2013 1 commit
-
-
Paul Rich authored
This fixes bad defaults in ion kernel options and ensures proper operation with alt-kernel support disabled.
-
- 09 Sep, 2013 2 commits
-
-
Brian Toonen authored
-
George Rojas authored
Also, fixed local simulator setup script.
-
- 05 Sep, 2013 1 commit
-
-
Brian Toonen authored
-
- 22 Aug, 2013 1 commit
-
-
George Rojas authored
1. PBS-style directive for script jobs
2. Multiple env unions
3. Added appropriate tests
-
- 21 Aug, 2013 1 commit
-
-
Paul Rich authored
Also adding in geometry and passthrough blocking information to the logging database. We can backfill data later.
-
- 09 Aug, 2013 1 commit
-
-
George Rojas authored
-
- 06 Aug, 2013 1 commit
-
-
George Rojas authored
Fixed typo
-
- 24 Jul, 2013 1 commit
-
-
Paul Rich authored
Have some more work to do with suppressing reboots if we're still on the default kernel, but this appears to be doing the right thing for the most part.
-
- 12 Jul, 2013 1 commit
-
-
Paul Rich authored
This rev is likely broken. Checked in to archive changes with testing pending.
-
- 04 Feb, 2013 1 commit
-
-
Paul Rich authored
Flag to qsub/qalter to request that a script job not be prebooted. This gets ignored for cN mode jobs. Also adding in a client that lets us get the bootable blocks that are children of a given block. For use by scripts so they have an easier time splitting up blocks for ensemble jobs.
-
- 05 Nov, 2012 1 commit
-
-
Paul Rich authored
Still requires qalter and display support. The basic cases are working, though. Also need to add validation code to make sure that qsub gets rid of the obviously wrong cases. This allows a user to state: only run this job on this geometry.
-
- 17 Jul, 2012 1 commit
-
-
Paul Rich authored
This is a problem we saw on Vesta where a resource would not be released if a user put a job on hold while the job was being scheduled by bgsched.
-
- 20 Jun, 2012 1 commit
-
-
Brian Toonen authored
-
- 19 Jun, 2012 1 commit
-
-
Brian Toonen authored
-
- 18 Jun, 2012 1 commit
-
-
Paul Rich authored
cqm run_job will set a resource reservation in the system component if there isn't one already when starting a job.
-