1. 03 May, 2018 1 commit
  2. 09 Apr, 2018 2 commits
  3. 26 Mar, 2018 1 commit
    • Eric Pershey's avatar
      Added instrumented logging to many of the components. · b9de5d5e
      Eric Pershey authored
      Wrapped Proxy errors with sanatize password.
      Cleaned up logging spelling mistakes.
      Added extract_traceback to format the exceptions
      Added get_current_thread_identifier to help identify threads
      Added get_caller to help identify the caller.
      Added cray_messaging to aid in debugging and allow importing.
      Fixed a bug that would allow retrying of a fork.
      Added debugging tools of slp_hammer.py and slp_nail.py
      Updated the simulator to simulate background tasks.
      b9de5d5e
  4. 05 Mar, 2018 2 commits
  5. 14 Feb, 2018 1 commit
  6. 02 Feb, 2018 3 commits
    • Paul Rich's avatar
      Fixing docstrings and remooving defunct variable · 281a9c68
      Paul Rich authored
      queue is no longer needed for cleanup, and the docstrings should
      reflect the changed behavior.
      281a9c68
    • Paul Rich's avatar
      Bringing in tests for reservatins + disjoint queues · 3c00ba6f
      Paul Rich authored
      These are the test cases for the bug that precipitated this hunt.  These
      should largely be degenerate with the other single-equivalence cases
      (because they are now single-equivalence cases), but do things like
      make sure that drains are getting correctly cleared and times are
      getting correctly reset.
      3c00ba6f
    • Paul Rich's avatar
      Refactor find_queue_equivalence_classes and drain clear code · 2cda012d
      Paul Rich authored
      After discussions, the current find_queue_equivalence_classes for this
      system really only complicates the codebase for very little actual gain.
      After this, the system will have only one equivalence class at all times
      consisting of all active queues assigned to nodes and all active
      reservations.
      
      This simplification allows us to ensure that find_job_location only gets
      called twice, once for reservations, which ignore drain times, and then
      immediately after for the normal "production" queue jobs, which do set
      drain times.  In both cases we can just clear drain times across the
      machine.
      
      In addition to testing (and more tests coming for the case that caused
      this examination to begin with), we know that this works, as any system
      with a queue or set of overlapping queues across all resources on the
      machine forms a single equivalence class under the old code.
      2cda012d
  7. 03 Jul, 2017 2 commits
  8. 23 Nov, 2016 1 commit
  9. 03 Nov, 2016 1 commit
  10. 06 Oct, 2016 1 commit
  11. 26 Sep, 2016 2 commits
  12. 23 Sep, 2016 2 commits
  13. 16 Sep, 2016 1 commit
    • Paul Rich's avatar
      Draining and backfilling basics operational. · 7856d38b
      Paul Rich authored
      Draining and backfilling are passing basic tests.  Need to add more test
      cases to the automated suite and test corner cases around
      queues/reservations/locations list.
      
      Also need to add backfill time display to nodelist/nodeadm -l.
      7856d38b
  14. 14 Sep, 2016 1 commit
  15. 13 Sep, 2016 4 commits
  16. 07 Sep, 2016 1 commit
  17. 29 Aug, 2016 1 commit
  18. 24 Aug, 2016 4 commits