1. 03 May, 2018 1 commit
  2. 09 Apr, 2018 2 commits
  3. 26 Mar, 2018 1 commit
    • Eric Pershey's avatar
      Added instrumented logging to many of the components. · b9de5d5e
      Eric Pershey authored and Eric Pershey's avatar Eric Pershey committed
      Wrapped Proxy errors with sanatize password.
      Cleaned up logging spelling mistakes.
      Added extract_traceback to format the exceptions
      Added get_current_thread_identifier to help identify threads
      Added get_caller to help identify the caller.
      Added cray_messaging to aid in debugging and allow importing.
      Fixed a bug that would allow retrying of a fork.
      Added debugging tools of slp_hammer.py and slp_nail.py
      Updated the simulator to simulate background tasks.
      b9de5d5e
  4. 23 Mar, 2018 1 commit
  5. 15 Mar, 2018 1 commit
  6. 05 Mar, 2018 2 commits
  7. 26 Feb, 2018 1 commit
  8. 15 Feb, 2018 1 commit
  9. 14 Feb, 2018 1 commit
  10. 07 Feb, 2018 1 commit
  11. 02 Feb, 2018 3 commits
    • Paul Rich's avatar
      Fixing docstrings and remooving defunct variable · 281a9c68
      Paul Rich authored
      queue is no longer needed for cleanup, and the docstrings should
      reflect the changed behavior.
      281a9c68
    • Paul Rich's avatar
      Bringing in tests for reservatins + disjoint queues · 3c00ba6f
      Paul Rich authored
      These are the test cases for the bug that precipitated this hunt.  These
      should largely be degenerate with the other single-equivalence cases
      (because they are now single-equivalence cases), but do things like
      make sure that drains are getting correctly cleared and times are
      getting correctly reset.
      3c00ba6f
    • Paul Rich's avatar
      Refactor find_queue_equivalence_classes and drain clear code · 2cda012d
      Paul Rich authored
      After discussions, the current find_queue_equivalence_classes for this
      system really only complicates the codebase for very little actual gain.
      After this, the system will have only one equivalence class at all times
      consisting of all active queues assigned to nodes and all active
      reservations.
      
      This simplification allows us to ensure that find_job_location only gets
      called twice, once for reservations, which ignore drain times, and then
      immediately after for the normal "production" queue jobs, which do set
      drain times.  In both cases we can just clear drain times across the
      machine.
      
      In addition to testing (and more tests coming for the case that caused
      this examination to begin with), we know that this works, as any system
      with a queue or set of overlapping queues across all resources on the
      machine forms a single equivalence class under the old code.
      2cda012d
  12. 13 Nov, 2017 1 commit
  13. 03 Oct, 2017 1 commit
  14. 21 Sep, 2017 1 commit
  15. 03 Jul, 2017 2 commits
  16. 24 Jan, 2017 2 commits
  17. 08 Dec, 2016 1 commit
    • Paul Rich's avatar
      Commiting work to date on backfiller revisions for BG/Q · d0a678e6
      Paul Rich authored
      After discussion the current algorithm for determining backfill time
      needs to be replaced and needs to depend on which blocks are selected
      for draining.  This is a commit for the current algorithm's optimistic
      and pessimistic backfill modes.
      d0a678e6
  18. 06 Dec, 2016 1 commit
  19. 23 Nov, 2016 1 commit
  20. 03 Nov, 2016 1 commit
  21. 31 Oct, 2016 1 commit
  22. 06 Oct, 2016 1 commit
  23. 26 Sep, 2016 2 commits
  24. 23 Sep, 2016 2 commits
  25. 16 Sep, 2016 1 commit
    • Paul Rich's avatar
      Draining and backfilling basics operational. · 7856d38b
      Paul Rich authored
      Draining and backfilling are passing basic tests.  Need to add more test
      cases to the automated suite and test corner cases around
      queues/reservations/locations list.
      
      Also need to add backfill time display to nodelist/nodeadm -l.
      7856d38b
  26. 14 Sep, 2016 1 commit
  27. 13 Sep, 2016 4 commits
  28. 12 Sep, 2016 1 commit
  29. 08 Sep, 2016 1 commit