RELEASE_NOTES 6.08 KB
Newer Older
1 2 3 4 5 6 7 8
0.5.0

general:
==========

codes-base and codes-net have been combined into a single project (now at
https://xgitlab.cels.anl.gov/codes/codes).

9
updated to ROSS revision d9cef53.
10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84

fixed a large number of warnings across the codebase.

networks:
==========

addition of the SlimFly network topology, corresponding to the Wolfe et al.
  paper "Modeling a Million-node Slim Fly Network using Parallel Discrete-event
  Simulation", to appear at SIGSIM-PADS'16. See ___

modelnet now supports sampling at regular intervals. Dragonfly LPs can
  currently make use of this - others can be added based on demand. See
  ___.

allow 0-byte messages in model-net.

enable "local" model-net messages (for LPs sharing the same model-net endpoint -
  approximates a zero-copy ownership pass of the payload)

the model_net_event family of functions now return a token value which must
  be passed to model_net_event_rc2.

workloads:
==========

concurrent workload support added to workload generators and the MPI
  simulation layer. See codes-jobmap.h and the modified codes-workload.h.
  Thanks to Xu Yang for the partial contributions. Not all workload generators
  support concurrent workloads at this time.

scripts for generating job allocations specific to the torus and dragonfly
  topologies. See scripts/allocation_gen. Thanks to Xu Yang for the
  contributions.

multiple fixes to the MPI simulation layer and DUMPI workload generator.

concurrent workload support and more flexible rank mappings in the MPI
  simulation layer. Thanks to Xu Yang for the initial code.

removed scalatrace workload generator, which never made it to a usable state.

a new checkpoint IO workload generator has been added, based on the Daly paper
  "A higher order estimate of the optimum checkpoint interval for restart
  dumps" at Future Generation Computing Systems 2004. See ___.

utilities:
==========

fixes to rc-stack - memory no longer leaks in sequential mode, and optimistic
  debug mode is now supported.

added a more performance-sensitive function, codes_mapping_get_lp_info2, to
  codes-mapping (passes const pointers around instead of copying strings)

added a "mapping context" API for better controlling implicit LP-LP mappings,
  including modelnet, local storage model, and resource. See
  codes/codes-mapping-context.h and added functions in the mentioned LPs for
  details.

formalized a callback mechanism for CODES, replacing previous ad-hoc
  methods of passing control from LPs to arbitrary other LPs. The request and
  local storage model LP APIs have been changed to use this mechanism, and
  model-net has additional APIs to use user-provided mapping contexts. See
  codes/codes-callback.h for the API and tests/resource-test.c for advanced
  usage.

deprecations:
==========

model_net_event_rc (use model_net_event_rc2, which will eventually be renamed
  to model_net_event_rc)

codes_event_new (define your own bounds-checking macro if need be)


85 86
0.4.0

87 88
codes-base

89 90
general:
==========
91

92
significant source reorganization / refactoring
93

94
refactor some private headers out of the public eye
95

96 97 98 99
dead code removal

documentation:
==========
100

101
improved example_heterogeneous example program
102

Jonathan Jenkins's avatar
Jonathan Jenkins committed
103
added configuration to example_heterogeneous showing two torus networks
104

105 106
reorganized files to prevent name collisions on OSX. Top-level docs other than
  copyright now in doc directory
107

108 109 110 111
additions to best practice document

configurator:
==========
112

113
more stable file format for configurator output
114

115
ignore unrelated parameters passed into filter_configs
116

117 118 119 120
handle empty cfields in configurator

workloads:
==========
121

122
combined network and IO workload APIs into a single one
123

124
adding dumpi workload support in codes-workload-dump utility
125

126
workload dump utility option cleanup
127

128
renamed "bgp" workload generator to "iolang", significant cleanups
129

130
put network workload ops in workload dump util
131

132
removing one of the dumpi libraries from the build. It was generating some unwanted dumpi files.
133

134 135 136 137
network workload API more fleshed out

utilities:
==========
138

Jonathan Jenkins's avatar
Jonathan Jenkins committed
139
configuration bug fixes for larger LP type counts
140

141
resource LP annotation mapping hooks
142

143
local storage model API switch to use annotations
144

145
better configuration error handling
146

147
hedge against precision loss in codes_local_latency (see codes.h)
148

149 150 151
use a different RNG than default for codes_local_latency
- prevents addition/removal of codes_local_latency calls from poisoning RNG
  stream of calling model
152

153 154
added simple GVT-aware stack with garbage collection (see rc-stack.h)

155 156
codes-net

157 158
general:
==========
159

160
cleanup of much of the code base
161

162
more informative error for failure to find modelnet lps
163

Jonathan Jenkins's avatar
Jonathan Jenkins committed
164
removed redundant include directory on install (was 'install/codes/codes/*.h')
165 166 167

documentation:
==========
168

169 170
reorganized files to prevent name collisions on OSX. Top-level docs other than
  copyright now in doc directory
171

172
updated code documentation
173

174
fix linker error in certain cases with codes-base
175

176 177 178 179 180
tweaked config error handling


networks:
==========
181
fix to loggp latency calculation when using "receive queue"
182

183
made torus lps agnostic to groups and aware of annotations
184

Jonathan Jenkins's avatar
Jonathan Jenkins committed
185
miscellaneous fixes to dragonfly model
186

187 188
updates to simplep2p: support for having different latency/bw at sender &
  receiver end. See src/models/networks/model-net/doc/README.simplep2p.txt
189

190
minor fixes to usage of quickhash in replay tool
191

192
fixed RNG reverse computation bug in loggp
193

194 195 196 197
fixed swapped arguments in round-robin scheduler causing short circuit

workloads:
==========
198

199
minor changes to dumpi trace config files
200

201
resolving minor bug with reverse computation in dumpi traces
202

203
Updating network trace code to use the combined workload API
204

205
Adding synthetic traffic patterns (currently with dragonfly model)
206

207
Adding network workload test program for debugging
208

209
Updating MPI wait/wait_all code in replay tool
210

211 212
0.3.0

213 214
codes-base

215 216
Initial "official" release. Against previous repository revisions, this release
includes more complete documentation.
217 218 219

codes-net

Jonathan Jenkins's avatar
Jonathan Jenkins committed
220 221 222 223
Initial "official" release. Against previous repository revisions, this release
includes more complete documentation and a rename of the "simplewan" model to
the "simplep2p" (simple point-to-point) model to more accurately represent
what it's modeling.