Releases: justindujardin/mathy

v0.7.3: chore(release): 0.7.3

30 Mar 20:32

0.7.3 (2020-03-17)

Features

  • env: add print_history helper (41b3a0f)

v0.7.2: chore(release): 0.7.2

30 Mar 20:32

0.7.2 (2020-03-16)

Bug Fixes

  • gym: return node ids and action mask for np observations (d04366e)

v0.7.1: chore(release): 0.7.1

30 Mar 20:32

0.7.1 (2020-03-15)

Features

  • Add support for integrating with Fragile library (be3ab58)

v0.7.0: chore(release): 0.7.0

30 Mar 20:31

0.7.0 (2020-03-08)

Bug Fixes

  • a3c: remove root noise from action selector (16f86ff)
  • a3c: use episode outcome for log coloring (910bcd6)
  • cli: use greedy selector during inference (15cc58a)
  • env: clamp episode win signal to 2.0 max (3d2d78b)
  • env: remove reentrant state reward scaling (0849e3c)
  • get_terms_ex: support negated implicit coefficients (f763e20)
  • parser: memory leak in cache (6b7a847)
  • rewards: restore reentrant state scaling (1361d74)
  • rules: make commutative swap choose the closest sibling (f32600e)

chore

  • drop time feature from embedding (f5740ad)

Code Refactoring

  • model: remove episode long RNN state tracking (11095ab)

Features

  • a3c: add bahdanau attention layer (daba776)
  • a3c: add exponential decay to learning rate (684191d)
  • a3c: add self-attention over sequences (b750bfc)
  • a3c: use stepped learning rate decay (e9cd8f5)
  • embedding: use bilstm for node sequences (ad23139)
  • embedding: use LSTMs for batch and time axes (a8f0d54)
  • mathy_alpha_sm: more stable recurrent model (02e63e2)
  • training: add yellow output to weak wins (fd9998a)

BREAKING CHANGES

  • this removes a model feature, which makes previously pretrained models incompatible
  • model: this removes long-term RNN state tracking across episodes. Tracking the state required a significant amount of code, and it wasn't clear that it made the model substantially better at any given task. Keeping many hidden states in memory and calculating state histories also added noticeable overhead on CPU training setups.

v0.6.7: chore(release): 0.6.7

30 Mar 20:31

0.6.7 (2020-02-10)

Features

  • mathy_pydoc: fix formatting of str defaults (b4f6fde)

v0.6.6: chore(release): 0.6.6

30 Mar 20:31

0.6.6 (2020-02-10)

Features

  • build: deploy mathy_pydoc package to pypi (e2d5775)
  • mathy_pydoc: cleanup return type annotations (186be77)
  • mathy_pydoc: preserve Optional types in docs (830c949)
  • mathy_pydoc: unwrap ForwardRef types (4e172c4)

v0.6.5: chore(release): 0.6.5

30 Mar 20:31
Compare
Choose a tag to compare

0.6.5 (2020-01-27)

Bug Fixes

  • build: really fix typing extensions (7f15bca)

v0.6.4: chore(release): 0.6.4

30 Mar 20:30

0.6.4 (2020-01-26)

Bug Fixes

  • package: require typing_extensions (55b0bc9)

v0.6.3: chore(release): 0.6.3

30 Mar 20:30

0.6.3 (2020-01-26)

Bug Fixes

  • commutative_swap: don't transform commute chains in ways that cause inner-nesting (ed662e3)
  • model: remove second LSTM from recurrent model (0241070)
  • model: when transferring weights from another model, copy the config file too (401da56)
  • policy_value_model: value head was not learning from hidden state (ee77ae5)
  • sleep: use smaller worker_wait defaults (460f80c)
  • training: use n-step windows during a3c training (02b11ee)

Features

  • a3c: replace set_weights/get_weights with thinc from_bytes/to_bytes (b04fbce)
  • a3c: set update frequency so multiple updates happen per episode (1a28a9d)
  • build: add tools/clean.sh (e3c2308)
  • cli: add --lr for setting adam learning rate (427352f)
  • config: add prediction_window_size (d4095c5)
  • envs: rebalance poly/complex difficulties (88a9b30)
  • mathy_alpha_sm: add pretrained model with simplified architecture (5365a26)
  • add mathy.example helper for generating inputs (92695d6)
  • MathyWindowObservation: add option to return inputs using numpy instead of tf.Tensor (0c76609)
  • tensorflow: update to 2.1.0 (95e764e)

v0.6.1: chore(release): 0.6.1

30 Mar 20:29

0.6.1 (2020-01-13)

Features

  • config: add print_model_call_times option (47ad597)