brainpy.optim.MomentumNesterov#

class brainpy.optim.MomentumNesterov(lr, train_vars=None, weight_decay=None, momentum=0.9, name=None)[source]#

Nesterov accelerated gradient optimizer 2.

\[\begin{split}\begin{align} \begin{split} v_t &= \gamma v_{t-1} + \eta \nabla_\theta J( \theta - \gamma v_{t-1} ) \\ \theta &= \theta - v_t \end{split} \end{align}\end{split}\]

Parameters: lr (float, Scheduler) – learning rate.

References

2: Nesterov, Y. (1983). A method for unconstrained convex minimization problem with the rate of convergence o(1/k2). Doklady ANSSSR (translated as Soviet.Math.Docl.), vol. 269, pp. 543– 547.

__init__(lr, train_vars=None, weight_decay=None, momentum=0.9, name=None)[source]#

Methods

`__init__`(lr[, train_vars, weight_decay, ...])
`check_grads`(grads)
`cpu`()	Move all variable into the CPU device.
`cuda`()	Move all variables into the GPU device.
`load_state_dict`(state_dict[, warn])	Copy parameters and buffers from `state_dict` into this module and its descendants.
`load_states`(filename[, verbose])	Load the model states.
`nodes`([method, level, include_self])	Collect all children nodes.
`register_implicit_nodes`(*nodes[, node_cls])
`register_implicit_vars`(*variables, ...)
`register_train_vars`([train_vars])
`register_vars`([train_vars])
`save_states`(filename[, variables])	Save the model states.
`state_dict`()	Returns a dictionary containing a whole state of the module.
`to`(device)	Moves all variables into the given device.
`tpu`()	Move all variables into the TPU device.
`train_vars`([method, level, include_self])	The shortcut for retrieving all trainable variables.
`tree_flatten`()	Flattens the object as a PyTree.
`tree_unflatten`(aux, dynamic_values)	New in version 2.3.1.
`unique_name`([name, type_])	Get the unique name for this object.
`update`(grads)
`vars`([method, level, include_self, ...])	Collect all variables in this node and the children nodes.

Attributes

name

Name of the model.