maml_step

ivy.maml_step(batch, inner_cost_fn, outer_cost_fn, variables, inner_grad_steps, inner_learning_rate, /, *, inner_optimization_step=<function gradient_descent_update>, inner_batch_fn=None, outer_batch_fn=None, average_across_steps=False, batched=True, inner_v=None, keep_inner_v=True, outer_v=None, keep_outer_v=True, return_inner_v=False, num_tasks=None, stop_gradients=True)

Perform one step of vanilla second-order MAML.
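
To make the term "second-order" concrete, here is a minimal pure-Python sketch. It is not ivy's implementation: it assumes a single scalar variable, the toy cost c(v) = x / v with hand-written derivatives, and a mean over tasks. The names `cost`, `d_cost`, `d2_cost` and `maml_step_scalar` are hypothetical. The outer gradient is propagated back through every inner update with the chain rule, which is where the second derivative of the cost enters.

```python
def cost(x, v):
    """Toy cost c(v) = x / v for one task with data x."""
    return x / v


def d_cost(x, v):
    """First derivative dc/dv = -x / v**2."""
    return -x / v ** 2


def d2_cost(x, v):
    """Second derivative d2c/dv2 = 2x / v**3."""
    return 2.0 * x / v ** 3


def maml_step_scalar(xs, v0, inner_grad_steps, inner_learning_rate):
    """Return (mean outer cost, d(mean outer cost)/d v0) over the tasks in xs."""
    total_cost, total_grad = 0.0, 0.0
    for x in xs:
        v = v0
        dv_dv0 = 1.0  # derivative of the adapted variable w.r.t. the initial one
        for _ in range(inner_grad_steps):
            # Jacobian of the update v - lr * c'(v), evaluated before updating v
            dv_dv0 *= 1.0 - inner_learning_rate * d2_cost(x, v)
            v = v - inner_learning_rate * d_cost(x, v)
        total_cost += cost(x, v)
        # chain rule: outer gradient flows back through all inner updates
        total_grad += d_cost(x, v) * dv_dv0
    n = len(xs)
    return total_cost / n, total_grad / n


mean_cost, mean_grad = maml_step_scalar([1.0, 2.0], 1.0, 5, 0.01)
```

Holding `dv_dv0` at 1 throughout would give a first-order approximation, which ignores how the inner updates depend on the initial variable.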

Parameters:

batch (Container) – The input batch.
inner_cost_fn (Callable) – Callable for the inner loop cost function, receiving the task-specific sub-batch and a container of the variables (inner and outer).
outer_cost_fn (Callable) – Callable for the outer loop cost function, receiving the task-specific sub-batch and a container of the variables (inner and outer). If None, the cost from the inner loop will also be optimized in the outer loop.
variables (Container) – Variables to be optimized during the meta step.
inner_grad_steps (int) – Number of gradient steps to perform during the inner loop.
inner_learning_rate (float) – The learning rate of the inner loop.
inner_optimization_step (Callable, default: ivy.gradient_descent_update) – The function used for the inner loop optimization. Default is ivy.gradient_descent_update.
inner_batch_fn (Optional[Callable], default: None) – Function to apply to the task sub-batch, before passing to the inner_cost_fn. Default is None.
outer_batch_fn (Optional[Callable], default: None) – Function to apply to the task sub-batch, before passing to the outer_cost_fn. Default is None.
average_across_steps (bool, default: False) – Whether to average the inner loop steps for the outer loop update. Default is False.
batched (bool, default: True) – Whether to batch along the time dimension, and run the meta steps in batch. Default is True.
inner_v (Optional[Container], default: None) – Nested variable keys to be optimized during the inner loop, given as a container with the same keys and boolean values. Default is None.
keep_inner_v (bool, default: True) – If True, the key chains in inner_v will be kept, otherwise they will be removed. Default is True.
outer_v (Optional[Container], default: None) – Nested variable keys to be optimized during the outer loop, given as a container with the same keys and boolean values. Default is None.
keep_outer_v (bool, default: True) – If True, the key chains in outer_v will be kept, otherwise they will be removed. Default is True.
return_inner_v (Union[str, bool], default: False) – Either ‘first’, ‘all’, or False. With ‘first’, the variables from the first task's inner loop are also returned. With ‘all’, the variables from every task's inner loop are returned. Default is False.
num_tasks (Optional[int], default: None) – Number of unique tasks to inner-loop optimize for the meta step. Determined from batch by default.
stop_gradients (bool, default: True) – Whether to stop the gradients of the cost. Default is True.
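
The effect of average_across_steps can be sketched in plain Python. This is an illustration under assumptions, not ivy's code: it assumes the cost is evaluated before each inner update and once more after the last one, and that averaging divides by inner_grad_steps + 1. The helpers `adapted_costs` and `outer_cost`, and the toy cost x / v, are hypothetical.

```python
def adapted_costs(x, v0, inner_grad_steps, inner_learning_rate):
    """Costs x / v recorded before each inner update and after the final one."""
    v = v0
    costs = []
    for _ in range(inner_grad_steps):
        costs.append(x / v)
        # gradient descent on c(v) = x / v, whose derivative is -x / v**2
        v = v - inner_learning_rate * (-x / v ** 2)
    costs.append(x / v)  # cost after the last inner update
    return costs


def outer_cost(x, v0, inner_grad_steps, inner_learning_rate,
               average_across_steps=False):
    """Outer-loop cost for one task, optionally averaged over inner steps."""
    costs = adapted_costs(x, v0, inner_grad_steps, inner_learning_rate)
    if average_across_steps:
        # assumed behaviour: mean over all inner_grad_steps + 1 evaluations
        return sum(costs) / len(costs)
    # otherwise only the fully adapted variable contributes
    return costs[-1]


final_only = outer_cost(1.0, 1.0, 5, 0.01)
averaged = outer_cost(1.0, 1.0, 5, 0.01, average_across_steps=True)
```

In this toy setting the cost falls with every inner step, so the averaged value is larger than the final-step value; averaging also lets the earlier, less adapted steps contribute to the outer gradient.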

Return type:

Tuple[Array, Container, Any]

Returns:

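ret – A tuple of the cost, the gradients with respect to the outer loop variables, and, when return_inner_v is set, the inner loop variables (an empty tuple otherwise, as in the example below).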
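
The returned gradients are what an outer optimizer would consume. As a rough sketch, with plain dictionaries standing in for containers, a gradient-descent meta-update could look like the following. The helper `apply_outer_update`, the outer learning rate, and the numeric values are all hypothetical; a real training loop would more likely hand the gradients to an optimizer.

```python
def apply_outer_update(variables, grads, outer_learning_rate):
    """Return new variables after one gradient-descent step on matching keys."""
    return {
        key: [v - outer_learning_rate * g for v, g in zip(values, grads[key])]
        for key, values in variables.items()
    }


# Hypothetical values shaped like the (cost, gradients, extra) result tuple.
cost, grads, _ = 1.4, {"latent": [-1.1]}, ()
variables = {"latent": [1.0]}

# A negative gradient moves the variable upwards under gradient descent.
updated = apply_outer_update(variables, grads, outer_learning_rate=0.1)
```

The update builds a new dictionary instead of mutating `variables`, so the pre-update values remain available for comparison.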

Examples

With ivy.Container input:

>>> import ivy
>>> from ivy.functional.ivy.gradients import _variable

>>> ivy.set_backend("torch")

>>> def inner_cost_fn(sub_batch, v):
...     return sub_batch.mean().x / v.mean().latent
>>> def outer_cost_fn(sub_batch, v):
...     return sub_batch.mean().x / v.mean().latent

>>> num_tasks = 2
>>> batch = ivy.Container({"x": ivy.arange(1, num_tasks + 1, dtype="float32")})
>>> variables = ivy.Container({
...     "latent": _variable(ivy.repeat(ivy.array([[1.0]]), num_tasks, axis=0))
... })

>>> ret = ivy.maml_step(batch, inner_cost_fn, outer_cost_fn, variables, 5, 0.01)
>>> print(ret)
(ivy.array(1.40069818), {
    latent: ivy.array([-1.13723135])
}, ())