8. Custom operators

Goals

At the end of this tutorial you will be able to:

Control the transformation to NNEF of nn.Module as you wish. This is often useful in case those modules are not representable in the jit graph of PyTorch or because you wish to use custom NNEF operator for your inference engine.

Prerequisite

PyTorch and Python basics
5 min to read this page

Sometimes you want to control how an operation is exported to NNEF. This can happen because you want to target a custom fragment in your inference engine instead of a long chain of primitives, or simply because the logic is not traceable faithfully.

This page shows two supported patterns: - New: t2n_extra custom ops (function-level, external-friendly) - ModuleInfoExtractor (module-level, legacy/advanced)

Both are first-class and can co-exist in the same model.

New: t2n_extra custom ops

Use PyTorch’s torch.library.custom_op to declare an opaque operation under the t2n_extra::<name> namespace, call it in your model, and register a small NNEF emit function with torch_to_nnef.op.extras.register("<name>").

Minimal end-to-end example:

# 1) Define a custom PyTorch op (anywhere that runs before export)
import torch
lib = torch.library.Library("t2n_extra", "DEF")
lib.define("my_relu(Tensor x) -> Tensor")

# Optional: eager/meta kernels for runtime use, not required for export-only
# lib.impl(...)

# 2) Register the NNEF handler in a module you can import
from torch_to_nnef.op.extras import register

@register("my_relu")
def my_relu(g, node, name_to_tensor, null_ref, *, torch_graph, inference_target, op_helper, **_):
    # Convert the input IR node to an NNEF tensor
    x = op_helper.get_or_add_tensor_variable_in_nnef(node.inputs[0])
    # Emit the NNEF op and bind its output to the traced tensor name
    y = op_helper.add_single_output_op_from_nnef_tensors(
        node=node,
        nnef_op_type="relu",
        inputs=x,
        force_full_output_tensor_name=node.outputs[0].export_name,
    )
    return []  # no custom fragment keys emitted

# 3) Call it from your model
class M(torch.nn.Module):
    def forward(self, x):
        return torch.ops.t2n_extra.my_relu(x)

# 4) Ensure the handler module is imported before export
#    Option A: explicitly import your module
import my_project.t2n_to_nnef_handlers  # noqa: F401

#    Option B: let the exporter auto-import it
from torch_to_nnef import export_model_to_nnef, TractNNEF
export_model_to_nnef(
    M(),
    args=(torch.randn(2, 3),),
    file_path_export="/tmp/m.nnef",
    inference_target=TractNNEF.latest(),
    load_extra_op_modules=["my_project.t2n_to_nnef_handlers"],
)

#    Option C: via env var (comma-separated modules)
#    TORCH_TO_NNEF_EXTRA_MODULES=my_project.t2n_to_nnef_handlers python export.py

Notes - Namespace: use t2n_extra::<name> only — the exporter routes those to your handler via torch_to_nnef.op.extras. - Helper: the op_helper offers small utilities to convert inputs and emit ops; see torch_to_nnef.op.helper.OpHelper. - Fragments: if your op maps to a custom fragment, return its key(s) from the handler so they are written into the archive.

See also - Example repo folder with a runnable script: https://github.com/sonos/torch-to-nnef/tree/main/examples/t2n_extra_custom_op - Real-world multi-step handler (Mamba selective scan): https://github.com/sonos/torch-to-nnef/blob/main/torch_to_nnef/op/extras/scan_ops.py and end-to-end example: https://github.com/sonos/torch-to-nnef/tree/main/examples/mamba

Module-level extractors (ModuleInfoExtractor)

Alternatively, you can intercept a torch.nn.Module call site and author its NNEF directly by subclassing torch_to_nnef.op.custom_extractors.ModuleInfoExtractor:

torch_to_nnef.op.custom_extractors.ModuleInfoExtractor

ModuleInfoExtractor()

Class to take manual control of NNEF expansion of a nn.Module.

You need to subclass it, and set MODULE_CLASS according to your targeted module.

Then write .convert_to_nnef according to your need.

Methods:

Name	Description
`convert_to_nnef`	Control NNEF content to be written for each MODULE_CLASS.
`generate_in_torch_graph`	Internal method called by torch_to_nnef ir_graph.
`get_by_kind`	Get ModuleInfoExtractor by kind in torch_to_nnef internal IR.
`get_by_module`	Search if the module is one of the MODULE_CLASS registered.
`ordered_args`	Odered args for the module call.

convert_to_nnef

convert_to_nnef(g, node, name_to_tensor, null_ref, torch_graph, inference_target, **kwargs)

Control NNEF content to be written for each MODULE_CLASS.

This happen at macro level when converting from internal IR to NNEF IR stage.

This is the Core method to overwrite in subclass.

It is no different than any op implemented in torch_to_nnef in the module

generate_in_torch_graph

generate_in_torch_graph(torch_graph, *args, **kwargs)

Internal method called by torch_to_nnef ir_graph.

get_by_kind `classmethod`

get_by_kind(kind: str)

Get ModuleInfoExtractor by kind in torch_to_nnef internal IR.

get_by_module `classmethod`

get_by_module(module: Module)

Search if the module is one of the MODULE_CLASS registered.

return appropriate ModuleInfoExtractor subclass if found

ordered_args

ordered_args(torch_graph)

Odered args for the module call.

Sometimes torch jit may reorder inputs. compared to targeted python ops in such case ordering need to be re-addressed

To make it work you need to accomplish 4 steps:

sub-classing it
defining its MODULE_CLASS attribute.
defining its convert_to_nnef
ensuring that the subclass you just defined is imported in your export script

This would look like:

from torch_to_nnef.op.custom_extractors import ModuleInfoExtractor

class MyCustomHandler(ModuleInfoExtractor):
    MODULE_CLASS = MyModuleToCustomConvert

    def convert_to_nnef(
        self,
        g,
        node,
        name_to_tensor,
        null_ref,
        torch_graph,
        **kwargs
    ):
        pass

You can take inspiration from our own management of RNN layers like:

torch_to_nnef.op.custom_extractors.LSTMExtractor
```
LSTMExtractor()
```
Bases: _RNNMixin, ModuleInfoExtractor

But ultimately this is just a chain of op's that needs to be written, inside the g graph, like we do when adding new aten operator

Which one should I use?

t2n_extra custom ops: best for function-style ops that you can call from Python (easy to ship externally; no coupling to a module class). You patch your model to call torch.ops.t2n_extra.<name>(...) and register one small handler function.
ModuleInfoExtractor: best when you need to preserve a module call boundary (e.g., complex nn.Module like RNNs) or need tight control over inputs/outputs independent of the traced function usage.

Both routes produce a single opaque node in the IR and then emit your custom NNEF subgraph; pick the one that matches your authoring style and where you want the “hook” to live (op call vs. module call).