fastvideo.v1.layers.layernorm#
Custom normalization layers.
Module Contents#
Classes#
- LayerNormScaleShift: Fused operation that combines LayerNorm with scale and shift operations. This reduces memory bandwidth by combining memory-bound operations.
- RMSNorm: Root mean square normalization.
- ScaleResidual: Applies gated residual connection.
- ScaleResidualLayerNormScaleShift: Fused operation that combines a gated residual connection, LayerNorm, and scale/shift operations.
API#
- class fastvideo.v1.layers.layernorm.LayerNormScaleShift(hidden_size: int, norm_type: str = 'rms', eps: float = 1e-06, elementwise_affine: bool = False, dtype: torch.dtype = torch.float32, prefix: str = '')[source]#
Bases:
torch.nn.Module
Fused operation that combines LayerNorm with scale and shift operations. This reduces memory bandwidth by combining memory-bound operations.
Initialization
Initialize internal Module state, shared by both nn.Module and ScriptModule.
- forward(x: torch.Tensor, shift: torch.Tensor, scale: torch.Tensor) torch.Tensor [source]#
Apply layer normalization followed by scale and shift in a single fused operation.
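A minimal usage sketch, based on the constructor and forward signatures above. The broadcast shapes chosen for shift and scale (per sample, broadcast over the sequence dimension) are an assumption, not part of the documented API.

```python
import torch
from fastvideo.v1.layers.layernorm import LayerNormScaleShift

hidden_size = 1024
norm = LayerNormScaleShift(hidden_size, norm_type="rms", eps=1e-6)

x = torch.randn(2, 16, hidden_size)      # (batch, seq_len, hidden)
shift = torch.randn(2, 1, hidden_size)   # modulation shift (assumed per-sample)
scale = torch.randn(2, 1, hidden_size)   # modulation scale (assumed per-sample)

out = norm(x, shift, scale)              # fused norm -> scale -> shift
print(out.shape)                         # torch.Size([2, 16, 1024])
```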
- class fastvideo.v1.layers.layernorm.RMSNorm(hidden_size: int, eps: float = 1e-06, dtype: torch.dtype = torch.float32, var_hidden_size: Optional[int] = None, has_weight: bool = True)[source]#
Bases:
fastvideo.v1.layers.custom_op.CustomOp
Root mean square normalization.
Computes x -> w * x / sqrt(E[x^2] + eps) where w is the learned weight. Refer to https://arxiv.org/abs/1910.07467
Initialization
Initialize internal Module state, shared by both nn.Module and ScriptModule.
- forward_native(x: torch.Tensor, residual: Optional[torch.Tensor] = None) Union[torch.Tensor, Tuple[torch.Tensor, torch.Tensor]] [source]#
PyTorch-native implementation equivalent to forward().
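The documented formula can be reproduced in a few lines of plain PyTorch. This reference sketch mirrors forward_native for the no-residual case; the fused-residual path implied by the optional residual argument is not shown, since its exact behavior is not documented here.

```python
import torch

def rms_norm_reference(x: torch.Tensor, weight: torch.Tensor,
                       eps: float = 1e-6) -> torch.Tensor:
    # w * x / sqrt(E[x^2] + eps), with the mean taken over the hidden (last) dim.
    variance = x.pow(2).mean(dim=-1, keepdim=True)
    return weight * x * torch.rsqrt(variance + eps)

x = torch.randn(2, 16, 1024)
w = torch.ones(1024)                          # stand-in for the learned weight
print(rms_norm_reference(x, w).shape)         # torch.Size([2, 16, 1024])
```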
- class fastvideo.v1.layers.layernorm.ScaleResidual(prefix: str = '')[source]#
Bases:
torch.nn.Module
Applies gated residual connection.
Initialization
Initialize internal Module state, shared by both nn.Module and ScriptModule.
- forward(residual: torch.Tensor, x: torch.Tensor, gate: torch.Tensor) torch.Tensor [source]#
Apply gated residual connection.
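The docstring only names the operation, so the formula below is a plausible reading rather than the confirmed implementation: the gate scales the incoming branch before it is added to the residual.

```python
import torch

def scale_residual_reference(residual: torch.Tensor, x: torch.Tensor,
                             gate: torch.Tensor) -> torch.Tensor:
    # Assumed gating convention: residual + gate * x.
    return residual + gate * x

residual = torch.randn(2, 16, 1024)
x = torch.randn(2, 16, 1024)
gate = torch.randn(2, 1, 1024)    # assumed per-sample gate, broadcast over seq_len
print(scale_residual_reference(residual, x, gate).shape)
```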
- class fastvideo.v1.layers.layernorm.ScaleResidualLayerNormScaleShift(hidden_size: int, norm_type: str = 'rms', eps: float = 1e-06, elementwise_affine: bool = False, dtype: torch.dtype = torch.float32, prefix: str = '')[source]#
Bases:
torch.nn.Module
Fused operation that combines:
- Gated residual connection
- LayerNorm
- Scale and shift operations
This reduces memory bandwidth by combining memory-bound operations.
Initialization
Initialize internal Module state, shared by both nn.Module and ScriptModule.
- forward(residual: torch.Tensor, x: torch.Tensor, gate: torch.Tensor, shift: torch.Tensor, scale: torch.Tensor) Tuple[torch.Tensor, torch.Tensor] [source]#
Apply gated residual connection, followed by layernorm and scale/shift in a single fused operation.
- Returns:
  - normalized and modulated output
  - residual value (the value after the residual connection but before normalization)
- Return type:
  Tuple[torch.Tensor, torch.Tensor]
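Putting the pieces together, an unfused reference for this forward might look like the sketch below. The gating convention (residual + gate * x) and the modulation convention (normed * (1 + scale) + shift) are assumptions carried over from the sketches above, not taken from the source; only the return contract (modulated output plus pre-normalization residual) follows the documentation.

```python
import torch

def fused_reference(residual: torch.Tensor, x: torch.Tensor, gate: torch.Tensor,
                    shift: torch.Tensor, scale: torch.Tensor, eps: float = 1e-6):
    # 1) Gated residual connection (assumed convention: residual + gate * x).
    new_residual = residual + gate * x
    # 2) RMS norm over the hidden dim (norm_type="rms", elementwise_affine=False).
    variance = new_residual.pow(2).mean(dim=-1, keepdim=True)
    normed = new_residual * torch.rsqrt(variance + eps)
    # 3) Scale/shift modulation (assumed convention: normed * (1 + scale) + shift).
    out = normed * (1.0 + scale) + shift
    # Return the modulated output and the value after the residual connection
    # but before normalization, matching the documented return tuple.
    return out, new_residual
```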