Root Mean Square Layer Normalization (RMSNorm) is a normalization technique that normalizes the input by the root mean square of its elements. Variants:Documentation Index
Fetch the complete documentation index at: https://bench.flashinfer.ai/docs/llms.txt
Use this file to discover all available pages before exploring further.
- Standard RMSNorm: basic RMS normalization that scales input by RMS and applies learned weight parameters
- Fused Add RMSNorm: adds residual connection before normalization in a single fused operation
batch_size: variablehidden_size: constant
hidden_states: [batch_size, hidden_size]weight: [hidden_size]- For Fused Add RMSNorm only:
residual: [batch_size, hidden_size]
output: [batch_size, hidden_size]

