Why does the volume latency or IOPS not match the aggregate in Active IQ Unified Manager or ONTAP?

ONTAP decouples frontend IOPS from backend due to optimizations and background workloads
Backend disk/aggregate IOPS should not be used as a metric for monitoring performance unless Performance Capacity hits 100% on the aggregate in Active IQ Unified Manager or disk latency is seen on user work

Reads are prefetched through ONTAP's readahead engine
- Readahead reduces latencies as readahead has been optimized for years and is very efficient at predicting accurately what is needed
- By prefetching, the reads are in cache (RAM) as the IOP comes in through the network
Reads are also cached in RAM, and may be cached using Flash Cache or Flash Pool technology with lower latency
Writes are cached in RAM until written asynchronously to disk in a consistency point, delivering low latency on writes
Other IOPS may not require going to disk as metadata structures are also cached in RAM as needed

Why Aggregate Latency graph in UM, shows constantly higher latency for one aggregate?
- This article also shows that AIQUM does a weighted latency and the value does not match what statit may show for latency

Example: The first volume listed on the left has a latency of 0.569 ms/op, while aggregate average latency is approximately 10 ms