If the code is making long sequential sweeps through one or more data structures that are contiguous in memory (e.g., the sort of code that benefits most directly from a vectorizing compiler and vector hardware) then typically a long cache fill will be desirable.