Here are some lists of characters that are useful for normalization. I’ll probably add some others later.
The lists apply to Unicode version 5.1.
Combining characters with non-zero combining class
Characters with a non-zero canonical combining class are stored in a sparse array indexed by code point. Each entry's value is that character's combining class; any code point missing from the array has class 0.
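As a rough illustration of that layout, here is a minimal Python sketch that builds such a sparse table from the standard `unicodedata` module. This is an assumption on my part, not how the lists here were generated: `unicodedata` follows the Unicode version bundled with the interpreter rather than 5.1, so a few entries may differ. The names `build_combining_table` and `combining_class` are hypothetical.

```python
import sys
import unicodedata


def build_combining_table(limit=sys.maxunicode + 1):
    """Return a sparse mapping {code point: combining class}.

    Only code points whose canonical combining class is non-zero
    are stored; everything else is implicitly class 0.
    Assumption: the data comes from Python's bundled Unicode
    database, which is newer than Unicode 5.1.
    """
    table = {}
    for cp in range(limit):
        ccc = unicodedata.combining(chr(cp))
        if ccc != 0:
            table[cp] = ccc
    return table


# Build once; a dict plays the role of the sparse array.
COMBINING = build_combining_table()


def combining_class(cp):
    """Look up a code point, defaulting to 0 when it is absent."""
    return COMBINING.get(cp, 0)
```

A lookup such as `combining_class(0x0301)` (COMBINING ACUTE ACCENT) reads the stored value, while a base letter such as `combining_class(0x0041)` falls through to the default of 0 because it has no entry.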