2006.04.24 11:39 "RE: [SPAM HEADER] - [Tiff] Microsoft Document Imaging status / snapshot - Email found in subject", by Gerben Vos
0xef 0x82 0xa7 = some kind of bullet point symbol
0xef 0x82 0xb7 = some kind of bullet point symbol (different to a7)
0xe2 0x80 0x93 = em-dash
0xe2 0x80 0x9c = `` (smart doublequotes, left side of quoted material)
0xe2 0x80 0x9d = '' (smart doublequotes, right side of quoted material)
0xe2 0x80 0x99 = ' (apostrophe of some kind)
0xe2 0x80 0xa6
0xe2 0x80 0x94 = short dash?
0xc3 0xa9 = e with grave. (00a9 is the unicode equivalent, perhaps this will form some pattern)
These are clearly UTF-8 encoded Unicode characters:
U+F0A7 = (private use)
U+F0B7 = (private use)
U+2013 = en-dash (shorter than em-dash!)
U+201C = left double quote
U+201D = right double quote
U+2019 = right single quote
U+2026 = ellipsis (three dots)
U+2014 = em-dash (longer than en-dash!)
U+00E9 = e-acute (not e-grave; U+00A9 would be the copyright sign)
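
The mapping above can be checked mechanically. What follows is only a minimal sketch, assuming Python 3 and its standard `unicodedata` module; `manual_decode` and `describe` are hypothetical helper names made up for this illustration. It decodes each listed byte sequence both with the built-in UTF-8 codec and with the bit arithmetic done by hand, which also shows why 0xc3 0xa9 comes out as U+00E9.

```python
import unicodedata

# The byte sequences from the list above.
SEQUENCES = [
    bytes([0xEF, 0x82, 0xA7]),
    bytes([0xEF, 0x82, 0xB7]),
    bytes([0xE2, 0x80, 0x93]),
    bytes([0xE2, 0x80, 0x9C]),
    bytes([0xE2, 0x80, 0x9D]),
    bytes([0xE2, 0x80, 0x99]),
    bytes([0xE2, 0x80, 0xA6]),
    bytes([0xE2, 0x80, 0x94]),
    bytes([0xC3, 0xA9]),
]


def manual_decode(seq):
    """Decode one 2- or 3-byte UTF-8 sequence by hand (no validation)."""
    if len(seq) == 2:
        # 110xxxxx 10yyyyyy -> xxxxxyyyyyy
        return ((seq[0] & 0x1F) << 6) | (seq[1] & 0x3F)
    if len(seq) == 3:
        # 1110xxxx 10yyyyyy 10zzzzzz -> xxxxyyyyyyzzzzzz
        return ((seq[0] & 0x0F) << 12) | ((seq[1] & 0x3F) << 6) | (seq[2] & 0x3F)
    raise ValueError("only 2- and 3-byte sequences are handled in this sketch")


def describe(seq):
    """Return 'U+XXXX NAME' for a single UTF-8 encoded character."""
    ch = seq.decode("utf-8")
    cp = ord(ch)
    # The hand-rolled arithmetic should agree with the built-in codec.
    assert cp == manual_decode(seq)
    # Private-use code points have no character name, so supply a default.
    name = unicodedata.name(ch, "(no name, private use)")
    return "U+%04X %s" % (cp, name)


for seq in SEQUENCES:
    print(" ".join("0x%02x" % b for b in seq), "=", describe(seq))
```

For the last entry the arithmetic is (0xc3 & 0x1f) << 6 = 0xc0, plus (0xa9 & 0x3f) = 0x29, which gives 0xe9.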
Some of the ones you list (e.g., the first two bullets) are in the "implementation defined" part of Unicode, the Private Use Area (U+E000 to U+F8FF), so the standard itself assigns them no meaning. Lists with the Microsoft assignments in there are easy to find on the Internet.
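
A rough way to spot such code points when scanning extracted text is sketched below, again assuming Python 3. `is_private_use` and `symbol_offset` are hypothetical helpers. Treating the offset from U+F000 as a character code in some symbol font is only an assumption here; which font applies (if any) would have to come from one of those Microsoft lists.

```python
import unicodedata

# Private Use Area of the Basic Multilingual Plane.
PUA_START, PUA_END = 0xE000, 0xF8FF


def is_private_use(code_point):
    """True if the code point lies in the BMP Private Use Area."""
    return PUA_START <= code_point <= PUA_END


def symbol_offset(code_point):
    """Offset from U+F000 for code points in U+F000..U+F0FF, else None.

    Assumption: the offset might be a single-byte code in a symbol font;
    nothing in this sketch confirms which font that would be.
    """
    if 0xF000 <= code_point <= 0xF0FF:
        return code_point - 0xF000
    return None


for cp in (0xF0A7, 0xF0B7, 0x2013):
    # General category "Co" means "Other, private use".
    category = unicodedata.category(chr(cp))
    print("U+%04X private_use=%s category=%s offset=%s"
          % (cp, is_private_use(cp), category, symbol_offset(cp)))
```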
By the way, thanks for posting this; I was intending to try to figure this out, but had to postpone it.
Gerben.