Discussion:
Uniquely identifying a message and finding "duplicate"
(too old to reply)
MikeNike
2005-06-19 14:24:35 UTC
Permalink
Hi,

I have a requirement that I want to uniquely identify a message and at
the same time find out if a message is a "duplicate" of another message
from a same or different PST i.e for all practical purpose it is the
same message as original message. e.g. if a message M1 has been copied
as message M2 then I want to de-duplicate M2 against M1. or if I send a
message M1 to two different people & they both store the copy of their
message M3 and M4 in a PST file & give it to me. I should be able to
de-duplicate those two messages (M3 and M4) against M1. I know I
cannot use PR_ENTRYID as it would be different in a copy of the
message. I came acroos PR_SEARCH_KEY. I would like to know the
following things:

- Whether search key would be available in all the messages
irrespective of how they were generated?
- Can I use it to de-deuplicate the messages based on this key
(especially across PSTs)?
- If not, is there any better way to find duplicate of a given message
based on some MAPI properties?

Thanks a lot in advance

- Mukesh
Thomas Quester
2005-06-20 08:34:30 UTC
Permalink
The search key seems to be a bit differently implemented differently as the
mapi specification tells, it seems to be always a guid and it seems to be
the same if you copy a message. If this is true for all store providers,
nobody can tell because there is no excact documentation telling how to
implement the seach key so it may be that other store providers make
different search keys. You will get a new search key if you have emails
which are downloaded twice or for example contacts that imported twice.

I once made a duplicate remover in my application QSync and sorted the
contents table by normalized subject using the C function qsort then
iterating the row set and comparing other fields (message size, search key,
body and so on) on elements with the same subject, this way it is also
finding emails which have been received duplicate.

Thomas Quester
Post by MikeNike
Hi,
I have a requirement that I want to uniquely identify a message and at
the same time find out if a message is a "duplicate" of another message
from a same or different PST i.e for all practical purpose it is the
same message as original message. e.g. if a message M1 has been copied
as message M2 then I want to de-duplicate M2 against M1. or if I send a
message M1 to two different people & they both store the copy of their
message M3 and M4 in a PST file & give it to me. I should be able to
de-duplicate those two messages (M3 and M4) against M1. I know I
cannot use PR_ENTRYID as it would be different in a copy of the
message. I came acroos PR_SEARCH_KEY. I would like to know the
- Whether search key would be available in all the messages
irrespective of how they were generated?
- Can I use it to de-deuplicate the messages based on this key
(especially across PSTs)?
- If not, is there any better way to find duplicate of a given message
based on some MAPI properties?
Thanks a lot in advance
- Mukesh
Loading...