16 Jan
2002
16 Jan
'02
3:25 p.m.
Hi, Maik Jablonski wrote:
a simple approach: i would use hash() on a unique "property" of your objects... this can be the source of the document or a combination of unique properties...
for example: if you would like to store emails, you could use a hash of "sender","subject","date","body".
id=str(hash(sender+subject+date+body))
mails with exactly the same values for that would be identical (result will be the same hash-value)... and there's no need to store them.
No, not really. A hash is just very likely to be different for different data, but it is not guaranteed. Markus -- "GPL software is not free - the cost is cooperation"