Is it wrong to use a hash for a unique ID?

Question

I want to use a unique ID generated by PHP in a database table that will likely never have more than 10,000 records. I don't want the time of creation to be visible, and I don't want a purely numeric value, so I am using:

Is it wrong to use a hash for a unique ID? Don't all hashes lead to collisions?

Accepted Answer

If you have 2 keys, the theoretical best-case probability of a collision is 1 in 2^X, where X is the number of bits in your hashing algorithm's output. It is the 'best case' because the input will usually be ASCII, which doesn't use the full character set, and because hash functions do not distribute perfectly, so in real life they will collide more often than that theoretical figure suggests. With n keys the chance grows roughly as n(n-1)/2 divided by 2^X, which is still tiny for 10,000 records and a 160-bit sha1 digest.

To answer your final question:

"A further point: if the number of characters to be hashed is less than the number of characters in a sha1 hash, won't it always be unique?"

Yeah, that's true, sort of. But you would then have another problem: generating unique keys of that size. The easiest way is usually a checksum, so just choose a digest large enough that the collision probability is small enough for your comfort.

As @wayne suggests, a popular approach is to concatenate microtime() to your random salt (and base64_encode the result if you want a non-numeric text form; encoding only changes the representation, it does not add entropy).
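As a rough illustration of the approach above, here is a minimal sketch, not a definitive implementation. The function names make_unique_id() and collision_probability() are hypothetical, and the choices of random_bytes() for the salt, SHA-256 as the digest, and a 16-character (64-bit) truncation are assumptions made for the example rather than anything the answer prescribes. The second function uses the standard birthday-bound approximation to estimate how likely a collision is for a given number of records.

```php
<?php
// Hypothetical helper: hash microtime() together with a random salt.
function make_unique_id(int $hexLength = 16): string
{
    // Assumption: random_bytes() (PHP 7+) supplies the "random salt"; 16 bytes = 128 bits.
    $salt = bin2hex(random_bytes(16));
    // microtime() returns a string such as "0.12345600 1700000000".
    $digest = hash('sha256', microtime() . $salt);
    // Keep only the first $hexLength hex characters (4 bits each).
    return substr($digest, 0, $hexLength);
}

// Hypothetical helper: birthday-bound approximation,
// p ~= 1 - exp(-n(n-1)/2 / 2^bits), assuming an ideal hash.
function collision_probability(int $records, int $bits): float
{
    $pairs = $records * ($records - 1) / 2;
    // -expm1(-x) equals 1 - exp(-x) but stays accurate for very small x.
    return -expm1(-$pairs / (2 ** $bits));
}

$id = make_unique_id();
echo $id, PHP_EOL;
printf("%.3e\n", collision_probability(10000, 64));
```

Under these assumptions, 10,000 records with 64-bit IDs give a collision probability on the order of 10^-12, while truncating to 32 bits would push it to roughly 1%. A unique index on the column is still worth having, so that a collision surfaces as an insert error instead of silently duplicating an ID.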

Answer