A small numerical value used to detect data integrity errors.

A checksum is a small numerical value computed from a block of data and used to detect data integrity errors. For example, you can use a checksum to flag data corruption during transmission or storage. A checksum alone cannot be relied upon to authenticate data, because anyone who alters the data can also recompute the checksum.
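As a minimal sketch of both points, the Python example below uses CRC-32 from the standard `zlib` module. CRC-32 is only one possible checksum algorithm, chosen here for illustration, and the messages and the `is_intact` helper are hypothetical. The example flags an accidental bit flip, then shows why a bare checksum does not authenticate anything.

```python
import zlib


def is_intact(payload: bytes, checksum: int) -> bool:
    """Recompute the CRC-32 of the payload and compare it with the one received."""
    return zlib.crc32(payload) == checksum


# Sender side: compute the checksum and send it along with the data.
message = b"transfer 100 to alice"
checksum = zlib.crc32(message)

# Accidental corruption: a single bit flips somewhere in transit.
corrupted = bytes([message[0] ^ 0x01]) + message[1:]

intact_ok = is_intact(message, checksum)       # unchanged data passes
corrupted_ok = is_intact(corrupted, checksum)  # the bit flip is flagged

# Deliberate tampering: an attacker replaces the data AND recomputes the
# checksum, so the receiver's check still passes.
forged = b"transfer 900 to mallory"
forged_checksum = zlib.crc32(forged)
forged_ok = is_intact(forged, forged_checksum)

print(intact_ok, corrupted_ok, forged_ok)
```

The last check passes even though the data was replaced, which is why detecting tampering requires a keyed construction rather than a plain checksum.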

The procedure that generates a checksum is called a checksum function or checksum algorithm. Depending on its design goals, a good checksum algorithm usually outputs a significantly different value even for a small change to the input. This is especially true of cryptographic hash functions, which can be used to detect many data corruption errors and verify overall data integrity: if the checksum computed for the current input matches the stored value of a previously computed checksum, there is a very high probability that the data has not been accidentally altered or corrupted.
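The following Python sketch illustrates this behavior with SHA-256 from the standard `hashlib` module. The sample inputs and the helper names `bit_difference` and `matches_stored` are assumptions made for the example. It counts how many digest bits change when one character of the input changes, then compares freshly computed digests against a stored one.

```python
import hashlib


def bit_difference(a: bytes, b: bytes) -> int:
    """Count the bit positions in which two equal-length byte strings differ."""
    return sum(bin(x ^ y).count("1") for x, y in zip(a, b))


def matches_stored(data: bytes, stored_hex: str) -> bool:
    """Return True if the SHA-256 digest of data equals the stored hex digest."""
    return hashlib.sha256(data).hexdigest() == stored_hex


original = b"The quick brown fox jumps over the lazy dog"
altered = b"The quick brown fox jumps over the lazy cog"  # one character changed

digest_original = hashlib.sha256(original).digest()
digest_altered = hashlib.sha256(altered).digest()

# For a cryptographic hash, roughly half of the output bits are expected to flip.
changed_bits = bit_difference(digest_original, digest_altered)
print(f"{changed_bits} of {len(digest_original) * 8} digest bits differ")

# Integrity check against a previously stored checksum.
stored = hashlib.sha256(original).hexdigest()
print(matches_stored(original, stored))  # data unchanged
print(matches_stored(altered, stored))   # data changed
```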

Checksum functions are related to hash functions, fingerprints, randomization functions, and cryptographic hash functions. However, each of those concepts has different applications and therefore different design goals. For instance, a function returning the start of a string can provide a hash appropriate for some applications but will never be a suitable checksum, because it ignores every change made after that prefix. Checksums are also used as cryptographic primitives in larger authentication algorithms. For cryptographic systems designed to provide both data integrity and authentication, see HMAC.
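The Python sketch below contrasts these design goals. The `prefix_hash` function, the sample payload, and the key are hypothetical, and CRC-32 and HMAC-SHA-256 are illustrative choices rather than the only options. A prefix-based hash misses damage at the end of the data, a checksum catches it, and an HMAC tag additionally depends on a secret key.

```python
import hashlib
import hmac
import zlib


def prefix_hash(data: bytes, length: int = 4) -> bytes:
    """A toy hash: just the first few bytes of the input."""
    return data[:length]


def hmac_verify(key: bytes, data: bytes, tag: bytes) -> bool:
    """Recompute the HMAC-SHA-256 tag and compare it in constant time."""
    expected = hmac.new(key, data, hashlib.sha256).digest()
    return hmac.compare_digest(expected, tag)


original = b"checksum example payload"
damaged = original[:-1] + b"X"  # only the last byte differs

# The prefix hash is identical for both inputs, so the damage goes unnoticed.
prefix_detects = prefix_hash(original) != prefix_hash(damaged)

# CRC-32 covers every byte, so the change is detected.
crc_detects = zlib.crc32(original) != zlib.crc32(damaged)

# HMAC ties the tag to a shared secret key (hypothetical value here).
key = b"hypothetical shared secret"
tag = hmac.new(key, original, hashlib.sha256).digest()

print(prefix_detects, crc_detects)
print(hmac_verify(key, original, tag))
print(hmac_verify(key, damaged, tag))
print(hmac_verify(b"wrong key", original, tag))
```

Without the key, a party who modifies the data cannot produce a matching tag, which is the property a plain checksum lacks.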

Check digits and parity bits are special cases of checksums, appropriate for small blocks of data such as Social Security numbers, bank account numbers, computer words, and single bytes. Some error-correcting codes are based on special checksums that not only detect common errors but also allow the original data to be recovered in certain cases.
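As a rough illustration of both special cases, the Python sketch below computes an even parity bit for a single byte and a check digit using the Luhn algorithm. Luhn is an assumption chosen for the example, since the text does not name a specific check digit scheme, and the function names are hypothetical.

```python
def even_parity_bit(byte: int) -> int:
    """Return the bit that makes the total number of 1 bits even."""
    return bin(byte & 0xFF).count("1") % 2


def luhn_check_digit(digits: str) -> int:
    """Compute the Luhn check digit for a string of decimal digits."""
    total = 0
    # Walk right to left, doubling every second digit starting with the rightmost.
    for position, char in enumerate(reversed(digits)):
        value = int(char)
        if position % 2 == 0:
            value *= 2
            if value > 9:
                value -= 9
        total += value
    return (10 - total % 10) % 10


def luhn_is_valid(number: str) -> bool:
    """Check that the last digit of number is the Luhn check digit of the rest."""
    return luhn_check_digit(number[:-1]) == int(number[-1])


# Parity: a single flipped bit changes the parity, a double flip does not.
word = 0b10110010
stored_parity = even_parity_bit(word)
single_flip_detected = even_parity_bit(word ^ 0b00000100) != stored_parity
double_flip_detected = even_parity_bit(word ^ 0b00000110) != stored_parity
print(single_flip_detected, double_flip_detected)

# Check digit: append it to the payload, then validate the full number.
payload = "7992739871"
full_number = payload + str(luhn_check_digit(payload))
print(full_number, luhn_is_valid(full_number))
print(luhn_is_valid("79927398710"))  # last digit mistyped
```

The parity example also shows the limit of such a small checksum: an even number of flipped bits leaves the parity unchanged and goes undetected.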

Last modified on April 23, 2021, 12:33 pm.