A KEM allows a sender who knows a public key to simultaneously generate a short random secret key and an encapsulation or ciphertext of the secret key by the KEM's encapsulation algorithm.
The receiver who knows the private key corresponding to the public key can recover the same random secret key from the encapsulation by the KEM's decapsulation algorithm.[1][2][3]
The security goal of a KEM is to prevent anyone who doesn't know the private key from recovering any information about the encapsulated secret keys, even after eavesdropping or submitting other encapsulations to the receiver to study how the receiver reacts.[1][2][3]
Difference from public-key encryption
The difference between a public-key encryption scheme and a KEM is that a public-key encryption scheme allows a sender to choose an arbitrary message from some space of possible messages, while a KEM chooses a short secret key at random for the sender.[1][2][3]
The sender may take the random secret key produced by a KEM and use it as a symmetric key for an authenticated cipher whose ciphertext is sent alongside the encapsulation to the receiver.
This serves to compose a public-key encryption scheme out of a KEM and a symmetric-key authenticated cipher in a hybrid cryptosystem.[1][2][3][5]
Most public-key encryption schemes such as RSAES-PKCS1-v1_5, RSAES-OAEP, and Elgamal encryption are limited to small messages[6][7] and are almost always used to encrypt a short random secret key in a hybrid cryptosystem anyway.[8][9][5]
And although a public-key encryption scheme can conversely be converted to a KEM by choosing a random secret key and encrypting it as a message, it is easier to design and analyze a secure KEM than to design a secure public-key encryption scheme as a basis.
So most modern public-key encryption schemes are based on KEMs rather than the other way around.[10][5]
Key generation, , takes no inputs and returns a pair of a public key and a private key .
Encapsulation, , takes a public key , randomly chooses a secret key , and returns along with its encapsulation .
Decapsulation, , takes a private key and an encapsulation , and either returns an encapsulated secret key or fails, sometimes denoted by returning (called ‘bottom’).
Correctness
A KEM is correct if, for any key pair generated by , decapsulating an encapsulation returned by with high probability yields the same key , that is, .[2][3][11][12]
Security: IND-CCA
Security of a KEM is quantified by its indistinguishability against chosen-ciphertext attack, IND-CCA, which is loosely how much better an adversary can do than a coin toss to tell whether, given a random key and an encapsulation, the key is encapsulated by that encapsulation or is an independent random key.[2][3][11][12]
Specifically, in the IND-CCA game:
The key generation algorithm is run to generate .
is revealed to the adversary.
The adversary can query for arbitrary encapsulations of the adversary's choice.
The encapsulation algorithm is run to randomly generate a secret key and encapsulation , and another secret key is generated independently at random.
A fair coin is tossed, giving an outcome .
The pair is revealed to the adversary.
The adversary can again query for arbitrary encapsulations of the adversary's choice, except for .
The adversary returns a guess , and wins the game if .
The IND-CCA advantage of the adversary is , that is, the probability beyond a fair coin toss at correctly distinguishing an encapsulated key from an independently randomly chosen key.
Generate a -bit semiprime with at random satisfying , where is the Carmichael function.
Compute .
Return as the public key and as the private key. (Many variations on key generation algorithms and private key formats are available.[16])
Encryption of -bit message to public key , giving :
Encode the bit string as an integer with .
Return .
Decryption of ciphertext with private key , giving :
Compute .
Decode the integer as a bit string .
This naive approach is totally insecure.
For example, since it is nonrandomized, it cannot be secure against even known-plaintext attack—an adversary can tell whether the sender is sending the message ATTACK AT DAWN versus the message ATTACK AT DUSK simply by encrypting those messages and comparing the ciphertext.
Even if is always a random secret key, such as a 256-bit AES key, when is chosen to optimize efficiency as , the message can be computed from the ciphertext simply by taking real number cube roots, and there are many other attacks against plain RSA.[13][14]
Various randomized padding schemes have been devised in attempts—sometimes failed, like RSAES-PKCS1-v1_5[13][17][18]—to make it secure for arbitrary short messages .[13][14]
Since the message is almost always a short secret key for a symmetric-keyauthenticated cipher used to encrypt an arbitrary bit string message, a simpler approach called RSA-KEM is to choose an element of at random and use that to derive a secret key using a key derivation function, roughly as follows:[19][8]
Key generation: As above.
Encapsulation for a public key , giving :
Choose an integer with uniformly at random.
Return and as its encapsulation.
Decapsulation of with private key , giving :
Compute .
Return .
This approach is simpler to implement, and provides a tighter reduction to the RSA problem, than padding schemes like RSAES-OAEP.[19]
Elgamal
Traditional Elgamal encryption is defined over a multiplicative subgroup of the finite field with generator of order as follows:[20][21]
Key generation, :
Choose uniformly at random.
Compute .
Return as the private key and as the public key.
Encryption of a message to public key , giving :
Choose uniformly at random.
Compute:
Return the ciphertext .
Decryption of a ciphertext for a private key , giving :
Fail and return if or if , i.e., if or is not in the subgroup generated by .
Compute .
Return .
This meets the syntax of a public-key encryption scheme, restricted to messages in the space (which limits it to message of a few hundred bytes for typical values of ).
By validating ciphertexts in decryption, it avoids leaking bits of the private key through maliciously chosen ciphertexts outside the group generated by .
However, this fails to achieve indistinguishability against chosen ciphertext attack.
For example, an adversary having a ciphertext for an unknown message can trivially decrypt it by querying the decryption oracle for the distinct ciphertext , yielding the related plaintext , from which can be recovered by .[20]
Traditional Elgamal encryption can be adapted to the elliptic-curve setting, but it requires some way to reversibly encode messages as points on the curve, which is less trivial than encoding messages as integers mod .[22]
Since the message is almost always a short secret key for a symmetric-keyauthenticated cipher used to encrypt an arbitrary bit string message, a simpler approach is to derive the secret key from and dispense with and altogether, as a KEM, using a key derivation function:[1]
Key generation: As above.
Encapsulation for a public key , giving :
Choose uniformly at random.
Compute .
Return and as its encapsulation.
Decapsulation of with private key , giving :
Fail and return if , i.e., if is not in the subgroup generated by .
Compute .
Return .
When combined with an authenticated cipher to encrypt arbitrary bit string messages, the combination is essentially the Integrated Encryption Scheme.
Since this KEM only requires a one-way key derivation function to hash random elements of the group it is defined over, in this case, and not a reversible encoding of messages, it is easy to extend to more compact and efficient elliptic curve groups for the same security, as in the ECIES, Elliptic Curve Integrated Encryption Scheme.
^ abcdefgGalbraith, Steven (2012). "§23.1.1: The KEM/DEM paradigm". Mathematics of Public-Key Cryptography. Cambridge University Press. pp. 471–478. ISBN978-1-107-01392-6.
^ abcdAumasson, Jean-Philippe (2018). "10. RSA". Serious Cryptography: A Practical Introduction to Modern Encryption. No Starch Press. pp. 181–199. ISBN978-1-59327-826-7.
^ abcStinson, Douglas R. (2006). "5. The RSA Cryptosystem and Factoring Integers". Cryptography Theory and Practice (3rd ed.). Chapman & Hall/CRC. pp. 161–232. ISBN978-1-58488-508-5.
^ abGalbraith, Steven (2012). "§20.3: Textbook Elgamal encryption". Mathematics of Public-Key Cryptography. Cambridge University Press. pp. 471–478. ISBN978-1-107-01392-6.