diff --git a/bip-taproot.mediawiki b/bip-taproot.mediawiki
index d11f9361..8a51747f 100644
--- a/bip-taproot.mediawiki
+++ b/bip-taproot.mediawiki
@@ -83,68 +83,61 @@ The following rules only apply when such an output is being spent. Any other out
=== Signature validation rules ===
-The following rules apply:
+We first define a reusable common signature message calculation function, followed by the actual signature validation as it's used in key path spending.
-* If the signature is not 64['''Why permit two signature lengths?''' By making the most common type of ]hash_type
implicit, a byte can often be saved. or 65 bytes, fail.
-* If the signature size is 65 bytes:
-** If the final byte is not a valid hash_type
(defined hereinafter), fail.
-** If the final byte is 0x00
, fail['''Why can the ]hash_type
not be 0x00
in 65-byte signatures?''' Permitting that would enable malleating (by third parties, including miners) 64-byte signatures into 65-byte ones, resulting in a different `wtxid` and a different fee rate than the creator intended.
-** If the first 64 bytes are not a valid signature according to bip-schnorr for the public key and message set to the transaction digest with hash_type
set as the final byte, fail.
-* If the signature size is 64 bytes:
-** If it is not a valid signature according to bip-schnorr for the public key and the hash_type = 0x00
transaction digest as message, fail.
-* Otherwise the signature is valid.
+==== Common signature message ====
-==== hash_type ====
-
-hash_type
is an 8-bit unsigned value. The SIGHASH
encodings from the legacy script system are used, including SIGHASH_ALL
, SIGHASH_NONE
, SIGHASH_SINGLE
, and SIGHASH_ANYONECANPAY
. Use of the default hash_type = 0x00
results in signing over the whole transaction just as for SIGHASH_ALL
.
-
-The following use of hash_type
are invalid, and fail execution:
+The function ''SigMsg(hash_type, ext_flag)'' computes the message being signed as a byte array. It is implicitly also a function of the spending transaction and the outputs it spends, but these are not listed to keep notation simple.
+The parameter ''hash_type'' is an 8-bit unsigned value. The SIGHASH encodings from the legacy script system are reused, including SIGHASH_ALL, SIGHASH_NONE, SIGHASH_SINGLE, and SIGHASH_ANYONECANPAY, plus a default ''hash_type'' (0) which results in signing over the whole transaction just as for SIGHASH_ALL. The following restrictions apply, which cause validation failure if violated:
+* Using any undefined ''hash_type'' (not ''0x00'', ''0x01'', ''0x02'', ''0x03'', ''0x81'', ''0x82'', or ''0x83''['''Why reject unknown ''hash_type'' values?''' By doing so, it is easier to reason about the worst case amount of signature hashing an implementation with adequate caching must perform.]).
* Using SIGHASH_SINGLE without a "corresponding output" (an output with the same index as the input being verified).
-* Using any hash_type
value that is not 0x00
, 0x01
, 0x02
, 0x03
, 0x81
, 0x82
, or 0x83
['''Why reject unknown ]hash_type
values?''' By doing so, it is easier to reason about the worst case amount of signature hashing an implementation with adequate caching must perform..
-* The signature has 65 bytes, and hash_type
is 0x00
.
-==== Transaction digest ====
+The parameter ''ext_flag'' is an integer in range 0-127, and is used for indicating the presence of extensions.
-As the message for signature verification, transaction digest is ''hashTapSighash'' of the following values (size in byte) serialized. Numerical values in 2, 4, or 8-byte are encoded in little-endian.
+If the parameters take acceptable values, the message is the concatenation of the following data, in order (with the byte size of each item listed in parentheses). Numerical values of 2, 4, or 8 bytes are encoded in little-endian.
* Control:
-** epoch
(1): always 0. ['''What's the purpose of the epoch?''' The ]epoch
can be increased to allow securely creating new transaction digest algorithms with large changes to the structure or interpretation of hash_type
if needed.
-** hash_type
(1).
+** ''hash_type'' (1).
* Transaction data:
-** nVersion
(4): the nVersion
of the transaction.
-** nLockTime
(4): the nLockTime
of the transaction.
-** If the SIGHASH_ANYONECANPAY
flag is not set:
-*** sha_prevouts
(32): the SHA256 of the serialization of all input outpoints.
-*** sha_amounts
(32): the SHA256 of the serialization of all input amounts.
-*** sha_sequences
(32): the SHA256 of the serialization of all input nSequence
.
-** If neither the SIGHASH_NONE
nor the SIGHASH_SINGLE
flag is set:
-*** sha_outputs
(32): the SHA256 of the serialization of all outputs in CTxOut
format.
+** ''nVersion'' (4): the ''nVersion'' of the transaction.
+** ''nLockTime'' (4): the ''nLockTime'' of the transaction.
+** If ''hash_type & 0x80'' does not equal SIGHASH_ANYONECANPAY:
+*** ''sha_prevouts'' (32): the SHA256 of the serialization of all input outpoints.
+*** ''sha_amounts'' (32): the SHA256 of the serialization of all input amounts.
+*** ''sha_sequences'' (32): the SHA256 of the serialization of all input ''nSequence''.
+** If ''hash_type & 3'' does not equal SIGHASH_NONE or SIGHASH_SINGLE:
+*** ''sha_outputs'' (32): the SHA256 of the serialization of all outputs in CTxOut format.
* Data about this input:
-** spend_type
(1):
-*** Bit 0 is set if an annex is present (the original witness stack has two or more witness elements, and the first byte of the last element is 0x50
).
-*** The other bits are unset.
-** scriptPubKey
(35): scriptPubKey
of the previous output spent by this input, serialized as script inside CTxOut
. Its size is always 35 bytes.
-** If the SIGHASH_ANYONECANPAY
flag is set:
-*** outpoint
(36): the COutPoint
of this input (32-byte hash + 4-byte little-endian).
-*** amount
(8): value of the previous output spent by this input.
-*** nSequence
(4): nSequence
of this input.
-** If the SIGHASH_ANYONECANPAY
flag is not set:
-*** input_index
(4): index of this input in the transaction input vector. Index of the first input is 0.
-** If bit 0 of spend_type
is set:
-*** sha_annex
(32): the SHA256 of (compact_size(size of annex) || annex).
+** ''spend_type'' (1): equal to ''(ext_flag * 2) + annex_present'', where ''annex_present'' is 0 if no annex is present, or 1 otherwise (the original witness stack has two or more witness elements, and the first byte of the last element is ''0x50'').
+** ''scriptPubKey'' (35): ''scriptPubKey'' of the previous output spent by this input, serialized as script inside CTxOut. Its size is always 35 bytes.
+** If ''hash_type & 0x80'' equals SIGHASH_ANYONECANPAY:
+*** ''outpoint'' (36): the COutPoint of this input (32-byte hash + 4-byte little-endian).
+*** ''amount'' (8): value of the previous output spent by this input.
+*** ''nSequence'' (4): ''nSequence'' of this input.
+** If ''hash_type & 0x80'' does not equal SIGHASH_ANYONECANPAY:
+*** ''input_index'' (4): index of this input in the transaction input vector. Index of the first input is 0.
+** If an annex is present (the lowest bit of ''spend_type'' is set):
+*** ''sha_annex'' (32): the SHA256 of ''(compact_size(size of annex) || annex)'', where ''annex'' includes the mandatory ''0x50'' prefix.
* Data about this output:
-** If the SIGHASH_SINGLE
flag is set:
-*** sha_single_output
(32): the SHA256 of the corresponding output in CTxOut
format.
+** If ''hash_type & 3'' equals SIGHASH_SINGLE:
+*** ''sha_single_output'' (32): the SHA256 of the corresponding output in CTxOut format.
-The total number of bytes hashed is at most ''210'' (excluding sub-hashes such as `sha_prevouts`)['''What is the number of bytes hashed for the signature hash?''' The total size of the input to ''hashTapSighash'' (excluding the initial 64-byte hash tag) can be computed using the following formula: ''178 - is_anyonecanpay * 52 - is_none * 32 + has_annex * 32''.]. Sub-hashes may be cached across signatures of the same transaction.
+The total length of ''SigMsg()'' is at most ''209'' bytes['''What is the output length of ''SigMsg()''?''' The total length of ''SigMsg()'' can be computed using the following formula: ''177 - is_anyonecanpay * 52 - is_none * 32 + has_annex * 32''.]. Note that this does not include the size of sub-hashes such as ''sha_prevouts'', which may be cached across signatures of the same transaction.
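The length formula and the ''hash_type'' restrictions above can be checked with a short sketch. This is only an illustration, not part of the proposal: the names `VALID_HASH_TYPES`, `spend_type` and `sigmsg_length` are hypothetical, and the constants SIGHASH_NONE = 2 and SIGHASH_ANYONECANPAY = 0x80 are taken from the legacy encodings the text refers to.

```python
# Illustrative sketch only; function and constant names are hypothetical.
SIGHASH_NONE = 0x02          # legacy encoding
SIGHASH_ANYONECANPAY = 0x80  # legacy encoding
VALID_HASH_TYPES = {0x00, 0x01, 0x02, 0x03, 0x81, 0x82, 0x83}


def spend_type(ext_flag: int, annex_present: bool) -> int:
    """spend_type = (ext_flag * 2) + annex_present, with ext_flag in 0..127."""
    if not 0 <= ext_flag <= 127:
        raise ValueError("ext_flag out of range")
    return ext_flag * 2 + int(annex_present)


def sigmsg_length(hash_type: int, has_annex: bool) -> int:
    """Length of SigMsg() in bytes, following the formula in the footnote."""
    if hash_type not in VALID_HASH_TYPES:
        raise ValueError("undefined hash_type")
    is_anyonecanpay = (hash_type & 0x80) == SIGHASH_ANYONECANPAY
    is_none = (hash_type & 3) == SIGHASH_NONE
    return 177 - 52 * is_anyonecanpay - 32 * is_none + 32 * has_annex


# The maximum over all valid hash types, with an annex, matches the stated bound.
longest = max(sigmsg_length(h, True) for h in VALID_HASH_TYPES)
```

Here `longest` evaluates to 209, reached for example with the default ''hash_type'' and an annex present.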
In summary, the semantics of the [https://github.com/bitcoin/bips/blob/master/bip-0143.mediawiki BIP143] sighash types remain unchanged, except the following:
-# The way and order of serialization is changed.['''Why is the serialization in the transaction digest changed?''' Hashes that go into the digest and the digest itself are now computed with a single SHA256 invocation instead of double SHA256. There is no expected security improvement by doubling SHA256 because this only protects against length-extension attacks against SHA256 which are not a concern for transaction digests because there is no secret data. Therefore doubling SHA256 is a waste of resources. The digest computation now follows a logical order with transaction level data first, then input data and output data. This allows to efficiently cache the transaction part of the digest across different inputs using the SHA256 midstate. Additionally, sub-hashes can be skipped when calculating the digest (for example `sha_prevouts` if ]SIGHASH_ANYONECANPAY
is set) instead of setting them to zero and then hashing them as in BIP143. Despite that, collisions are made impossible by committing to the length of the data (implicit in hash_type
and spend_type
) before the variable length data.
-# The digest commits to the scriptPubKey
['''Why does the transaction digest commit to the ]scriptPubKey
?''' This prevents lying to offline signing devices about output being spent, even when the actually executed script (scriptCode
in BIP143) is correct. This means it's possible to compactly prove to a hardware wallet what (unused) execution paths existed..
+# The way and order of serialization is changed.['''Why is the serialization in the transaction digest changed?''' Hashes that go into the digest and the digest itself are now computed with a single SHA256 invocation instead of double SHA256. There is no expected security improvement by doubling SHA256 because this only protects against length-extension attacks against SHA256 which are not a concern for transaction digests because there is no secret data. Therefore doubling SHA256 is a waste of resources. The digest computation now follows a logical order with transaction level data first, then input data and output data. This allows efficient caching of the transaction part of the digest across different inputs using the SHA256 midstate. Additionally, sub-hashes can be skipped when calculating the digest (for example `sha_prevouts` if SIGHASH_ANYONECANPAY is set) instead of setting them to zero and then hashing them as in BIP143. Despite that, collisions are made impossible by committing to the length of the data (implicit in ''hash_type'' and ''spend_type'') before the variable length data.]
+# The digest commits to the ''scriptPubKey''['''Why does the transaction digest commit to the ''scriptPubKey''?''' This prevents lying to offline signing devices about output being spent, even when the actually executed script (''scriptCode'' in BIP143) is correct. This means it's possible to compactly prove to a hardware wallet what (unused) execution paths existed.].
# If the SIGHASH_ANYONECANPAY flag is not set, the digest commits to the amounts of ''all'' transaction inputs.['''Why does the transaction digest commit to the amounts of all transaction inputs?''' This eliminates the possibility to lie to offline signing devices about the fee of a transaction.]
-# The digest commits to all input nSequence
if SIGHASH_NONE
or SIGHASH_SINGLE
are set (unless SIGHASH_ANYONECANPAY
is set as well).['''Why does the transaction digest commit to all input ]nSequence
if SIGHASH_SINGLE
or SIGHASH_NONE
are set?''' Because setting them already makes the digest commit to the prevouts
part of all transaction inputs, it is not useful to treat the nSequence
any different. Moreover, this change makes nSequence
consistent with the view that SIGHASH_SINGLE
and SIGHASH_NONE
only modify the digest with respect to transaction outputs and not inputs.
-# The digest commits to taproot-specific data epoch
, spend_type
and annex
(if present).
+# The digest commits to all input ''nSequence'' if SIGHASH_NONE or SIGHASH_SINGLE is set (unless SIGHASH_ANYONECANPAY is set as well).['''Why does the transaction digest commit to all input ''nSequence'' if SIGHASH_SINGLE or SIGHASH_NONE is set?''' Because setting them already makes the digest commit to the prevouts part of all transaction inputs, it is not useful to treat the ''nSequence'' any differently. Moreover, this change makes ''nSequence'' consistent with the view that SIGHASH_SINGLE and SIGHASH_NONE only modify the digest with respect to transaction outputs and not inputs.]
+# The message includes commitments to the taproot-specific data ''spend_type'' and ''annex'' (if present).
+
+==== Taproot key path spending signature validation ====
+
+To validate a signature ''sig'' with public key ''q'':
+* If the ''sig'' is 64 bytes long, return ''Verify(q, hashTapSighash(0x00 || SigMsg(0x00, 0)), sig)''['''Why is the input to ''hashTapSighash'' prefixed with 0x00?''' This prefix is called the sighash epoch, and allows reusing the ''hashTapSighash'' tagged hash in future extensions that make invasive changes to how hashing is performed. An alternative is switching to a different tag, but supporting a growing number of tags may become undesirable.], where ''Verify'' is defined in bip-schnorr.
+* If the ''sig'' is 65 bytes long, return ''sig[64] ≠ 0x00['''Why can the ''hash_type'' not be ''0x00'' in 65-byte signatures?''' Permitting that would enable malleating (by third parties, including miners) 64-byte signatures into 65-byte ones, resulting in a different `wtxid` and a different fee rate than the creator intended.] and Verify(q, hashTapSighash(0x00 || SigMsg(sig[64], 0)), sig[0:64])''.
+* Otherwise, fail['''Why permit two signature lengths?''' By making the most common type of ''hash_type'' implicit, a byte can often be saved.].
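The three cases above can be sketched as a small dispatch on the signature length. This is a hedged illustration, not a reference implementation: `sig_msg` and `verify` are hypothetical callables standing in for ''SigMsg'' and the bip-schnorr ''Verify'', `sig_msg` is assumed to return `None` when a ''hash_type'' restriction is violated, and the tagged-hash construction (SHA256 of the tag hashed twice as a 64-byte prefix, with the tag string "TapSighash") is an assumption based on bip-schnorr's convention.

```python
import hashlib
from typing import Callable, Optional


def tagged_hash(tag: str, msg: bytes) -> bytes:
    """Assumed bip-schnorr style tagged hash: SHA256(SHA256(tag) || SHA256(tag) || msg)."""
    tag_hash = hashlib.sha256(tag.encode()).digest()
    return hashlib.sha256(tag_hash + tag_hash + msg).digest()


def validate_key_path_sig(
    q: bytes,
    sig: bytes,
    sig_msg: Callable[[int, int], Optional[bytes]],  # hypothetical SigMsg(hash_type, ext_flag)
    verify: Callable[[bytes, bytes, bytes], bool],   # hypothetical bip-schnorr Verify(pk, m, sig)
) -> bool:
    if len(sig) == 64:
        hash_type = 0x00              # implicit default hash_type
    elif len(sig) == 65:
        hash_type = sig[64]
        if hash_type == 0x00:         # explicit 0x00 is rejected to prevent malleation
            return False
    else:
        return False                  # any other length fails
    msg = sig_msg(hash_type, 0)       # ext_flag is 0 for key path spending
    if msg is None:                   # a SigMsg restriction was violated
        return False
    digest = tagged_hash("TapSighash", b"\x00" + msg)  # 0x00 is the sighash epoch
    return verify(q, digest, sig[:64])
```

Passing the two primitives in as parameters keeps the sketch independent of any particular Schnorr library.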
== Constructing and spending Taproot outputs ==
diff --git a/bip-tapscript.mediawiki b/bip-tapscript.mediawiki
index ed729c7a..3bbe8f3c 100644
--- a/bip-tapscript.mediawiki
+++ b/bip-tapscript.mediawiki
@@ -92,7 +92,7 @@ The following rules apply to OP_CHECKSIG, OP_CHECKSIGVERIFY
** If n is larger than 4 bytes, the script MUST fail and terminate immediately.
* If the public key size is zero, the script MUST fail and terminate immediately.
* If the public key size is 32 bytes, it is considered to be a public key as described in bip-schnorr:
-** If the signature is not the empty vector, the signature is validated according to the bip-taproot signature validation rules against the public key and the tapscript transaction digest (to be defined hereinafter) as message. Validation failure MUST cause the script to fail and terminate immediately.
+** If the signature is not the empty vector, the signature is validated against the public key (see the next subsection).
* If the public key size is not zero and not 32 bytes, the public key is of an ''unknown public key type''['''Unknown public key types''' allow adding new signature validation rules through softforks. A softfork could add actual signature validation which either passes or makes the script fail and terminate immediately. This way, new SIGHASH modes can be added, as well as [https://lists.linuxfoundation.org/pipermail/bitcoin-dev/2018-December/016549.html NOINPUT-tagged public keys] and a public key constant which is replaced by the taproot internal key for signature validation.] and no actual signature verification is applied. During script execution of signature opcodes they behave exactly as known public key types except that signature validation is considered to be successful.
* If the script did not fail and terminate before this step, regardless of the public key type:
** If the signature is the empty vector:
@@ -104,26 +104,20 @@ The following rules apply to OP_CHECKSIG, OP_CHECKSIGVERIFY
*** For OP_CHECKSIG, a 1-byte value 0x01 is pushed onto the stack.
*** For OP_CHECKSIGADD, a CScriptNum with value of n + 1 is pushed onto the stack.
-===Transaction digest===
+===Signature validation===
-As the message for signature opcodes signature verification, transaction digest has the same definition as in bip-taproot, except the following:
+To validate a signature ''sig'' with public key ''p'':
+* Compute the tapscript message extension ''ext'', consisting of:
+** ''tapleaf_hash'' (32): the tapleaf hash as defined in bip-taproot
+** ''key_version'' (1): a constant value ''0x00'' representing the current version of public keys in the tapscript signature opcode execution.
+** ''codesep_pos'' (4): the opcode position of the last executed OP_CODESEPARATOR before the currently executed signature opcode, with the value in little endian (or ''0xffffffff'' if none executed). The first opcode in a script has a position of 0. A multi-byte push opcode is counted as one opcode, regardless of the size of data being pushed.
+* If the ''sig'' is 64 bytes long, return ''Verify(p, hashTapSighash(0x00 || SigMsg(0x00, 1) || ext), sig)'', where ''Verify'' is defined in bip-schnorr.
+* If the ''sig'' is 65 bytes long, return ''sig[64] ≠ 0x00 and Verify(p, hashTapSighash(0x00 || SigMsg(sig[64], 1) || ext), sig[0:64])''.
+* Otherwise, fail.
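As a rough illustration of the extension layout described above, the 37-byte ''ext'' could be assembled as follows. The function name `tapscript_ext` and the constant `NO_CODESEP` are hypothetical; the field order, sizes, and little-endian encoding follow the list above.

```python
import struct

NO_CODESEP = 0xFFFFFFFF  # codesep_pos value when no OP_CODESEPARATOR was executed
KEY_VERSION = b"\x00"    # current public key version for tapscript signature opcodes


def tapscript_ext(tapleaf_hash: bytes, codesep_pos: int = NO_CODESEP) -> bytes:
    """Return tapleaf_hash (32) || key_version (1) || codesep_pos (4, little-endian)."""
    if len(tapleaf_hash) != 32:
        raise ValueError("tapleaf_hash must be 32 bytes")
    # "<I" packs an unsigned 32-bit integer in little-endian byte order.
    return tapleaf_hash + KEY_VERSION + struct.pack("<I", codesep_pos)


# Example with a placeholder (all-zero) tapleaf hash and a codeseparator at opcode 5.
example_ext = tapscript_ext(bytes(32), 5)
```

Because ''codesep_pos'' comes last, everything hashed before it is identical for all signature opcodes in the same script execution.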
-The one-byte spend_type
has a different value, specifically at bit 1:
-* Bit 0 is set if an annex is present (the original witness stack has at least two witness elements, and the first byte of the last element is 0x50
).
-* Bit 1 is set.
-* The other bits are unset.
-
-As additional pieces of data, added at the end of the input to the ''hashTapSighash'' function:
-* tapleaf_hash
(32): the tapleaf hash as defined in bip-taproot
-* key_version
(1): a constant value 0x00
representing the current version of public keys in the tapscript signature opcode execution.
-* codeseparator_position
(4): the opcode position of the last executed OP_CODESEPARATOR
before the currently executed signature opcode, with the value in little endian (or 0xffffffff
if none executed). The first opcode in a script has a position of 0. A multi-byte push opcode is counted as one opcode, regardless of the size of data being pushed.
-
-The total number of bytes hashed is at most ''247''['''What is the number of bytes hashed for the signature hash?''' The total size of the input to ''hashTapSighash'' (excluding the initial 64-byte hash tag) can be computed using the following formula: ''215 - is_anyonecanpay * 52 - is_none * 32 + has_annex * 32''.].
-
-In summary, the semantics of the [https://github.com/bitcoin/bips/blob/master/bip-0143.mediawiki BIP143] sighash types remain unchanged, except the following:
-# The exceptions mentioned in bip-taproot.
-# The digest commits to taproot-specific data key_version
.['''Why does the transaction digest commit to the ]key_version
?''' This is for future extensions that define unknown public key types, making sure signatures can't be moved from one key type to another.
-# The digest commits to the executed script through the tapleaf_hash
which includes the leaf version and script instead of scriptCode
. This implies that this commitment is unaffected by OP_CODESEPARATOR
.
+In summary, the semantics of signature validation are identical to bip-taproot, except for the following:
+# The digest commits to tapscript-specific data ''key_version''.['''Why does the transaction digest commit to the ''key_version''?''' This is for future extensions that define unknown public key types, making sure signatures can't be moved from one key type to another.]
+# The digest commits to the executed script through the ''tapleaf_hash'' which includes the leaf version and script instead of ''scriptCode''. This implies that this commitment is unaffected by OP_CODESEPARATOR.
# The digest commits to the opcode position of the last executed OP_CODESEPARATOR.['''Why does the transaction digest commit to the position of the last executed OP_CODESEPARATOR?''' This allows continuing to use OP_CODESEPARATOR to sign the executed path of the script. Because the codeseparator_position is the last input to the digest, the SHA256 midstate can be efficiently cached for multiple OP_CODESEPARATORs in a single script. In contrast, the BIP143 handling of OP_CODESEPARATOR is to commit to the executed script only from the last executed OP_CODESEPARATOR onwards which requires unnecessary rehashing of the script. It should be noted that the one known OP_CODESEPARATOR use case of saving a second public key push in a script by sharing the first one between two code branches can be most likely expressed even cheaper by moving each branch into a separate taproot leaf.]
===Resource limits===