Package convex.core.data
Class BlobTree
- All Implemented Interfaces: IValidated, IWriteable, Comparable<ABlob>
Implementation of a large Blob data structure consisting of 2 or more chunks.
The intention is to enable relatively large binary content to be handled without
too many tree levels, and without too many references in a single tree node.
We choose a branching factor of 16 as a reasonable tradeoff.
- Level 1 can hold up to 64 KB
- Level 2 can hold up to 1 MB
- Level 3 can hold up to 16 MB
- Level 4 can hold up to 256 MB
- ...
- Level 15 (max) should be big enough for the moment
One smart reference is maintained for each child node at each level
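The level capacities listed above follow from a 4096-byte chunk (the figure quoted for calcChunks) and a branching factor of 16. The following is a minimal sketch of that arithmetic, not the library's code: the class name, the `capacity` helper, and the shift value of 4 are assumptions made for illustration.

```java
final class BlobTreeCapacitySketch {
    static final long CHUNK_LENGTH = 4096;     // chunk size quoted in the calcChunks description
    static final int BIT_SHIFT_PER_LEVEL = 4;  // assumed value: log2 of the branching factor 16

    /** Maximum number of bytes a tree with the given number of levels could hold. */
    static long capacity(int level) {
        if (level < 1 || level > 12) {
            // above level 12 the result no longer fits in a signed 64-bit long
            throw new IllegalArgumentException("level out of range for this sketch: " + level);
        }
        return CHUNK_LENGTH << (BIT_SHIFT_PER_LEVEL * level);
    }

    public static void main(String[] args) {
        for (int level = 1; level <= 4; level++) {
            System.out.println("Level " + level + ": " + capacity(level) + " bytes");
        }
    }
}
```

Each extra level multiplies the capacity by 16, which is why four levels already reach 256 MB.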
-
Field Summary
Fields
Modifier and Type, Field, Description
static final int BIT_SHIFT_PER_LEVEL
static final int FANOUT
static final int MAX_ENCODING_SIZE: Maximum byte length of an encoded BlobTree node.
Fields inherited from class convex.core.data.ABlob
contentHash
Fields inherited from class convex.core.data.ACell
cachedRef, EMPTY_ARRAY
-
Method Summary
Modifier and Type, Method, Description
append: Appends another blob to this BlobTree.
boolean appendHex(BlobBuilder bb, long length): Append hex string up to the given length in hex digits (a multiple of two)
byte byteAtUnchecked(long i): Gets the byte at the specified position in this data object, without bounds checking.
static long calcChunks(long length): Computes the number of chunks (4096 bytes or less) for the canonical BlobTree of the given length
static int childCount(long length): Gets the number of children for a BlobTree of given total length.
static long childSize(long length): Gets the size of a BlobTree child for a blob of given total length.
long commonHexPrefixLength: Computes the length of the longest common hex prefix between two blobs
long count(): Gets the length of this Blob
static BlobTree create: Create a BlobTree from an arbitrary Blob.
static BlobTree createWithChildren(ABlob[] children): Create a BlobTree with the given children.
int encode(byte[] bs, int pos): Writes this Cell's encoding to a byte array, including a tag byte which will be written first.
int encodeRaw(byte[] bs, int pos): Writes this Blob's encoding to a byte array, excluding the tag byte
boolean equals: Determines if this Blob is equal to another Blob.
boolean equals
boolean equalsBytes(byte[] bytes, int byteOffset): Tests if the byte contents of this instance are equal to a subset of a byte array
boolean equalsBytes(ABlob b): Tests if this Blob has exactly the same bytes as another Blob
int estimatedEncodingSize(): Estimate the encoded data size for this Cell.
getByteBuffer: Gets a byte buffer containing this Blob's raw data.
int getBytes(byte[] dest, int pos): Copies the bytes from this instance to a given destination
getChunk(long chunkIndex): Gets a chunk of this Blob, as a canonical Blob up to the maximum chunk size.
getRef(int i): Gets a numbered child Ref from within this Cell.
int getRefCount(): Gets the number of Refs contained within this Cell.
byte getTag(): Gets the tag byte for this cell.
long hexMatchLength(ABlob b, long start, long length): Returns the number of matching hex digits in the given hex range of another Blob.
boolean isCanonical(): Returns true if this Cell is in a canonical representation for message writing.
boolean isChunkPacked(): Returns true if this is a fully packed set of chunks
boolean isFullyPacked(): Returns true if this is a fully packed set of chunks
long longValue(): Converts this Blob to the corresponding long value.
static BlobTree read: Reads an encoded BlobTree from a Blob.
slice(long start, long end): Gets a contiguous slice of this Blob, as a new Blob.
toCanonical: Converts this Cell to its canonical version.
long toExactLong(): Gets the long value of this Blob if the length is exactly 8 bytes, otherwise throws an Exception
toFlatBlob: Converts this object to a flat array-backed Blob instance.
protected void updateDigest(MessageDigest digest)
updateRefs(IRefFunction func): Updates all Refs in this object using the given function.
void validate(): Validates the complete structure of this object.
void validateCell: Validates the local structure and invariants of this cell.
Methods inherited from class convex.core.data.ABlob
byteAt, compareTo, computeHash, empty, equals, get, getContentHash, getElementRef, getHexDigit, getType, hashCode, hexEquals, hexEquals, hexLength, isCVMValue, isRegularBlob, print, shortAt, slice, toByteBuffer, toHexString, toHexString
Methods inherited from class convex.core.data.ACountable
isEmpty, size
Methods inherited from class convex.core.data.ACell
announce, announce, attachMemorySize, attachRef, cachedEncoding, cachedHash, calcMemorySize, createAnnounced, createEncoding, createPersisted, createPersisted, createRef, equals, getCanonical, getChildRefs, getEncoding, getEncodingLength, getHash, getMemorySize, getRef, isCompletelyEncoded, isEmbedded, mark, mark, toCVMString, toString
Methods inherited from class convex.core.data.AObject
attachEncoding, print, print
-
Field Details
-
BIT_SHIFT_PER_LEVEL
public static final int BIT_SHIFT_PER_LEVEL
-
FANOUT
public static final int FANOUT
-
MAX_ENCODING_SIZE
public static final int MAX_ENCODING_SIZE
Maximum byte length of an encoded BlobTree node. Note:
- The last child might be embedded; the others cannot
- With a maximum of 16 children, it is not possible to have the biggest VLC length
-
-
Method Details
-
create
Create a BlobTree from an arbitrary Blob. Must be of sufficient size to convert to BlobTree.
- Parameters:
blob - Source of BlobTree data
- Returns:
- New BlobTree instance
-
createWithChildren
Create a BlobTree with the given children. SECURITY: Does not validate children in any way.
- Parameters:
children - Child blobs for this BlobTree node
- Returns:
- New BlobTree instance
-
calcChunks
public static long calcChunks(long length)
Computes the number of chunks (4096 bytes or less) for the canonical BlobTree of the given length.
- Parameters:
length - The length of the Blob in bytes
- Returns:
- Number of chunks needed for a given byte length.
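The chunk count described here amounts to a ceiling division by the 4096-byte chunk size. The sketch below illustrates that calculation in plain Java; it is not the library source, and the class name and the handling of zero and negative lengths are assumptions.

```java
final class CalcChunksSketch {
    static final long CHUNK_LENGTH = 4096; // chunk size from the description above

    /** Number of chunks of at most 4096 bytes needed to hold 'length' bytes. */
    static long calcChunks(long length) {
        if (length < 0) {
            throw new IllegalArgumentException("negative length: " + length); // assumed behaviour
        }
        if (length == 0) {
            return 0; // assumed behaviour for an empty blob
        }
        // ceiling division, written so that it cannot overflow for very large lengths
        return ((length - 1) / CHUNK_LENGTH) + 1;
    }
}
```

For example, 4096 bytes fit in a single chunk, while 4097 bytes need two.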
-
isCanonical
public boolean isCanonical()
Description copied from class: ACell
Returns true if this Cell is in a canonical representation for message writing. Non-canonical objects may be used on a temporary internal basis, but they must always be converted to canonical representations for external use (e.g. Encoding).
- Specified by:
isCanonical in class ACell
- Returns:
- true if the object is in canonical format, false otherwise
-
getBytes
public int getBytes(byte[] dest, int pos)
Description copied from class: ABlobLike
Copies the bytes from this instance to a given destination
-
count
public long count()
Description copied from class: ABlob
Gets the length of this Blob
-
slice
Description copied from class: ABlob
Gets a contiguous slice of this Blob, as a new Blob. Shares underlying backing data where possible.
-
toFlatBlob
Description copied from class: ABlob
Converts this object to a flat array-backed Blob instance. Warning: might be O(n) in size of Blob, may not be canonical etc.
- Specified by:
toFlatBlob in class ABlob
- Returns:
- A Blob instance containing the same data as this Blob.
-
updateDigest
- Specified by:
updateDigest in class ABlob
-
byteAtUnchecked
public byte byteAtUnchecked(long i)
Description copied from class: ABlob
Gets the byte at the specified position in this data object, without bounds checking. Only safe if index is known to be in bounds, otherwise result is undefined.
- Specified by:
byteAtUnchecked in class ABlob
- Parameters:
i - Index of the byte to get
- Returns:
- The byte at the specified position
-
equals
Description copied from class: ABlob
Determines if this Blob is equal to another Blob. Blobs are defined to be equal if they have the same on-chain representation, i.e. if and only if all of the following are true:
- Blob is of the same general type
- Blobs are of the same length
- All byte values are equal
-
equals
-
equalsBytes
public boolean equalsBytes(byte[] bytes, int byteOffset)
Description copied from class: ABlob
Tests if the byte contents of this instance are equal to a subset of a byte array
- Specified by:
equalsBytes in class ABlob
- Parameters:
bytes - Byte array to compare with
byteOffset - Offset into byte array from which to start comparison
- Returns:
- true if exactly equal, false otherwise
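The comparison against a subset of a byte array can be pictured with a flat array standing in for the blob content. This is a rough sketch under that assumption; the class name, the extra `content` parameter, and the choice to return false for an out-of-range offset are illustrative and not taken from the library.

```java
import java.util.Arrays;

final class EqualsBytesSketch {
    /** True if 'content' equals bytes[byteOffset .. byteOffset + content.length). */
    static boolean equalsBytes(byte[] content, byte[] bytes, int byteOffset) {
        long end = (long) byteOffset + content.length; // long arithmetic avoids int overflow
        if (byteOffset < 0 || end > bytes.length) {
            return false; // assumption: a range that does not fit is treated as "not equal"
        }
        // range-based comparison, available since Java 9
        return Arrays.equals(content, 0, content.length, bytes, byteOffset, (int) end);
    }
}
```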
-
equalsBytes
Description copied from class: ABlob
Tests if this Blob has exactly the same bytes as another Blob
- Specified by:
equalsBytes in class ABlob
- Parameters:
b - Blob to compare with
- Returns:
- True if byte content is exactly equal, false otherwise
-
encode
public int encode(byte[] bs, int pos)
Description copied from class: ACell
Writes this Cell's encoding to a byte array, including a tag byte which will be written first. Cell must be canonical, or else an error may occur.
- Specified by:
encode in interface IWriteable
- Specified by:
encode in class ACell
- Parameters:
bs - A byte array to which to write the encoding
pos - The offset into the byte array
- Returns:
- New position after writing
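The write-at-offset contract shared by encode and encodeRaw (write into `bs` starting at `pos`, return the position after the last byte written) can be illustrated with a toy payload. This sketch does not reproduce the BlobTree wire format: the tag value, the class name, and the flat `payload` array are all hypothetical.

```java
final class EncodeSketch {
    static final byte TAG = 0x31; // hypothetical tag value, for illustration only

    /** Writes the payload without a tag byte; returns the position after the last byte written. */
    static int encodeRaw(byte[] payload, byte[] bs, int pos) {
        System.arraycopy(payload, 0, bs, pos, payload.length);
        return pos + payload.length;
    }

    /** Writes the tag byte first, then the raw encoding; returns the new position. */
    static int encode(byte[] payload, byte[] bs, int pos) {
        bs[pos++] = TAG;
        return encodeRaw(payload, bs, pos);
    }
}
```

Returning the new position lets a caller chain several writes into one buffer without tracking lengths separately.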
-
encodeRaw
public int encodeRaw(byte[] bs, int pos)
Description copied from class: ABlob
Writes this Blob's encoding to a byte array, excluding the tag byte
-
read
Reads an encoded BlobTree from a Blob. Assumes there will be encoded children.
- Parameters:
count - Length to read
src - Source data, assumed to include tag and count at start
pos - Position to read from, assumed to be tag byte
- Returns:
- BlobTree instance.
- Throws:
BadFormatException - If BlobTree encoding is invalid
-
estimatedEncodingSize
public int estimatedEncodingSize()
Description copied from interface: IWriteable
Estimate the encoded data size for this Cell. Used for quickly sizing buffers. Implementations should try to return a size that is highly likely to contain the entire object when encoded, including the tag byte. Should not traverse soft Refs, i.e. must be usable on arbitrary partial data structures.
- Returns:
- The estimated size for the binary representation of this object.
-
append
Appends another blob to this BlobTree. Potentially O(n) but can be faster. We are careful to slice from (0...n) on the appended array, to minimise reconstruction of BlobTrees.
-
isFullyPacked
public boolean isFullyPacked()
Returns true if this is a fully packed set of chunks
- Returns:
- True if fully packed, false otherwise
-
isChunkPacked
public boolean isChunkPacked()
Returns true if this is a fully packed set of chunks
- Returns:
- True if fully packed, false otherwise
-
getChunk
Description copied from class: ABlob
Gets a chunk of this Blob, as a canonical Blob up to the maximum chunk size. Returns empty Blob if and only if referencing the end of a Blob with fully packed chunks
-
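The getChunk behaviour, including the empty result at the end of a fully packed blob, can be sketched over a flat byte array. The 4096-byte chunk size comes from the calcChunks description; the class name, the array-based signature, and the exception thrown for an index past the end are assumptions.

```java
import java.util.Arrays;

final class GetChunkSketch {
    static final int CHUNK_LENGTH = 4096; // chunk size from the calcChunks description

    /** Returns the chunk at chunkIndex as a copy of at most CHUNK_LENGTH bytes. */
    static byte[] getChunk(byte[] data, long chunkIndex) {
        // the index may equal the number of whole chunks, which points at the end of the data
        if (chunkIndex < 0 || chunkIndex > data.length / CHUNK_LENGTH) {
            throw new IndexOutOfBoundsException("chunk index out of range: " + chunkIndex);
        }
        int start = (int) (chunkIndex * CHUNK_LENGTH);
        int end = (int) Math.min((long) data.length, (long) start + CHUNK_LENGTH);
        return Arrays.copyOfRange(data, start, end); // empty only when start == data.length
    }
}
```

With 8192 bytes of data, index 2 yields an empty chunk; with 5000 bytes, index 1 yields the 904 trailing bytes.
-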
validate
Description copied from interface: IValidated
Validates the complete structure of this object. It is necessary to ensure all child Refs are validated, so the general contract for validate is:
- Call super.validate(), which will indirectly call validateCell()
- Call validate() on any contained cells in this class
- Specified by:
validate in interface IValidated
- Overrides:
validate in class ABlob
- Throws:
InvalidDataException - If the data Value is invalid in any way
-
getByteBuffer
Description copied from class: ABlob
Gets a byte buffer containing this Blob's raw data. Will have remaining bytes equal to this Blob's size.
- Specified by:
getByteBuffer in class ABlob
- Returns:
- A ByteBuffer containing the Blob's data.
-
appendHex
Description copied from class: ABlob
Append hex string up to the given length in hex digits (a multiple of two)
-
validateCell
Description copied from class: ACell
Validates the local structure and invariants of this cell. Called by validate() super implementation. Should validate directly contained data, but should not validate all other structure of this cell. In particular, should not traverse potentially missing child Refs.
- Overrides:
validateCell in class ABlob
- Throws:
InvalidDataException - If the Cell is invalid
-
commonHexPrefixLength
Description copied from class: ABlob
Computes the length of the longest common hex prefix between two blobs
- Specified by:
commonHexPrefixLength in class ABlob
- Parameters:
b - Blob to compare with
- Returns:
- The length of the longest common prefix in hex digits
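A common hex prefix is counted in hex digits (two per byte, high nibble first), so two blobs can share an odd number of digits. The sketch below works on flat byte arrays as a stand-in for blobs; the class and helper names are hypothetical.

```java
final class HexPrefixSketch {
    /** Hex digit (0..15) at digit position i, taking the high nibble of each byte first. */
    static int hexDigit(byte[] data, long i) {
        int b = data[(int) (i >> 1)] & 0xFF;
        return ((i & 1) == 0) ? (b >>> 4) : (b & 0x0F);
    }

    /** Length, in hex digits, of the longest common prefix of a and b. */
    static long commonHexPrefixLength(byte[] a, byte[] b) {
        long max = 2L * Math.min(a.length, b.length);
        for (long i = 0; i < max; i++) {
            if (hexDigit(a, i) != hexDigit(b, i)) {
                return i;
            }
        }
        return max;
    }
}
```

For instance, the byte sequences 0x1234 and 0x1235 share a prefix of three hex digits.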
-
hexMatchLength
Description copied from class: ABlob
Returns the number of matching hex digits in the given hex range of another Blob. Assumes range is valid for both blobs. Returns length if this Blob is exactly equal to the specified hex range.
- Specified by:
hexMatchLength in class ABlob
- Parameters:
b - Blob to compare with
start - Start position (in hex digits)
length - Length to compare (in hex digits)
- Returns:
- The number of matching hex characters
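One possible reading of this contract is that both blobs are compared at the same digit positions, from `start` up to `start + length`, which would explain why the range must be valid for both. The sketch below implements that reading over flat byte arrays. That interpretation, the class name, and the helper are all assumptions rather than confirmed library behaviour.

```java
final class HexMatchSketch {
    /** Hex digit (0..15) at digit position i, taking the high nibble of each byte first. */
    static int hexDigit(byte[] data, long i) {
        int b = data[(int) (i >> 1)] & 0xFF;
        return ((i & 1) == 0) ? (b >>> 4) : (b & 0x0F);
    }

    /**
     * Counts matching hex digits between self and other over [start, start + length),
     * assuming both arrays are indexed at the same digit positions.
     */
    static long hexMatchLength(byte[] self, byte[] other, long start, long length) {
        for (long i = 0; i < length; i++) {
            if (hexDigit(self, start + i) != hexDigit(other, start + i)) {
                return i;
            }
        }
        return length; // every digit in the range matched
    }
}
```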
-
toExactLong
public long toExactLong()
Description copied from class: ABlob
Gets the long value of this Blob if the length is exactly 8 bytes, otherwise throws an Exception
- Specified by:
toExactLong in class ABlob
- Returns:
- The long value represented by the Blob
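The exact-length conversion can be sketched with `java.nio.ByteBuffer`, which reads big-endian by default. The byte order and the exception type used here are assumptions, since the description only says that an Exception is thrown; the class name is hypothetical.

```java
import java.nio.ByteBuffer;

final class ExactLongSketch {
    /** Interprets exactly 8 bytes as a big-endian long, rejecting any other length. */
    static long toExactLong(byte[] data) {
        if (data.length != 8) {
            // exception type is an assumption; the documentation does not name one
            throw new IllegalArgumentException("Expected exactly 8 bytes but got " + data.length);
        }
        return ByteBuffer.wrap(data).getLong(); // ByteBuffer is big-endian unless reordered
    }
}
```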
-
longValue
public long longValue()
Description copied from class: ABlob
Converts this Blob to the corresponding long value. Assumes big-endian format, as if the entire blob is interpreted as an unsigned big integer. Higher bytes outside the Long range will be ignored, i.e. the lowest 64 bits are taken
-
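The longValue rule (big-endian, keeping only the lowest 64 bits) means that only the last eight bytes of a longer blob contribute to the result. A minimal sketch over a flat byte array, with a hypothetical class name:

```java
final class LongValueSketch {
    /** Big-endian value of the data, truncated to its lowest 64 bits. */
    static long longValue(byte[] data) {
        long result = 0;
        int start = Math.max(0, data.length - 8); // bytes before this fall outside 64 bits
        for (int i = start; i < data.length; i++) {
            result = (result << 8) | (data[i] & 0xFFL); // mask to treat the byte as unsigned
        }
        return result;
    }
}
```

A blob shorter than eight bytes is effectively zero-extended on the left, so the two bytes 0x01 0x02 give 258.
-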
getRefCount
public int getRefCount()
Description copied from class: ACell
Gets the number of Refs contained within this Cell. This number is final / immutable for any given instance and is defined by the Cell encoding rules. WARNING: may not be valid if the Cell is not canonical. Contained Refs may be either external or embedded.
- Specified by:
getRefCount in class ACell
- Returns:
- The number of Refs in this Cell
-
getRef
Description copied from class: ACell
Gets a numbered child Ref from within this Cell. WARNING: May be unreliable if the cell is not canonical
-
updateRefs
Description copied from class: ACell
Updates all Refs in this object using the given function. The function *must not* change the hash value of Refs, in order to ensure structural integrity of modified data structures. The implementation *should* re-attach any original encoding in order to prevent re-encoding or surplus hashing. This is a building block for a very sneaky trick that enables us to do a lot of efficient operations on large trees of smart references. Must return the same object if no Refs are altered.
- Overrides:
updateRefs in class ACell
- Parameters:
func - Ref update function
- Returns:
- Cell with updated Refs
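The requirement to return the same object when no Refs are altered is commonly met by copying the child array lazily, only once a change is seen. The sketch below shows that pattern with a hypothetical `Node` type and plain `Object` references. It does not use the library's `Ref` or `IRefFunction` types and does not model re-attaching the encoding.

```java
import java.util.function.UnaryOperator;

final class UpdateRefsSketch {
    /** Hypothetical stand-in for a cell that holds child references. */
    static final class Node {
        final Object[] refs;

        Node(Object[] refs) {
            this.refs = refs;
        }

        /** Applies func to every ref; returns this node itself if nothing changed. */
        Node updateRefs(UnaryOperator<Object> func) {
            Object[] updated = null; // allocated lazily, only once a ref actually changes
            for (int i = 0; i < refs.length; i++) {
                Object newRef = func.apply(refs[i]);
                if (newRef != refs[i]) { // identity check: a different object counts as a change
                    if (updated == null) {
                        updated = refs.clone();
                    }
                    updated[i] = newRef;
                }
            }
            return (updated == null) ? this : new Node(updated);
        }
    }
}
```

Returning the same instance lets callers detect "nothing changed" with a cheap identity comparison, which matters when the same function is applied across a large tree.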
-
getTag
public byte getTag()
Description copied from class: ACell
Gets the tag byte for this cell. The tag byte is always equal to the first byte of the Cell's canonical Encoding, and is sufficient to distinguish how to read the rest of the encoding.
-
toCanonical
Description copied from class: ACell
Converts this Cell to its canonical version. Must return this Cell if already canonical, may be O(n) in size of value otherwise.
- Specified by:
toCanonical in class ABlob
- Returns:
- Canonical version of Cell
-
childSize
public static long childSize(long length)
Gets the size of a BlobTree child for a blob of given total length.
- Parameters:
length - Length of Blob
- Returns:
- Size of child, or 1 if not a BlobTree
-
childCount
public static int childCount(long length)
Gets the number of children for a BlobTree of given total length.
- Parameters:
length - Length of Blob
- Returns:
- Number of Child blobs
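One way to derive child size and child count from a total length, consistent with the 4096-byte chunk and the branching factor of 16 described for this class, is sketched below. This is an illustration under those assumptions and not the library's implementation; the class name and the choice to reject lengths that fit in a single chunk in `childCount` are hypothetical.

```java
final class ChildLayoutSketch {
    static final long CHUNK_LENGTH = 4096; // chunk size from the calcChunks description
    static final long FANOUT = 16;         // branching factor from the class description

    /** Size of each full child, or 1 if the length fits in a single chunk (not a BlobTree). */
    static long childSize(long length) {
        if (length <= CHUNK_LENGTH) {
            return 1;
        }
        long size = CHUNK_LENGTH;
        // grow the child size until 16 children are enough to cover the whole length;
        // comparing against (length - 1) / FANOUT avoids overflow for very large lengths
        while (size <= (length - 1) / FANOUT) {
            size *= FANOUT;
        }
        return size;
    }

    /** Number of children, counting a final partially filled child. */
    static int childCount(long length) {
        if (length <= CHUNK_LENGTH) {
            throw new IllegalArgumentException("not large enough for a BlobTree: " + length); // assumed
        }
        long size = childSize(length);
        return (int) (((length - 1) / size) + 1); // ceiling division
    }
}
```

Under this sketch a 65536-byte blob has 16 children of 4096 bytes, while one extra byte moves up a level to 2 children with a child size of 65536.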
-