Package convex.core.data
Class AString
- All Implemented Interfaces:
IValidated,IWriteable,Comparable<AString>
- Direct Known Subclasses:
StringShort,StringSlice,StringTree
Abstract base Class representing a CVM String.
CVM Strings are UTF-8 byte strings with an immutable, fixed count in bytes.
CVM Strings are NOT enforced to be valid UTF-8, for reasons of performance, simplicity and
consistent behaviour (e.g. in conversions to and from Blobs). It is up to clients to decide
how to represent invalid UTF-8 if necessary.
-
Field Summary
FieldsFields inherited from class convex.core.data.ACell
cachedRef, EMPTY_ARRAY -
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionAppend a CVM String to this CVM String.Convenience method to add a Java String to a CVM String.abstract bytebyteAt(long i) Gets a byte at the specified index.final intcharAt(long i) Gets the Unicode character at position i, or -1 if not validabstract intlongcount()Returns the number of elements in this data structurefinal StringShortempty()Returns the singleton empty Stringintencode(byte[] bs, int pos) Writes this Cell's encoding to a byte array, including a tag byte which will be written first.abstract intencodeRaw(byte[] bs, int pos) Encode the data of this String.get(long i) Gets the Character at the specified point in the String, or null if there is no valid Character at this position.intgetBytes(byte[] dest, int destOffset) Copies the bytes from this instance to a given destinationgetElementRef(long i) Gets a Ref to the element at the specified element index in this collectionfinal bytegetTag()Gets the tag byte for this cell.getType()Gets the most specific known runtime Type for this Cell.final inthashCode()Gets a Java hashCode for this CVM String.intintAt(long index) Gets 32 bytes integer at given position.booleanprint(BlobBuilder sb, long limit) Prints this Object to a readable String Representation.protected abstract voidprintEscaped(BlobBuilder sb, long start, long end) Prints this string as escaped UTF-8abstract AStringslice(long start, long end) Gets a slice of this string, or null if not a valid sliceSplits this string by the given character The result will always have at least one String, and as many additional Strings as the split character occurs.abstract ABlobtoBlob()Converts this String to a Blob representation.toCVMString(long limit) Returns the CVM String representation of this Cell.final StringtoString()Returns the Java String representation of this Cell.abstract intwriteRawData(byte[] bs, int pos) Write the raw UTF-8 data of this String to a byte array.Methods inherited from class convex.core.data.ACountable
isEmpty, size, sliceMethods inherited from class convex.core.data.ACell
announce, announce, attachMemorySize, attachRef, cachedEncoding, cachedHash, calcMemorySize, createAnnounced, createEncoding, createPersisted, createPersisted, createRef, equals, equals, getCanonical, getChildRefs, getEncoding, getEncodingLength, getHash, getMemorySize, getRef, getRef, getRefCount, isCanonical, isCompletelyEncoded, isCVMValue, isEmbedded, mark, mark, toCanonical, updateRefs, validate, validateCellMethods inherited from class convex.core.data.AObject
attachEncoding, print, printMethods inherited from class java.lang.Object
clone, finalize, getClass, notify, notifyAll, wait, wait, waitMethods inherited from interface convex.core.data.IWriteable
estimatedEncodingSize
-
Field Details
-
length
protected long length
-
-
Constructor Details
-
AString
protected AString(long length)
-
-
Method Details
-
getType
Description copied from class:ACellGets the most specific known runtime Type for this Cell. -
print
Description copied from class:AObjectPrints this Object to a readable String Representation. SECURITY: Must halt and return false in O(1) time when limit of printing is exceeded otherwise DoS attacks may be possible. -
count
public long count()Description copied from class:ACountableReturns the number of elements in this data structure- Specified by:
countin classACountable<CVMChar>- Returns:
- Number of elements in this collection.
-
printEscaped
Prints this string as escaped UTF-8- Parameters:
sb-
-
empty
Returns the singleton empty String -
charAt
public final int charAt(long i) Gets the Unicode character at position i, or -1 if not valid- Parameters:
i- Index into String (byte position)- Returns:
- Unicode code point, or -1 if not a valid code point at this position
-
intAt
public int intAt(long index) Gets 32 bytes integer at given position. Extends with 255 (invalid UTF-8) if needed. The main purpose of this function is to enable fast peeking at UTF-8 characters- Parameters:
index- Index into String (byte position)- Returns:
- Raw integer value
-
byteAt
public abstract byte byteAt(long i) Gets a byte at the specified index. Returns -1 (0xff) if outside String. -
get
Gets the Character at the specified point in the String, or null if there is no valid Character at this position.- Specified by:
getin classACountable<CVMChar>- Parameters:
i- Index of element to get- Returns:
- CVMChar instance, or null for invalid UTF-8 or any character out of the string bounds
-
getElementRef
Description copied from class:ACountableGets a Ref to the element at the specified element index in this collection- Specified by:
getElementRefin classACountable<CVMChar>- Parameters:
i- Index of element to get- Returns:
- Element at the specified index
-
getBytes
public int getBytes(byte[] dest, int destOffset) Description copied from class:ABlobLikeCopies the bytes from this instance to a given destination -
compareTo
- Specified by:
compareToin interfaceComparable<AString>
-
toString
Description copied from class:ACellReturns the Java String representation of this Cell. The String representation is intended to be a easy-to-read textual representation of the Cell's data content. -
toCVMString
Description copied from class:ACellReturns the CVM String representation of this Cell. Normally, this is as printed, but may be different for some types. MUST return null in O(1) time if the length of the CVM String would exceed limit. The String representation is intended to be a easy-to-read textual representation of the Cell's data content.- Overrides:
toCVMStringin classACell- Parameters:
limit- Limit of CVM String length in UTF-8 bytes- Returns:
- CVM String, or null if limit exceeded
-
append
Append a CVM String to this CVM String. Potentially O(n). Concatenates raw UTF-8 bytes.- Parameters:
b- String to append- Returns:
- Concatenated String
-
slice
Gets a slice of this string, or null if not a valid slice- Specified by:
slicein classACountable<CVMChar>- Parameters:
start- Start index (inclusive)end- End index (Exclusive)- Returns:
- Specified substring
-
split
Splits this string by the given character The result will always have at least one String, and as many additional Strings as the split character occurs.- Parameters:
c- CMVChar instance with which to split- Returns:
- Vector of Strings, excluding the split character.
-
encode
public int encode(byte[] bs, int pos) Description copied from class:ACellWrites this Cell's encoding to a byte array, including a tag byte which will be written first. Cell must be canonical, or else an error may occur.- Specified by:
encodein interfaceIWriteable- Specified by:
encodein classACell- Parameters:
bs- A byte array to which to write the encodingpos- The offset into the byte array- Returns:
- New position after writing
-
encodeRaw
public abstract int encodeRaw(byte[] bs, int pos) Encode the data of this String. Assumes tag already written -
writeRawData
public abstract int writeRawData(byte[] bs, int pos) Write the raw UTF-8 data of this String to a byte array.- Parameters:
bs- Destination byte arraypos- Position in target array to write to- Returns:
- End position in array after writing
-
getTag
public final byte getTag()Description copied from class:ACellGets the tag byte for this cell. The tag byte is always equal to the first byte of the Cell's canonical Encoding, and is sufficient to distinguish how to read the rest of the encoding. -
hashCode
public final int hashCode()Gets a Java hashCode for this CVM String. Use the hashcode of underlying Blob -
toBlob
Converts this String to a Blob representation. Must round trip all values.- Returns:
- Blob representation of UTF-8 String
-
append
Convenience method to add a Java String to a CVM String. Not particularly efficient.- Parameters:
string- String to append- Returns:
- CVM String
-