Class AString

All Implemented Interfaces:
IValidated, IWriteable, Comparable<AString>
Direct Known Subclasses:
StringShort, StringSlice, StringTree

public abstract class AString extends ACountable<CVMChar> implements Comparable<AString>
Class representing a CVM String. CVM Strings are UTF-8 byte strings with an immutable, fixed count in bytes. CVM Strings are NOT enforced to be valid UTF-8, for reasons of performance, simplicity and consistent behaviour (e.g. in conversions to and from Blobs). It is up to clients to decide how to represent invalid UTF-8 if necessary.
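The effect of not enforcing valid UTF-8 can be illustrated without any Convex classes. The following is a minimal sketch using plain Java byte arrays; the class `Utf8RoundTripSketch` and its method are hypothetical and not part of this API. It shows why a byte-level representation round-trips arbitrary data while a decoded Java String does not.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical illustration class; not part of the Convex API.
final class Utf8RoundTripSketch {

    // Decodes bytes as UTF-8 and re-encodes them. Java replaces malformed
    // input with U+FFFD, so invalid sequences do not survive this trip.
    static byte[] viaJavaString(byte[] raw) {
        String decoded = new String(raw, StandardCharsets.UTF_8);
        return decoded.getBytes(StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        byte[] valid = "h\u00e9llo".getBytes(StandardCharsets.UTF_8);
        // 0xFF never occurs in valid UTF-8
        byte[] invalid = {(byte) 'a', (byte) 0xFF, (byte) 'b'};

        System.out.println(Arrays.equals(valid, viaJavaString(valid)));     // true
        System.out.println(Arrays.equals(invalid, viaJavaString(invalid))); // false
    }
}
```

Keeping the raw bytes, rather than a validated or decoded form, is what allows conversions to and from a byte representation to be lossless.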
  • Field Details

    • length

      protected long length
  • Constructor Details

    • AString

      protected AString(long length)
  • Method Details

    • getType

      public AType getType()
      Description copied from class: ACell
      Gets the most specific known runtime Type for this Cell.
      Overrides:
      getType in class ACell
      Returns:
      The Type of this Cell
    • print

      public boolean print(BlobBuilder sb, long limit)
      Description copied from class: AObject
      Prints this Object to a readable String representation. SECURITY: Must halt and return false in O(1) time if the printing limit is exceeded; otherwise DoS attacks may be possible.
      Specified by:
      print in class AObject
      Parameters:
      sb - BlobBuilder to append to
      limit - Limit of printing in string bytes
      Returns:
      True if fully printed within limit, false otherwise
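One possible way to honour the O(1) halting requirement is to compare lengths before copying any data. The sketch below is an assumption-laden illustration, not the actual implementation: it uses `java.io.ByteArrayOutputStream` in place of BlobBuilder, treats the limit as a cap on total output, and the class `BoundedPrintSketch` is hypothetical. Whether the real method writes partial output before returning false is not stated here.

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;

// Hypothetical illustration class; not part of the Convex API.
final class BoundedPrintSketch {

    // Appends data to out only if the total output stays within limit bytes.
    // The check compares lengths only, so rejection costs O(1) regardless of data size.
    static boolean print(ByteArrayOutputStream out, byte[] data, long limit) {
        if ((long) out.size() + data.length > limit) {
            return false; // limit exceeded: stop before copying anything
        }
        out.write(data, 0, data.length);
        return true;
    }

    public static void main(String[] args) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        byte[] hello = "hello".getBytes(StandardCharsets.UTF_8);
        System.out.println(print(out, hello, 8)); // true  (5 bytes fit in 8)
        System.out.println(print(out, hello, 8)); // false (10 bytes would exceed 8)
    }
}
```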
    • count

      public long count()
      Description copied from class: ACountable
      Returns the number of elements in this data structure
      Specified by:
      count in class ACountable<CVMChar>
      Returns:
      Number of elements in this collection.
    • empty

      public final StringShort empty()
      Returns the singleton empty String
      Specified by:
      empty in class ACountable<CVMChar>
      Returns:
      An empty Countable value
    • charAt

      public final int charAt(long i)
      Gets the Unicode character at position i, or -1 if not valid
      Parameters:
      i - Index into String (byte position)
      Returns:
      Unicode code point, or -1 if not a valid code point at this position
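The contract above (a code point at a byte position, or -1) can be sketched over a plain byte array. The class `CharAtSketch` is hypothetical, and the strictness shown (rejecting overlong forms and surrogates) is an assumption; this page does not say exactly which sequences the real method treats as invalid.

```java
// Hypothetical illustration class; not part of the Convex API.
final class CharAtSketch {

    // Decodes the UTF-8 sequence starting at byte position i.
    // Returns the code point, or -1 if no valid sequence starts there.
    static int charAt(byte[] data, long i) {
        if (i < 0 || i >= data.length) return -1;
        int pos = (int) i;
        int b0 = data[pos] & 0xFF;
        if (b0 < 0x80) return b0; // single-byte (ASCII) character

        int extra; // number of continuation bytes expected
        int min;   // smallest code point allowed for this length
        int cp;    // payload bits from the lead byte
        if ((b0 & 0xE0) == 0xC0) { extra = 1; min = 0x80; cp = b0 & 0x1F; }
        else if ((b0 & 0xF0) == 0xE0) { extra = 2; min = 0x800; cp = b0 & 0x0F; }
        else if ((b0 & 0xF8) == 0xF0) { extra = 3; min = 0x10000; cp = b0 & 0x07; }
        else return -1; // continuation byte or illegal lead byte

        if (pos + extra >= data.length) return -1; // sequence is truncated
        for (int k = 1; k <= extra; k++) {
            int b = data[pos + k] & 0xFF;
            if ((b & 0xC0) != 0x80) return -1; // not a continuation byte
            cp = (cp << 6) | (b & 0x3F);
        }
        // Assumption: reject overlong encodings, surrogates and out-of-range values
        if (cp < min || cp > 0x10FFFF || (cp >= 0xD800 && cp <= 0xDFFF)) return -1;
        return cp;
    }

    public static void main(String[] args) {
        byte[] data = {0x61, (byte) 0xC3, (byte) 0xA9}; // "a" followed by U+00E9
        System.out.println(charAt(data, 0)); // 97
        System.out.println(charAt(data, 1)); // 233
        System.out.println(charAt(data, 2)); // -1 (middle of a character)
    }
}
```

Because the index is a byte position, asking for the middle of a multi-byte character yields -1 rather than the enclosing character.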
    • intAt

      public int intAt(long index)
      Gets a 32-bit integer (4 bytes) at the given position. Pads with 255 (0xFF, invalid in UTF-8) where the read extends beyond the String. The main purpose of this function is to enable fast peeking at UTF-8 characters.
      Parameters:
      index - Index into String (byte position)
      Returns:
      Raw integer value
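The padding behaviour can be sketched with a plain byte array. The class `IntAtSketch` is hypothetical, and the big-endian byte order (first byte in the most significant position) is an assumption not confirmed by this page.

```java
// Hypothetical illustration class; not part of the Convex API.
final class IntAtSketch {

    // Returns the byte at index i, or 0xFF (-1 as a signed byte) if outside the data.
    static byte byteAt(byte[] data, long i) {
        return (i >= 0 && i < data.length) ? data[(int) i] : (byte) 0xFF;
    }

    // Reads 4 bytes starting at index, padding with 0xFF past the end.
    // Assumption: bytes are packed big-endian.
    static int intAt(byte[] data, long index) {
        int result = 0;
        for (int k = 0; k < 4; k++) {
            result = (result << 8) | (byteAt(data, index + k) & 0xFF);
        }
        return result;
    }

    public static void main(String[] args) {
        byte[] data = {0x41, 0x42}; // "AB"
        System.out.println(Integer.toHexString(intAt(data, 0))); // 4142ffff
    }
}
```

Since 0xFF cannot appear in valid UTF-8, a caller inspecting the returned integer can tell padding apart from real character data.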
    • byteAt

      protected abstract byte byteAt(long i)
      Gets a byte at the specified index. Returns -1 (0xff) if outside String.
      Parameters:
      i - Index into String (byte position)
      Returns:
      Raw byte value
    • get

      public CVMChar get(long i)
      Gets the Character at the specified point in the String, or null if there is no valid Character at this position.
      Specified by:
      get in class ACountable<CVMChar>
      Parameters:
      i - Index of element to get
      Returns:
      CVMChar instance, or null for invalid UTF-8 or any position outside the String bounds
    • getElementRef

      public Ref<CVMChar> getElementRef(long i)
      Description copied from class: ACountable
      Gets a Ref to the element at the specified element index in this collection
      Specified by:
      getElementRef in class ACountable<CVMChar>
      Parameters:
      i - Index of element to get
      Returns:
      Ref to the element at the specified index
    • compareTo

      public abstract int compareTo(AString o)
      Specified by:
      compareTo in interface Comparable<AString>
    • toString

      public final String toString()
      Description copied from class: ACell
      Returns the Java String representation of this Cell. The String representation is intended to be an easy-to-read textual representation of the Cell's data content.
      Overrides:
      toString in class ACell
    • toCVMString

      public AString toCVMString(long limit)
      Description copied from class: ACell
      Returns the CVM String representation of this Cell. Normally, this is as printed, but may be different for some types. MUST return null in O(1) time if the length of the CVM String would exceed limit. The String representation is intended to be an easy-to-read textual representation of the Cell's data content.
      Overrides:
      toCVMString in class ACell
      Parameters:
      limit - Limit of CVM String length in UTF-8 bytes
      Returns:
      CVM String, or null if limit exceeded
    • append

      public AString append(AString b)
      Append a CVM String to this CVM String. Potentially O(n). Concatenates raw UTF-8 bytes.
      Parameters:
      b - String to append
      Returns:
      Concatenated String
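Because concatenation works on raw bytes rather than characters, two fragments that are individually invalid UTF-8 can join into a valid character. The following sketch shows this with plain byte arrays; the class `AppendSketch` is hypothetical and only mirrors the described behaviour.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical illustration class; not part of the Convex API.
final class AppendSketch {

    // Concatenates two byte strings by copying, which is O(n) in the total length.
    static byte[] append(byte[] a, byte[] b) {
        byte[] out = Arrays.copyOf(a, a.length + b.length);
        System.arraycopy(b, 0, out, a.length, b.length);
        return out;
    }

    public static void main(String[] args) {
        byte[] lead = {(byte) 0xC3};         // lead byte alone: invalid UTF-8
        byte[] continuation = {(byte) 0xA9}; // continuation byte alone: invalid UTF-8
        byte[] joined = append(lead, continuation);
        // Together the two bytes encode U+00E9
        System.out.println(new String(joined, StandardCharsets.UTF_8).equals("\u00e9")); // true
    }
}
```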
    • writeToBuffer

      protected abstract void writeToBuffer(ByteBuffer bb)
    • slice

      public abstract AString slice(long start, long end)
      Gets a slice of this string, or null if not a valid slice
      Specified by:
      slice in class ACountable<CVMChar>
      Parameters:
      start - Start index (inclusive)
      end - End index (exclusive)
      Returns:
      Specified substring
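The null-on-invalid convention can be sketched over a plain byte array. The class `SliceSketch` is hypothetical, and the exact validity rule used here (0 <= start <= end <= count, measured in bytes) is an assumption; this page only states that an invalid slice yields null.

```java
import java.util.Arrays;

// Hypothetical illustration class; not part of the Convex API.
final class SliceSketch {

    // Returns bytes in [start, end), or null if the range is not valid.
    // Assumption: valid means 0 <= start <= end <= data.length.
    static byte[] slice(byte[] data, long start, long end) {
        if (start < 0 || end > data.length || start > end) {
            return null;
        }
        return Arrays.copyOfRange(data, (int) start, (int) end);
    }

    public static void main(String[] args) {
        byte[] data = {'h', 'e', 'l', 'l', 'o'};
        System.out.println(Arrays.toString(slice(data, 1, 3))); // [101, 108]
        System.out.println(slice(data, 3, 9));                  // null
    }
}
```

Callers following this convention need a null check rather than a try/catch around the call.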
    • split

      public AVector<AString> split(CVMChar c)
      Splits this string by the given character. The result will always have at least one String, and as many additional Strings as the split character occurs.
      Parameters:
      c - CVMChar instance with which to split
      Returns:
      Vector of Strings, excluding the split character.
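The "one String plus one per occurrence" rule can be sketched by scanning UTF-8 bytes for the encoded separator. The class `SplitSketch` is hypothetical and uses Java Strings and a `List` for readability in place of AString and AVector; it is not the actual implementation.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration class; not part of the Convex API.
final class SplitSketch {

    // Splits s on every occurrence of codePoint, working on UTF-8 bytes.
    // The result has exactly (occurrences + 1) parts, some possibly empty.
    static List<String> split(String s, int codePoint) {
        byte[] data = s.getBytes(StandardCharsets.UTF_8);
        byte[] sep = new String(Character.toChars(codePoint)).getBytes(StandardCharsets.UTF_8);
        List<String> parts = new ArrayList<>();
        int start = 0; // byte position where the current part begins
        int i = 0;
        while (i + sep.length <= data.length) {
            if (matchesAt(data, i, sep)) {
                parts.add(new String(data, start, i - start, StandardCharsets.UTF_8));
                i += sep.length;
                start = i;
            } else {
                i++;
            }
        }
        // The final part is always added, even when empty
        parts.add(new String(data, start, data.length - start, StandardCharsets.UTF_8));
        return parts;
    }

    // True if sep occurs in data starting at byte position pos.
    private static boolean matchesAt(byte[] data, int pos, byte[] sep) {
        for (int k = 0; k < sep.length; k++) {
            if (data[pos + k] != sep[k]) return false;
        }
        return true;
    }

    public static void main(String[] args) {
        System.out.println(split("a,b,,c", ',')); // [a, b, , c]
        System.out.println(split("", ',').size()); // 1
    }
}
```

Note that adjacent separators produce empty Strings, and an empty input still yields a single empty String.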
    • encode

      public int encode(byte[] bs, int pos)
      Description copied from class: ACell
      Writes this Cell's encoding to a byte array, including a tag byte which will be written first. Cell must be canonical, or else an error may occur.
      Specified by:
      encode in interface IWriteable
      Specified by:
      encode in class ACell
      Parameters:
      bs - A byte array to which to write the encoding
      pos - The offset into the byte array
      Returns:
      New position after writing
    • encodeRaw

      public abstract int encodeRaw(byte[] bs, int pos)
      Encodes the data of this String. Assumes the tag has already been written.
      Specified by:
      encodeRaw in class ACell
      Parameters:
      bs - A byte array to which to write the encoding
      pos - The offset into the byte array
      Returns:
      New position after writing
    • encodeRawData

      public abstract int encodeRawData(byte[] bs, int pos)
      Encodes the raw UTF-8 data of this String. Assumes the tag and length have already been written.
      Parameters:
      bs - Byte array to encode to
      pos - Position in target array to write to
      Returns:
      End position in array after encoding
    • getTag

      public final byte getTag()
      Description copied from class: ACell
      Gets the tag byte for this cell. The tag byte is always written as the first byte of the Cell's Encoding
      Specified by:
      getTag in class ACell
      Returns:
      Tag byte for this Cell
    • hashCode

      public final int hashCode()
      Gets a Java hashCode for this CVM String. Uses the hash code of the underlying Blob.
      Overrides:
      hashCode in class ACell
      Returns:
      integer hash code.
    • toBlob

      public abstract ABlob toBlob()
      Converts this String to a Blob byte representation. Must round trip all values.
      Returns:
      Blob representation of UTF-8 String
    • append

      public AString append(String string)
      Convenience method to add a Java String to a CVM String. Not particularly efficient.
      Parameters:
      string - String to append
      Returns:
      CVM String