1. Splits the text by boundary symbols that are not within a protection marker.
2. Replaces all ignore-marker symbols with nothing.
3. Cleans out all sentences that ended up empty between boundaries.
4. Puts back all replacement symbols with their original meaning.
5. Collects sentence information.
Reads through the symbolized data and computes the sentence bounds based on regex rules that follow each symbol's meaning.
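
The five steps can be illustrated with a minimal Python sketch. It is not the original implementation. The marker characters, the `REPLACEMENTS` table, the `Sentence` record, and the function names are all hypothetical. The sketch also assumes that protection markers are stripped together with ignore markers, and that "sentence information" means an index, the text, and start/end offsets into the raw text.

```python
from dataclasses import dataclass

# Hypothetical marker characters, chosen only for this sketch.
PROTECT_OPEN = "\u0001"
PROTECT_CLOSE = "\u0002"
BOUNDARY = "\u0003"
IGNORE = "\u0004"
# Hypothetical replacement symbols mapped back to their original text.
REPLACEMENTS = {"\u0005": "."}


@dataclass
class Sentence:
    index: int
    text: str
    start: int
    end: int


def split_unprotected(symbolized):
    """Split on BOUNDARY only when outside a protected region."""
    parts, current, depth = [], [], 0
    for ch in symbolized:
        if ch == PROTECT_OPEN:
            depth += 1
        elif ch == PROTECT_CLOSE:
            depth = max(depth - 1, 0)
        if ch == BOUNDARY and depth == 0:
            parts.append("".join(current))
            current = []
        else:
            current.append(ch)
    parts.append("".join(current))
    return parts


def extract_bounds(symbolized, raw):
    sentences, cursor = [], 0
    for part in split_unprotected(symbolized):            # step 1
        # Step 2: drop markers that carry no text (assumed set).
        for marker in (IGNORE, PROTECT_OPEN, PROTECT_CLOSE, BOUNDARY):
            part = part.replace(marker, "")
        part = part.strip()
        if not part:                                       # step 3
            continue
        for symbol, original in REPLACEMENTS.items():      # step 4
            part = part.replace(symbol, original)
        # Step 5: locate the sentence in the raw text to get offsets.
        start = raw.find(part, cursor)
        if start < 0:
            start = cursor
        end = start + len(part)
        sentences.append(Sentence(len(sentences), part, start, end))
        cursor = end
    return sentences


raw = 'Mr. Smith arrived. He said "Stop. Wait." Then he left.'
symbolized = (
    "Mr\u0005 Smith arrived." + BOUNDARY
    + " He said " + PROTECT_OPEN + '"Stop.' + BOUNDARY + ' Wait."'
    + PROTECT_CLOSE + BOUNDARY
    + " Then he left." + BOUNDARY
)
result = extract_bounds(symbolized, raw)
```

Here the boundary inside the quoted span does not split the sentence because it sits between the protection markers, and the trailing boundary produces an empty fragment that step 3 discards.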