p

sanskritnlp

transliteration

package transliteration

Ordering
  1. Alphabetic
Visibility
  1. Public
  2. All

Type Members

  1. trait IastBase extends RomanScript
  2. trait IndicScript extends AnyRef
  3. trait NativeIndicScript extends IndicScript
  4. trait RomanScript extends IndicScript

Value Members

  1. object as extends RomanScript

    Much Sanskrit in the text appears in the European Indological form, which is coded in pe.txt with the the AS (Anglicized Sanskrit) coding.

    Much Sanskrit in the text appears in the European Indological form, which is coded in pe.txt with the the AS (Anglicized Sanskrit) coding.

    The general AS scheme, as described in CDSL.pdf, uses Latin alphabetical letters 'x (a-z,A-Z), possibly with suffixed numbers; the letter-number combinations are, in the general scheme: x1 = macron x2 = dot below x3 = dot above x4 = accent aigu x5 = tilde x6 = dash below x7 = umlaut x10 = circonflex (hat) x11 = accent grave

    Here are the characters that occur in pe.txt in this coding, with their approximate frequency:

    A1 7721 := Ā (Ā) LATIN CAPITAL LETTER A WITH MACRON a1 67706 := ā (ā) LATIN SMALL LETTER A WITH MACRON d2 4974 := ḍ (ḍ) LATIN SMALL LETTER D WITH DOT BELOW D2 380 := Ḍ (Ḍ) LATIN CAPITAL LETTER D WITH DOT BELOW h2 439 := ḥ (ḥ) LATIN SMALL LETTER H WITH DOT BELOW H2 9 := Ḥ (Ḥ) LATIN CAPITAL LETTER H WITH DOT BELOW I1 1625 := Ī (Ī) LATIN CAPITAL LETTER I WITH MACRON i1 19497 := ī (ī) LATIN SMALL LETTER I WITH MACRON l2 2 := ḷ (ḷ) LATIN SMALL LETTER L WITH DOT BELOW m2 13 := ṃ (ṃ) LATIN SMALL LETTER M WITH DOT BELOW M3 180 := Ṁ (Ṁ) LATIN CAPITAL LETTER M WITH DOT ABOVE m3 2500 := ṁ (ṁ) LATIN SMALL LETTER M WITH DOT ABOVE N2 1010 := Ṇ (Ṇ) LATIN CAPITAL LETTER N WITH DOT BELOW n2 20671 := ṇ (ṇ) LATIN SMALL LETTER N WITH DOT BELOW N3 356 := Ṅ (Ṅ) LATIN CAPITAL LETTER N WITH DOT ABOVE n3 3161 := ṅ (ṅ) LATIN SMALL LETTER N WITH DOT ABOVE N5 197 := Ñ (Ñ) LATIN CAPITAL LETTER N WITH TILDE n5 2679 := ñ (ñ) LATIN SMALL LETTER N WITH TILDE R2 1625 := Ṛ (Ṛ) LATIN CAPITAL LETTER R WITH DOT BELOW r2 6630 := ṛ (ṛ) LATIN SMALL LETTER R WITH DOT BELOW S2 1027 := Ṣ (Ṣ) LATIN CAPITAL LETTER S WITH DOT BELOW s2 17116 := ṣ (ṣ) LATIN SMALL LETTER S WITH DOT BELOW S4 12639 := Ś (Ś) LATIN CAPITAL LETTER S WITH ACUTE s4 11533 := ś (ś) LATIN SMALL LETTER S WITH ACUTE T2 528 := Ṭ (Ṭ) LATIN CAPITAL LETTER T WITH DOT BELOW t2 5280 := ṭ (ṭ) LATIN SMALL LETTER T WITH DOT BELOW U1 614 := Ū (Ū) LATIN CAPITAL LETTER U WITH MACRON u1 4898 := ū (ū) LATIN SMALL LETTER U WITH MACRON

  2. object asTest
  3. object assamese extends NativeIndicScript
  4. object bengali extends NativeIndicScript
  5. object devanagarii extends NativeIndicScript
  6. object gujarati extends NativeIndicScript
  7. object gurmukhi extends NativeIndicScript
  8. object harvardKyoto extends RomanScript
  9. object harvardKyotoTest
  10. object iast extends IastBase
  11. object iastDcs extends IastBase
  12. object iastDcsTest
  13. object iastTest
  14. object kannaDa extends NativeIndicScript
  15. object kolkata extends IastBase
  16. object malayalam extends NativeIndicScript
  17. object optitrans extends RomanScript
  18. object optitransTest
  19. object oriya extends NativeIndicScript
  20. object slp extends RomanScript
  21. object slpTest
  22. object telugu extends NativeIndicScript
  23. object transliterator

    General transliteration utilities.

  24. object wx extends RomanScript
  25. object wxTest

Ungrouped