package transliteration
Type Members
- trait IastBase extends RomanScript
- trait IndicScript extends AnyRef
- trait NativeIndicScript extends IndicScript
- trait RomanScript extends IndicScript
Value Members
- object as extends RomanScript
Much of the Sanskrit in the text appears in the European Indological form, which is encoded in pe.txt using the AS (Anglicized Sanskrit) coding.
The general AS scheme, as described in CDSL.pdf, uses a Latin letter x (a-z, A-Z), optionally followed by a numeric suffix. In the general scheme, the letter-number combinations are:
- x1 = macron
- x2 = dot below
- x3 = dot above
- x4 = acute accent
- x5 = tilde
- x6 = dash below
- x7 = umlaut
- x10 = circumflex (hat)
- x11 = grave accent
Here are the characters that occur in pe.txt in this coding, with their approximate frequency:
- A1 7721 := Ā LATIN CAPITAL LETTER A WITH MACRON
- a1 67706 := ā LATIN SMALL LETTER A WITH MACRON
- d2 4974 := ḍ LATIN SMALL LETTER D WITH DOT BELOW
- D2 380 := Ḍ LATIN CAPITAL LETTER D WITH DOT BELOW
- h2 439 := ḥ LATIN SMALL LETTER H WITH DOT BELOW
- H2 9 := Ḥ LATIN CAPITAL LETTER H WITH DOT BELOW
- I1 1625 := Ī LATIN CAPITAL LETTER I WITH MACRON
- i1 19497 := ī LATIN SMALL LETTER I WITH MACRON
- l2 2 := ḷ LATIN SMALL LETTER L WITH DOT BELOW
- m2 13 := ṃ LATIN SMALL LETTER M WITH DOT BELOW
- M3 180 := Ṁ LATIN CAPITAL LETTER M WITH DOT ABOVE
- m3 2500 := ṁ LATIN SMALL LETTER M WITH DOT ABOVE
- N2 1010 := Ṇ LATIN CAPITAL LETTER N WITH DOT BELOW
- n2 20671 := ṇ LATIN SMALL LETTER N WITH DOT BELOW
- N3 356 := Ṅ LATIN CAPITAL LETTER N WITH DOT ABOVE
- n3 3161 := ṅ LATIN SMALL LETTER N WITH DOT ABOVE
- N5 197 := Ñ LATIN CAPITAL LETTER N WITH TILDE
- n5 2679 := ñ LATIN SMALL LETTER N WITH TILDE
- R2 1625 := Ṛ LATIN CAPITAL LETTER R WITH DOT BELOW
- r2 6630 := ṛ LATIN SMALL LETTER R WITH DOT BELOW
- S2 1027 := Ṣ LATIN CAPITAL LETTER S WITH DOT BELOW
- s2 17116 := ṣ LATIN SMALL LETTER S WITH DOT BELOW
- S4 12639 := Ś LATIN CAPITAL LETTER S WITH ACUTE
- s4 11533 := ś LATIN SMALL LETTER S WITH ACUTE
- T2 528 := Ṭ LATIN CAPITAL LETTER T WITH DOT BELOW
- t2 5280 := ṭ LATIN SMALL LETTER T WITH DOT BELOW
- U1 614 := Ū LATIN CAPITAL LETTER U WITH MACRON
- u1 4898 := ū LATIN SMALL LETTER U WITH MACRON
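The letter-number scheme above lends itself to a mechanical decoding into Unicode. The following is a minimal sketch only, not this package's implementation: the object name `AsDecoderSketch` and its `decode` method are hypothetical, and the choice of combining marks (in particular U+0331 for "dash below") is an assumption. It attaches the combining mark for each suffix and lets NFC normalization compose the precomposed letter where one exists.

```scala
import java.text.Normalizer
import scala.util.matching.Regex

// Hypothetical helper, not part of the documented package.
object AsDecoderSketch {
  // AS numeric suffix -> Unicode combining mark (assumed correspondence).
  private val combiningMarks: Map[String, String] = Map(
    "1"  -> "\u0304", // macron
    "2"  -> "\u0323", // dot below
    "3"  -> "\u0307", // dot above
    "4"  -> "\u0301", // acute accent
    "5"  -> "\u0303", // tilde
    "6"  -> "\u0331", // dash below (assumed: combining macron below)
    "7"  -> "\u0308", // umlaut (diaeresis)
    "10" -> "\u0302", // circumflex
    "11" -> "\u0300"  // grave accent
  )

  // A Latin letter followed by one of the known suffixes.
  // 10 and 11 are listed first so they win over the single digit 1.
  private val asUnit: Regex = "([a-zA-Z])(10|11|[1-7])".r

  /** Replaces each letter-number unit by letter + combining mark, then composes with NFC. */
  def decode(text: String): String = {
    val decomposed = asUnit.replaceAllIn(
      text,
      m => Regex.quoteReplacement(m.group(1) + combiningMarks(m.group(2)))
    )
    Normalizer.normalize(decomposed, Normalizer.Form.NFC)
  }
}

object AsDecoderSketchDemo extends App {
  println(AsDecoderSketch.decode("s4a1stra")) // prints "śāstra"
  println(AsDecoderSketch.decode("Kr2s2n2a")) // prints "Kṛṣṇa"
}
```

Note that this sketch treats every letter directly followed by one of the listed numbers as an AS unit, so ordinary digits that happen to follow a letter in the text would be misread; a real decoder would need to know which spans are AS-coded.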
- object asTest
- object assamese extends NativeIndicScript
- object bengali extends NativeIndicScript
- object devanagarii extends NativeIndicScript
- object gujarati extends NativeIndicScript
- object gurmukhi extends NativeIndicScript
- object harvardKyoto extends RomanScript
- object harvardKyotoTest
- object iast extends IastBase
- object iastDcs extends IastBase
- object iastDcsTest
- object iastTest
- object kannaDa extends NativeIndicScript
- object kolkata extends IastBase
- object malayalam extends NativeIndicScript
- object optitrans extends RomanScript
- object optitransTest
- object oriya extends NativeIndicScript
- object slp extends RomanScript
- object slpTest
- object telugu extends NativeIndicScript
- object transliterator
General transliteration utilities.
- object wx extends RomanScript
- object wxTest