https://bugzilla.redhat.com/show_bug.cgi?id=1427550
--- Comment #8 from Shriramana Sharma samjnaa@gmail.com ---
The OP has pinged me many times via direct mail but being busy I wasn't able to respond to this issue till today.
First, to keep things formally correct, the OP should submit a request to Unicode to allot a ScriptExtension ("SE") of Taml to any characters he wishes to use with the Tamil script (see http://www.unicode.org/Public/UNIDATA/ScriptExtensions.txt). Just adding the characters to Lohit Tamil along with appropriate OT tables may work, but might not be portable.
The OP is advised to use http://www.unicode.org/reporting.html or better still submit a document as per http://www.unicode.org/pending/docsubmit.html with appropriate attestations for the usages he asks to be supported.
(In reply to Seshadri N from comment #0)
Extensions: Unicode U+1CD0 series.
Not all of these characters are attested to be used with Tamil.
Some such requests that I have come across in my FB and other groups:
- How can I include the signs like lines above the letters as shown in the
picture. This is required to show the swaram i.e. high or low pitch of pronouncing the letter/word as in Sanskrit. This kind of lines are used in Sanskrit also.
The characters attested to be used with Tamil are 0951, 0952 and 1CDA and they are already allotted ScriptExtensions for Tamil.
0951 ; Beng Deva Gran Gujr Guru Knda Latn Mlym Orya Shrd Taml Telu # Mn DEVANAGARI STRESS SIGN UDATTA
0952 ; Beng Deva Gran Gujr Guru Knda Latn Mlym Orya Taml Telu # Mn DEVANAGARI STRESS SIGN ANUDATTA
1CDA ; Deva Knda Mlym Taml Telu # Mn VEDIC TONE DOUBLE SVARITA
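Whether a character already carries an SE of Taml can be checked mechanically. The following is only a rough sketch: it parses the three entries quoted above (not the full ScriptExtensions.txt), and the function names parse_script_extensions and allowed_with are made up for illustration.

```python
# Entries copied from the ScriptExtensions.txt excerpt quoted in this comment.
# This is NOT the full data file, only the three lines cited above.
SAMPLE = """\
0951 ; Beng Deva Gran Gujr Guru Knda Latn Mlym Orya Shrd Taml Telu # Mn DEVANAGARI STRESS SIGN UDATTA
0952 ; Beng Deva Gran Gujr Guru Knda Latn Mlym Orya Taml Telu # Mn DEVANAGARI STRESS SIGN ANUDATTA
1CDA ; Deva Knda Mlym Taml Telu # Mn VEDIC TONE DOUBLE SVARITA
"""


def parse_script_extensions(text):
    """Map each code point to the set of script codes listed for it.

    Hypothetical helper: handles 'XXXX ; Scr Scr # comment' lines and
    'XXXX..YYYY' ranges, and skips blank or comment-only lines.
    """
    table = {}
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop the trailing comment
        if not line:
            continue
        code_field, scripts_field = (p.strip() for p in line.split(";", 1))
        scripts = frozenset(scripts_field.split())
        if ".." in code_field:
            first, last = code_field.split("..")
        else:
            first = last = code_field
        for cp in range(int(first, 16), int(last, 16) + 1):
            table[cp] = scripts
    return table


def allowed_with(table, cp, script="Taml"):
    """True if the parsed data lists `script` for code point `cp`."""
    return script in table.get(cp, frozenset())


table = parse_script_extensions(SAMPLE)
for cp in (0x0951, 0x0952, 0x1CDA):
    print(f"U+{cp:04X}", "Taml" if allowed_with(table, cp) else "no Taml entry")
```

A code point missing from this sample says nothing about the real file. To check other characters, feed the function the text of the current ScriptExtensions.txt instead.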
- I need to type musical notations in Tamil Using MSword processor. To
indicate octave we put a dot on top or bottom of the letter. how to add ?
--- My answer for above point 2: dot above = anuswara, dot below = nukta, Lohit Tamil does not contain nukta, but includes anuswara (U+0B82). ---
Nuktas and anusvaras form part of Indic syllabic structure. Musical notations are outside syllable structure. Combining characters from the 03xx range should be used instead.
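As a rough illustration of the 03xx approach: the sketch below assumes U+0307 COMBINING DOT ABOVE and U+0323 COMBINING DOT BELOW are the intended marks (the specific code points are my assumption, not something attested here), and with_octave is a made-up helper name.

```python
import unicodedata

# Assumed choice of generic combining marks from the 03xx block.
DOT_ABOVE = "\u0307"  # COMBINING DOT ABOVE
DOT_BELOW = "\u0323"  # COMBINING DOT BELOW


def with_octave(swara, octave):
    """Append a generic combining dot to a swara letter.

    Hypothetical helper: octave > 0 adds a dot above, octave < 0 adds a
    dot below, and 0 leaves the letter unchanged.
    """
    if octave > 0:
        return swara + DOT_ABOVE
    if octave < 0:
        return swara + DOT_BELOW
    return swara


SA = "\u0bb8"  # TAMIL LETTER SA, used here as an example swara letter

for octave in (-1, 0, 1):
    text = with_octave(SA, octave)
    # Show the code point sequence so the result is visible even when the
    # font in use cannot position the mark.
    print(octave, " ".join(f"U+{ord(ch):04X} {unicodedata.name(ch)}" for ch in text))
```

Whether the dots are positioned correctly still depends on the font providing mark positioning for these combining characters on Tamil base letters.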
Separately, note that single- and double-dot nuktas are attested for Tamil and need to be added to Lohit Tamil:
http://www.unicode.org/L2/L2015/15256-tamil-nukta.pdf
Of these the double-dot nukta from Grantha is allotted SE for Tamil:
1133C ; Gran Taml # Mn GRANTHA SIGN NUKTA
and the single-dot nukta is in the pipeline to be encoded in the Grantha block with Script=Inherited (which means it can be used with all scripts).
U+0310 COMBINING CANDRABINDU >>> replace with VEDIC TONE CANDRABINDU / anunasika U+0901, that's in Devanagari range, to be consistent with all other scripts.
In fact, it would seem Unicode advises using Grantha characters for Tamil rather than Devanagari ones (see the visarga item below). Thus U+11301 GRANTHA SIGN CANDRABINDU would be the appropriate character.
Visarga U+0903 ( as separate and distinct, in addition to Tamil Visarga U+0B83 )
UTC advises to use U+11303 Grantha Sign Visarga:
11303 ; Gran Taml # Mc GRANTHA SIGN VISARGA
AVAGRAHA U+093D
I suppose U+1133D Grantha Sign Avagraha would be appropriate.
CANDRABINDU VIRAMA U+A8F3
Already in SE:
A8F3 ; Deva Taml # Lo DEVANAGARI SIGN CANDRABINDU VIRAMA
NUKTA U+093C
VEDIC TONE UDATTA / SVARITA U+0951
VEDIC TONE ANUDATTA U+0952
VEDIC TONE DOUBLE/Deerga SVARITA U+1CDA
See above.
VEDIC TONE PRENKHA U+1CD2
VEDIC TONE CANDRA U+1CF4
Please provide attestation as to where these are used with Tamil.