The first Slovak orthography was proposed by Anton Bernolák (1762–1813) in his Dissertatio philologico-critica de litteris Slavorum, used in the six-volume Slovak-Czech-Latin-German-Hungarian Dictionary (1825–1927) and used primarily by Slovak Catholics.
A combined character is considered different from the same character written with a single unicode character in string comparisons, and the two characters are considered to have a different length (for example, as returned by the CHARLENGTH function or in result set metadata).
The standard orthography of the Slovak language is immediately based on the standard developed by Ľudovít Štúr in 1844 and reformed by Martin Hattala in 1851 with the agreement of Štúr. The then-current (1840s) form of the central Slovak dialect was chosen as the standard. It uses the Latin script with small modifications that include the four diacritics (ˇ, ´, ¨, ˆ) placed above certain letters. After Hattala's reform, the Slovak language remained mostly unchanged.
- 2Sound–spelling correspondences
Alphabet[edit]
The Slovak alphabet is an extension of the Latin alphabet used for writing the Slovak language.
It has 46 letters which makes it the longest Slavic and European alphabet.
The 46 letters of the Slovak alphabet are:
Majuscule forms (also called uppercase or capital letters) | |||||||||||||||||||||||||||||||||||||||||||||
A | Á | Ä | B | C | Č | D | Ď | Dz | Dž | E | É | F | G | H | Ch | I | Í | J | K | L | Ĺ | Ľ | M | N | Ň | O | Ó | Ô | P | Q | R | Ŕ | S | Š | T | Ť | U | Ú | V | W | X | Y | Ý | Z | Ž |
Minuscule forms (also called lowercase or small letters) | |||||||||||||||||||||||||||||||||||||||||||||
a | á | ä | b | c | č | d | ď | dz | dž | e | é | f | g | h | ch | i | í | j | k | l | ĺ | ľ | m | n | ň | o | ó | ô | p | q | r | ŕ | s | š | t | ť | u | ú | v | w | x | y | ý | z | ž |
Letter | Letter name |
---|---|
A a | á |
Á á | dlhé á |
Ä ä | prehlasované á; a s dvoma bodkami; široké e |
B b | bé |
C c | cé |
Č č | čé |
D d | dé |
Ď ď | ďé |
Dz dz | dzé |
Dž dž | džé |
E e | é |
É é | dlhé é |
F f | ef |
G g | gé |
H h | há |
Ch ch | chá |
I i | í |
Í í | dlhé í |
J j | jé |
K k | ká |
L l | el |
Ĺ ĺ | dlhé el |
Ľ ľ | mäkké el |
M m | em |
N n | en |
Ň ň | eň |
O o | o |
Ó ó | ó |
Ô ô | ô |
P p | pé |
Q q1 | kvé |
R r | er |
Ŕ ŕ | dlhé er |
S s | es |
Š š | eš |
T t | té |
Ť ť | mäkké té |
U u | u |
Ú ú | dlhé ú |
V v | vé |
W w1 | dvojité vé |
X x1 | iks |
Y y | ypsilon |
Ý ý | dlhý ypsilon |
Z z | zet |
Ž ž | žet |
- The letters Q, W and X are only used in loanwords.
Sound–spelling correspondences[edit]
Grapheme | IPA (key) | Notes |
---|---|---|
a | /a/ | |
á | /aː/ | |
ä | /æ/ | |
e | /ɛ/ | |
é | /ɛː/ | |
i | /i/ | |
í | /iː/ | |
ia | /ɪ̯a/ | May be pronounced as two monophthongs /i.a/ in foreign words. |
ie | /ɪ̯ɛ/ | May be pronounced as two monophthongs /i.ɛ/ in foreign words. |
iu | /ɪ̯u/ | May be pronounced as two monophthongs /i.u/ in foreign words. |
o | /ɔ/ | |
ó | /ɔː/ | |
ô | /ʊ̯ɔ/ | |
u | /u/ | |
ú | /uː/ | |
y | /i/ | |
ý | /iː/ |
Grapheme | IPA (key) | Notes |
---|---|---|
b | /b/ | |
c | /t͡s/ | |
č | /t͡ʃ/ | |
d | /d/ | |
ď | /ɟ/ | |
dz | /d͡z/ | |
dž | /d͡ʒ/ | |
f | /f/ | |
g | /ɡ/ | |
h | /ɦ/ | |
ch | /x/ | |
j | /j/ | |
k | /k/ | |
l | /l/ | Can be syllabic /l̩/. |
ĺ | /l̩ː/ | |
ľ | /ʎ/ | |
m | /m/ | |
n | /n/ | Becomes [ŋ] before /k/ and /ɡ/. |
ň | /ɲ/ | |
p | /p/ | |
q | /kv/ | Only occurs in loanwords. |
r | /r/ | Can be syllabic /r̩/. |
ŕ | /r̩ː/ | |
s | /s/ | |
š | /ʃ/ | |
t | /t/ | |
ť | /c/ | |
v | /v/ | |
w | /v/ | Only occurs in loanwords. |
x | /ks/ | Only occurs in loanwords. |
z | /z/ | |
ž | /ʒ/ |
Some additional notes includes the following (transcriptions in IPA unless otherwise stated):
The primary principle of Slovak spelling is the phonemic principle. The secondary principle is the morphological principle: forms derived from the same stem are written in the same way even if they are pronounced differently. An example of this principle is the assimilation rule (see below). The tertiary principle is the etymological principle, which can be seen in the use of i after certain consonants and of y after other consonants, although both i and y are pronounced the same way.
Finally, the rarely applied grammatical principle is present when, for example, the basic singular form and plural form of masculine adjectives are written differently with no difference in pronunciation (e.g. pekný = nice – singular versus pekní = nice – plural).
Most foreign words receive Slovak spelling immediately or after some time. For example, 'weekend' is spelled víkend, 'software' - softvér, 'gay' - gej (both not exclusively), and 'quality' is spelled kvalita (possibly from Italianqualità). Personal and geographical names from other languages using Latin alphabets keep their original spelling unless a fully Slovak form of the name exists (e.g. Londýn for 'London').
To accelerate writing, a rule has been introduced that the frequent character combinations ďe, ťe, ňe, ľe, ďi, ťi, ňi, ľi, ďí, ťí, ňí, ľí, ďie, ťie, ňie, ľie, ďia, ťia, ňia, ľia are written without a caron de, te, ne, le, di, ti, ni, li, dí, tí, ní, lí, die, tie, nie, lie, dia, tia, nia, lia. These combinations are usually pronounced as if a caron were found above the consonant. Some exceptions are as follows:
- foreign words (e.g. telefón is pronounced with a hard t and a hard l)
- the following words: ten (that), jeden (one), vtedy (then), teraz (now)
- nominative masculine plural endings of pronouns and adjectives do not 'soften' preceding n, d, t, l (e.g. tí odvážni mladí muži/tiː ɔdvaːʒni mladiː muʒi/, the/those brave young men)
- short e in adjectival endings, which is derived from long é shortened by the 'rhythmical rule' (see below), does not 'soften' preceding n, d, t, l (e.g. krásne stromy/kraːsnɛ strɔmi/, beautiful trees, c.f. zelené stromy/zɛʎɛnɛː strɔmi/, green trees)
Slovak features some heterophonic homographs (words with identical spelling but different pronunciation and meaning), the most common examples being krásne/ˈkraːsnɛ/ (beautiful) versus krásne/ˈkraːsɲɛ/ (beautifully).
The letter ľ is nowadays pronounced by many speakers, particularly from western Slovakia, as a non-palatalized l.
When a voiced obstruent (b, d, ď, dz, dž, g, h, z, ž) is at the end of the word before a pause, it is pronounced as its voiceless counterpart (p, t, ť, c, č, k, ch, s, š, respectively). For example, pohyb is pronounced /pɔɦip/ and prípad is pronounced /priːpat/.
When 'v' is at the end of the syllable, it is pronounced as non-syllabic u [ʊ̯], with the exception of the position before 'n' or 'ň'. For example, kov[kɔʊ̯] (metal), kravský[kraʊ̯skiː] (cow - adjective), but povstať[pɔfstac] (uprise), because the 'v' is not at the end of the syllable (po-vstať) and hlavný[ɦlaʋniː] because 'v' stands before the 'n'.
The graphic group -ou (at the end of words) is pronounced [ɔʊ̯] but is not considered a diphthong. Its phonemic interpretation is /ɔv/.
Consonant clusters containing both voiced and voiceless elements are entirely voiced if the last consonant is a voiced one, or voiceless if the last consonant is voiceless. For example, otázka is pronounced /ɔtaːska/ and vzchopiť sa is pronounced [fsxɔpit sa]. This rule applies also over the word boundary. One example is as follows: prísť domov[priːzɟ dɔmɔʊ̯] (to come home) and viac jahôd[ʋɪ̯adz jaɦʊ̯ɔt] (more strawberries). The voiced counterpart of 'ch' /x/ is [ɣ], and the unvoiced counterpart of 'h' /ɦ/ is [x].
One of the most important changes in Slovak orthography in the 20th century was in 1953 when s began to be written as z where pronounced [z] in prefixes (e.g. smluva into zmluva as well as sväz into zväz). The phonemic principle has been given priority over the etymological principle in this case.
Rhythmical rule[edit]
A long syllable (that is, a syllable containing á, é, í, ý, ó, ú, ŕ, ĺ, ia, ie, iu, ô) cannot be followed by another long one in the next syllable within the same word. This rule has morphonemic implications for declension (e.g. žen-ám but tráv-am) and conjugation (e.g. nos-ím but súd-im). Several exceptions of this rule exist. It is typical of the literary Slovak language, and does not appear in Czech or in some Slovak dialects.
Diacritics[edit]
The acute mark (in Slovak 'dĺžeň', 'prolongation mark' or 'lengthener') indicates length (e.g. í = approximately [iː]). This mark may appear on any vowel except 'ä' (wide 'e', široké 'e' in Slovak). It may also appear above the consonants 'l' and 'r', indicating the long [lː] and [rː] sounds.
The circumflex ('vokáň') exists only above the letter 'o'. It turns the o into a diphthong (see below).
The umlaut ('prehláska', 'dve bodky' = two dots) is only used above the letter 'a'. It indicates a raised vowel, almost an 'e', similar to German ä.
The caron (in Slovak 'mäkčeň', 'palatalization mark' or 'softener') indicates a change of alveolar fricatives into either post-alveolar or palatal consonants, in informal Slovak linguistics often called just 'palatalization'. Eight consonants can bear a caron. Not all 'normal' consonants have a 'caroned' counterpart:
- In printed texts, the caron is printed in two forms: (1) č, dž, š, ž, ň and (2) ľ, ď, ť (looking more like an apostrophe), but this is just a convention. In handwritten texts, it always appears in the first form.
- Phonetically, two forms of 'palatalization' exist: ľ, ň, ď, ť are palatal consonants, while č, dž, š, ž are postalveolar affricates and fricatives.
Computer encoding[edit]
The Slovak alphabet is available within the ISO/IEC 8859-2 “Latin-2” encoding, which generally supports Eastern European languages. All vowels, but none of the specific consonants (that is, no č, ď, ľ, ĺ, ň, ŕ, š, ť, ž) are available within the “Latin-1” encoding, which generally supports only Western European languages.
See also[edit]
References[edit]
- ^Kráľ, Ábel (1988). Pravidlá slovenskej výslovnosti. Bratislava: Slovenské pedagogické nakladateľstvo.
I got the answer to my encoding problem which I faced last week.
Here is explanation....
By Default the Excel file encoding being set to Windows-1252.Windows-1252 [This Codepage is designed for Latin 1 languages: Afrikaans, Basque, Catalan, Danish, Dutch, English, Faroese, Finnish, French, Galician, German, Icelandic, Indonesian, Italian, Malay, Norwegian, Portuguese, Spanish, Swahili, Swedish.]
When you select/change polish/ Czech language from regional setting, XP automatically set code page to windows-1250. Windows-1250[This Codepage is designed for Central Europe languages: Albanian, Croatian, Czech, Hungarian, Polish, Romanian, Serbian (Latin), Slovak, Slovenian.]
You can change the system code page using the Control Panel. The following instructions are for English Windows XP.
• Start the Control Panel, and open Regional and Language Options. On the Regional Options tab, specify information in the Standards and formats box and in the Location box to match the target language and country.
• On the Advanced tab, specify settings in the Language for non-Unicode programs boxes to match the target language.
• Click OK. You might need to insert the operating system CD to install the necessary files. If you already have the files installed, the system prompts you to use the existing files. Click Yes. Next, the system prompts you to restart the computer. Click Yes. You must reboot. Otherwise, the new system code page won't be effective.
So problem here is due to configuration setting of PC. Now I can able to see the CSV file in Excel with proper text after select polish language.
I really thanks to all of you for kind help.