Croatian alphabet

Gaj's Latin alphabet (

  1. REDIRECT Template:Lang-hbs, or gajica) is the form of the Latin script used for Serbo-Croatian (Bosnian, Croatian, Serbian, and Montenegrin). It was devised by Croatian linguist Ljudevit Gaj in 1835, based on Jan Hus's Czech alphabet. A slightly reduced version is used as the script of the Slovene language, and a modified version is used for romanization of the Macedonian language. Pavao Ritter Vitezović had proposed an idea for the orthography of the Croatian language, stating that every sound should have only one letter; this idea inspired Croatian linguist Ljudevit Gaj, who reformed the Croatian variant of the Latin alphabet, introducing new letters based on the Czech alphabet. Today's Croatian alphabet is used in Croatia, Bosnia, Serbia and Montenegro. Many to this day think that it was originally Ljudevit Gaj's idea.[1]


The alphabet consists of thirty upper and lower case letters:

Letter IPA Letter IPA Letter IPA
A, a /a/ G, g /ɡ/ O, o /o/
B, b /b/ H, h /x/ P, p /p/
C, c /ts/ I, i /i/ R, r /r/
Č, č /tʃ/ J, j /j/ S, s /s/
Ć, ć /tɕ/ K, k /k/ Š, š /ʃ/
D, d /d/ L, l /l/ T, t /t/
, dž /dʒ/ Lj, lj /ʎ/ U, u /u/
Đ, đ /dʑ/ M, m /m/ V, v /ʋ/
E, e /e/ N, n /n/ Z, z /z/
F, f /f/ Nj, nj /ɲ/ Ž, ž /ʒ/

Gaj's original alphabet contained a digraph ⟨dj⟩, which was later replaced by the letter ⟨đ⟩.

The letters do not have names, and consonants are normally pronounced as such when spelling is necessary (or followed by a short schwa, e.g. /fə/). When clarity is needed, they are pronounced similar to the German alphabet: a, be, ce, če, će, de, dže, đe, e, ef, ge, ha, i, je, ka, el, elj, em, en, enj, o, pe, er, es, eš, te, u, ve, ze, že. These rules for pronunciation of individual letters are common as far as the 22 letters that match the ISO basic Latin alphabet are concerned. The use of others is mostly limited to the context of linguistics,[2][3] while in mathematics, ⟨j⟩ is commonly pronounced jot, as in German. The missing four letters are pronounced as follows: ⟨q⟩ as ku or kju, ⟨w⟩ as dublve or duplo ve, ⟨x⟩ as iks, ⟨y⟩ as ipsilon.


Note that the digraphs⟩, ⟨lj⟩, and ⟨nj⟩ are considered to be single letters. This means that:

  • In dictionaries, njegov comes after novine, in a separate ⟨nj⟩ section after the end of the ⟨n⟩ section, and bolje comes after bolnica, and so forth.
  • In vertical writing (such as on signs), ⟨dž⟩, ⟨lj⟩, ⟨nj⟩ are written horizontally, as a unit. For instance, if mjenjačnica ('Bureau de Change') is written vertically, ⟨nj⟩ appears on the fourth line (but note ⟨m⟩ and ⟨j⟩ appear separately on the first and second lines, respectively, because ⟨mj⟩ contains two letters, not one). In crossword puzzles, ⟨dž⟩, ⟨lj⟩, ⟨nj⟩ each occupy a single square.
  • In cases where words are written with a space between each letter (such as on signs), each of these letters is written together. For instance: M J E NJ A Č N I C A.
  • In cases where only the initial letter of a word is capitalized, only the first of the two component letters is capitalized: Njemačka and not NJemačka. In Unicode, the form ⟨Nj⟩ is referred to as titlecase, as opposed to the uppercase form ⟨NJ⟩, representing one of the few cases where titlecase and uppercase differ. Uppercase would be used if the entire word was capitalized: NJEMAČKA.


The Croatian Latin was mostly designed by Ljudevit Gaj, who modelled it after Czech and Polish, and invented ⟨lj⟩, ⟨nj⟩ and ⟨dž⟩. In 1830, he published in Buda the book Kratka osnova horvatsko-slavenskog pravopisanja ("Brief basics of the Croatian-Slavonic orthography"), which was the first common Croatian orthography book. It was not the first ever Croatian orthography work, as it was preceded by works of Rajmund Đamanjić (1639), Ignjat Đurđević and Pavao Ritter Vitezović. Croats had previously used the Latin script, but some of the specific sounds were not uniformly represented. Versions of the Hungarian alphabet were most commonly used, but others were too, in an often confused, inconsistent fashion.

Gaj followed the example of Pavao Ritter Vitezović and the Czech orthography, making one letter of the Latin script for each sound in the language. His alphabet mapped completely on Serbian Cyrillic which had been standardized by Vuk Karadžić a few years before.

Đuro Daničić suggested in his Rječnik hrvatskoga ili srpskoga jezika ("Dictionary of Croatian or Serbian language") published in 1880 that Gaj's digraphs ⟨dž⟩, ⟨dj⟩, ⟨lj⟩ and ⟨nj⟩ should be replaced by single letters : ⟨ģ⟩, ⟨đ⟩, ⟨ļ⟩ and ⟨ń⟩ respectively. The original Gaj alphabet was eventually revised, but only the digraph ⟨dj⟩ has been replaced with Daničić's ⟨đ⟩, while ⟨dž⟩, ⟨lj⟩ and ⟨nj⟩ have been kept.


In the 1990s, there was a general confusion about the proper character encoding to use to write text in Latin Croatian on computers.

  • An attempt was made to apply the 7-bit "YUSCII", later "CROSCII", which included the five letters with diacritics at the expense of five non-letter characters ([, ], {, }, @), but it was ultimately unsuccessful. Because the ASCII character @ sorts before A, this led to jokes calling it žabeceda (žaba=frog, abeceda=alphabet).
  • Other short-lived vendor-specific efforts were also undertaken.[which?]
  • The 8-bit ISO 8859-2 (Latin-2) standard was developed by ISO.
  • MS-DOS introduced 8-bit encoding CP852 for Central European languages, disregarding the ISO standard.
  • Microsoft Windows spread yet another 8-bit encoding called CP1250, which had a few letters mapped one-to-one with ISO 8859-2, but also had some mapped elsewhere.
  • Apple used yet another encoding called the Macintosh Central European encoding.
  • EBCDIC also has a Latin-2 encoding.[4]

The preferred character encoding for Croatian today is either the ISO 8859-2, or the Unicode encoding UTF-8 (with two bytes or 16 bits necessary to use the letters with diacritics). However, as of 2010, one can still find programs as well as databases that use CP1250, CP852 or even CROSCII.

Usage in the Slovene language

Since the early 1840s, Gaj's alphabet was increasingly used for the Slovene language. In the beginning, it was Slovene authors who treated Slovene as a variant of Croatian (such as Stanko Vraz) who most commonly used it, but it was later accepted by a large spectrum of Slovene-writing authors. The breakthrough came when the Slovene conservative leader Janez Bleiweis started using Gaj's script in his journal Kmetijske in rokodelske novice ("Agricultural and Artisan News)"), which was read by a wide public in the countryside. By 1850, Gaj's alphabet (known as gajica in Slovene) became the only official Slovene alphabet, replacing three other writing systems which circulated in the Slovene Lands since the 1830s: the traditional bohoričica (after its inventor Adam Bohorič) and the two innovative proposals by the Peter Dajnko (the dajnčica) and Franc Serafin Metelko (the metelčica).

The Slovene version of Gaj's alphabet differs from the Croatian one in the following traits:

  • the Slovene alphabet does not have the characters ⟨ć⟩ and ⟨đ⟩; the sounds these letters represent are not present in the Slovene language;
  • in the Slovene variant, the digraphs ⟨lj⟩ and ⟨nj⟩ are treated as two separate letters and represent separate sounds (e.g. the word polje is pronounced /polje/ in Slovene, as opposed to /poʎe/ in Croatian).
  • while the phoneme /dʒ/ exists in modern Slovene and is written ⟨dž⟩, it is only used in borrowed words, and ⟨d⟩ and ⟨ž⟩ are considered separate letters, not a digraph.

Slovene orthography is comparatively less phonetic than Serbo-Croatian. For instance, letter ⟨e⟩ can be pronounced in three ways (/e/, /ɛ/ and /ə/), and letter ⟨v⟩ in two (/ʋ/ and /w/). Also, it does not record consonant voicing assimilation: compare e.g. Slovene ⟨odpad⟩ and Serbo-Croatian ⟨otpad⟩ ('junkyard', 'waste').

Usage in the Macedonian

Romanization of Macedonian, according to academic sources,[5][6] is done according to Gaj's Latin alphabet. However, this alphabet is slightly modified according to the phonetics and phonology of the language. Therefore, Gaj's letters ć and đ are not used at all, but instead the letters and ǵ were introduced. The rest of the letters of the alphabet are used to represent the equivalent Cyrillic letters. Additionally to that, Macedonian uses the letter dz, which is not part of the Serbo-Croatian phonetic inventory. However, the backs of record sleeves published in the Yugoslavia era (e.g. Mizar's debut album) used ć and đ, among other sources.

See also



External links

  • Omniglot
This article was sourced from Creative Commons Attribution-ShareAlike License; additional terms may apply. World Heritage Encyclopedia content is assembled from numerous content providers, Open Access Publishing, and in compliance with The Fair Access to Science and Technology Research Act (FASTR), Wikimedia Foundation, Inc., Public Library of Science, The Encyclopedia of Life, Open Book Publishers (OBP), PubMed, U.S. National Library of Medicine, National Center for Biotechnology Information, U.S. National Library of Medicine, National Institutes of Health (NIH), U.S. Department of Health & Human Services, and, which sources content from all federal, state, local, tribal, and territorial government publication portals (.gov, .mil, .edu). Funding for and content contributors is made possible from the U.S. Congress, E-Government Act of 2002.
Crowd sourced content that is contributed to World Heritage Encyclopedia is peer reviewed and edited by our editorial staff to ensure quality scholarly research articles.
By using this site, you agree to the Terms of Use and Privacy Policy. World Heritage Encyclopedia™ is a registered trademark of the World Public Library Association, a non-profit organization.