Nepali orthography: Summary and conventions
Words used in Nepali can be broadly categorized into two groups - Nepali words and Sanskrit loanwords used in Nepali. Sanskrit loanwords, called Tatsam, are words that have made it to the Nepali language without any modification. Nepali words include words, called Tadbhav, that originated from Sanskrit but which got modified as they finally made it to the Nepali language. Nepali words also include loanwords from sources other than Sanskrit.
An important orthographic distinction between these two groups is that the Nepali words are written using Nepali writing rules, whereas the Sanskrit loanwords are written the same way they are written in Sanskrit as they made it to the Nepali language without any modification. The Nepali writing rules, compared to its Sanskrit counterpart, is much simpler making it easy to write non-Sanskrit loanwords and words that came from Sanskrit with modification. In contrast to this, writing Sanskrit loanwords requires knowledge of additional rules as they have to be written the same way they are written in Sanskrit.
Nepali orthography has evolved over time. The conventions used at Nepali Language Resource Center are based on guidelines published by Nepal's language academy, Nepal Academy.
Raswa Dirgha in Nepali
A vowel can occur either as an independent vowel letter or as a dependent vowel sign in combination with a consonant letter. See Alphabet and Writing System
for more details. The two vowels /i/ and /u/ can also occur either as short vowels or long vowels. In Nepali, Raswa refers to the occurrence of such short vowels, इ, ि, उ and ु, and Dirgha to the occurrence of long vowels, ई, ी, ऊ and ू, in an orthographic syllable.
Using Nepali Spell Checker
To spell-check, paste your Nepali text into the editor box and click the "Check Spelling" button. Nepali Spell Checker will analyze your text and if it finds a misspelled term, either a word or a phrase, it will provide you with relevant suggestions. You can, then, either replace the misspelled term with one of the suggestions or ignore the suggestions and not make the change.
While replacing the misspelled term with one of the suggestions, you can do so for just that particular occurrence of the misspelled term or have the change applied to all occurrences of the misspelled term. In a similar fashion, you can ignore a particular occurrence of the term or ignore all occurrences of the term. The Replace All, or Ignore All, operation replaces, or ignores, the term from the current position of the term to the end of the text.
How suggestions are made
Suggestions offered by Nepali Spell Checker are generated by an algorithm, which is based on a number of factors including the entries in the official Nepali dictionary, Nepali grammar and orthography rules and the guidelines from Nepal Academy. The order in which the suggestions are presented is based on objective factors, such as popularity of the suggestions.
The approach taken to predict suggestions can be categorized broadly into two steps. The first step includes the term being looked up against a comprehensive set of words and phrases, which includes a varieties of entries, like the entries from the official Nepali dictionary, new words not yet included in the dictionary, proper nouns, etc. If a term matches an entry in the set, the spell checker learns that the term is correct and moves on to the next term.
If the term does not exist in the dictionary, then, as a part of the second step, it processes the term against the grammar and orthography rules. Often times, a typical text contains many valid terms, like inflections and derived words, etc., that are not root terms. The spell checker decomposes such a term into different components. A lookup is performed for each component, and if the lookup succeeds, i.e., if all components exist in the set, it finally checks to see if all the components relate to each other according to the grammar and orthography rules.
The word "Gharma," meaning "in the house" and containing the root word, Ghar, meaning "house" and the case ending Ma, meaning "in," can be taken as an example to illustrate the approach. In the first step, a lookup is made for the entire term. Even though the term is valid, the lookup fails because the entire term does not exist as a single entry in the set, even though the root word and the case ending exist as separate entries. Then the second step kicks in. The term is decomposed into the root term and the case ending, after which a lookup for both components is performed. The lookup succeeds and then it proceeds to check the grammar and orthography rules. It passes the grammar rule that states that a case ending can be applied to a noun and also passes the orthography rules that state that a case ending should appear after the root word and that the case ending and the root word should be written as one single word. The spell checker then flags the term as a correct term.
At this point, if the spell checker cannot determine that the term is correct, it looks for closest matches in the set and comes up with a list of suggestions ranked based on popularity and how closely they match the term. The suggestions are then presented to the user. The suggestions may be entries from the set or other forms of entries, e.g., auto generated inflections.
View Nepali Spell Checker Documentation
for additional information.
View Nepali Spell Checker Frequently Asked Questions